Jorjani Biomedicine Journal

Showing 2 results for Clustering

Use of data mining algorithms in assessing the affecting factors on predicting the health status of newborns

Fatemeh Bagheri, Hakimeh Alizadeh Majd, Zahra Mehrbakhsh, Majid Ziaratban
Volume 2, Issue 2 (10-2014)

Abstract

Background & Objective: Predicting the health status of newborns and identifying the factors that affect it are of the utmost importance. There are different ways of making such predictions. In this study, effective models and patterns were studied using decision tree algorithms.

Method: This study was conducted on 1,668 childbirths in three hospitals (Shohada, Omidi and Mehr) in the city of Behshahr. The following variables were chosen as predictive factors for the decision tree classification: baby's gender, birth weight, birth order, maternal age, maternal history of illness, gestational diseases, type of delivery, reason for caesarean section, family relationship of father and mother, mother's blood type, mother's occupation and blood pressure, and place of residence. The health status of the baby was used as a binary dependent variable. All variables were used in clustering and association rule mining. Prediction was performed with four decision tree algorithms, and the results were compared.

Results: In the clustering method, the optimal number of clusters was determined to be 8 using the Dunn index. Among the implemented algorithms (CART, QUEST, CHAID and C5.0), C5.0, with a detection rate of 94.44%, was identified as the best. Using the Apriori algorithm, strong association rules were extracted subject to thresholds for Support and Confidence. Maternal age, birth weight and reason for caesarean section had the highest impact and were found to be the most important factors in the prediction.

Conclusion: Because the decision tree is simple to interpret and the rules extracted from it are easy to understand, this model can be used by most individuals, including professionals and pregnant women, at different levels.
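The Support and Confidence thresholds mentioned in the results can be illustrated with a small sketch. The records, item names and threshold values below are hypothetical and are not taken from the study. The rule search is a brute-force pass over item pairs, not the Apriori candidate-pruning procedure itself; it only shows how the two measures filter rules.

```python
from itertools import combinations

# Hypothetical toy records (not study data): one set of attribute values per birth.
records = [
    {"age>=35", "low_weight", "unhealthy"},
    {"age>=35", "low_weight", "unhealthy"},
    {"age<35", "normal_weight", "healthy"},
    {"age<35", "normal_weight", "healthy"},
    {"age>=35", "normal_weight", "healthy"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Support of antecedent and consequent together, divided by support of antecedent."""
    both = set(antecedent) | set(consequent)
    return support(both, transactions) / support(antecedent, transactions)

def strong_rules(transactions, min_support, min_confidence):
    """Brute-force search for one-item -> one-item rules that pass both thresholds."""
    items = sorted(set().union(*transactions))
    rules = []
    for a, b in combinations(items, 2):
        for lhs, rhs in ((a, b), (b, a)):
            s = support({lhs, rhs}, transactions)
            if s < min_support:
                continue
            c = confidence({lhs}, {rhs}, transactions)
            if c >= min_confidence:
                rules.append((lhs, rhs, s, c))
    return rules

# Threshold values chosen only for illustration.
for lhs, rhs, s, c in strong_rules(records, min_support=0.4, min_confidence=0.9):
    print(f"{lhs} -> {rhs}  support={s:.2f}  confidence={c:.2f}")
```

A rule is kept only when it is both frequent enough (support) and reliable enough (confidence); raising either threshold shrinks the rule set.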

Determination of the Distribution Pattern of Mortality Using Data Mining Technique in Golestan Province since 2007 to 2009

Fatemeh Bagheri, , ,
Volume 3, Issue 2 (10-2015)

Abstract

Background and objectives: Investigating mortality in a population is considered one of the appropriate methods of assessing its health. However, there are problems such as a lack of confidence in measurement accuracy and in the quality of data collection. The establishment of death registration systems, the use of international classification codes of diseases, and the integration of mortality data by the responsible organizations have solved a large part of these problems. In this study, based on a set of parameters, the study population was divided into two groups: those who died under one year of age (infants) and those who died over one year of age (adults). Both groups were then clustered using the K-means method to identify different groups. Hidden models and useful patterns were also discovered using decision tree algorithms. Finally, a neural network algorithm was used to rank the attributes in order of importance.

Methods: This research studied data on 12,865 deceased individuals in Golestan province from 2007 to 2009. The data were obtained from the Health Center of Golestan province. The main characteristics used were the age of the deceased, gender, cause of death, place of residence and place of death. The K-means algorithm was used to cluster the data. Decision tree algorithms and a neural network algorithm were used for classification. Finally, results and rules were extracted. Because causes of death differ in nature between infants and adults, the two groups were studied separately.
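As a rough illustration of the K-means clustering step, the following is a minimal sketch of Lloyd's algorithm in plain Python. It assumes the records have already been encoded as numeric tuples (the abstract does not say how categorical attributes such as gender or cause of death were encoded), and it uses the first k points as initial centroids purely to keep the example deterministic. The sample points are hypothetical.

```python
import math

def kmeans(points, k, iterations=100):
    """Lloyd's algorithm: alternate nearest-centroid assignment and centroid update."""
    # Assumption for reproducibility: seed the centroids with the first k points.
    centroids = [tuple(p) for p in points[:k]]
    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        new_centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
        if new_centroids == centroids:  # converged: no centroid moved
            break
        centroids = new_centroids
    labels = [min(range(k), key=lambda i: math.dist(p, centroids[i])) for p in points]
    return centroids, labels

# Hypothetical numeric records forming two well-separated groups.
sample = [(0.0, 0.0), (10.0, 10.0), (0.0, 1.0), (10.0, 11.0), (1.0, 0.0), (11.0, 10.0)]
centroids, labels = kmeans(sample, k=2)
print(labels)
```

In practice the initial centroids are usually chosen at random or with a seeding heuristic, and the algorithm is run several times, because the result depends on the starting points.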

Results: In the clustering phase, the optimal number of clusters was obtained with the Dunn index: eight clusters for infants and seven for adults. Among four decision tree algorithms (C5.0, QUEST, CHAID and CART), C5.0 was the best classifier, with correct classification rates of 77.37% on the infant data and 96.86% on the adult data. Age, gender and place of death were the most important variables detected by the neural network algorithm.
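The Dunn index used to choose the number of clusters can be sketched as follows. This assumes the common definition (smallest distance between points in different clusters, divided by the largest distance between points in the same cluster) and Euclidean distance; the abstract does not state which variant or distance measure was used. The example partitions are hypothetical.

```python
import math
from itertools import combinations

def dunn_index(clusters):
    """Minimum inter-cluster distance divided by maximum intra-cluster diameter.

    clusters is a list of lists of numeric points. Assumes at least two clusters
    and at least one cluster with two or more distinct points.
    """
    min_separation = min(
        math.dist(p, q)
        for a, b in combinations(clusters, 2)
        for p in a
        for q in b
    )
    max_diameter = max(
        math.dist(p, q)
        for cluster in clusters
        for p, q in combinations(cluster, 2)
    )
    return min_separation / max_diameter

# Two hypothetical partitions of the same four points.
compact = [[(0, 0), (0, 1)], [(10, 0), (10, 1)]]   # tight, well-separated clusters
stretched = [[(0, 0), (10, 0)], [(0, 1), (10, 1)]]  # wide, overlapping clusters
print(dunn_index(compact), dunn_index(stretched))
```

A higher value indicates compact, well-separated clusters, so one way to pick the number of clusters is to run the clustering for several candidate values of k and keep the one with the largest index.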

Conclusion: In the present study, the collected mortality data were clustered by considering the effective factors and the International Classification of Diseases standard. The hidden patterns of mortality for infants and adults were extracted. Because decision tree algorithms are explicit and easy to understand, the results and extracted rules are very useful for specialists in this field.
