Open Access System for Information Sharing

Login Library

 

Article
Cited 86 time in webofscience Cited 0 time in scopus
Metadata Downloads

APPLICATION OF IRREGULAR AND UNBALANCED DATA TO PREDICT DIABETIC NEPHROPATHY USING VISUALIZATION AND FEATURE SELECTION METHODS SCIE SCOPUS

Title
APPLICATION OF IRREGULAR AND UNBALANCED DATA TO PREDICT DIABETIC NEPHROPATHY USING VISUALIZATION AND FEATURE SELECTION METHODS
Authors
Cho, BHYu, HKim, KWKim, THKim, IYKim, SI
Date Issued
2008-01
Publisher
ELSEVIER SCIENCE BV
Abstract
Objective: Diabetic nephropathy is damage to the kidney caused by diabetes mellitus. It is a common complication and a leading cause of death in people with diabetes. However, the decline in kidney function varies considerably between patients and the determinants of diabetic nephropathy have not been clearly identified. Therefore, it is very difficult to predict the onset of diabetic nephropathy accurately with simple statistical approaches such as t-test or chi(2)-test. To accurately predict the onset of diabetic nephropathy, we applied various machine Learning techniques to irregular and unbalanced diabetes dataset, such as support vector machine (SVM) classification and feature selection methods. Visualization of the risk factors was another important objective to give physicians intuitive information on each patient's clinical pattern. Methods and materials: We collected medical data from 292 patients with diabetes and performed preprocessing to extract 184 features from the irregular data. To predict the onset of diabetic nephropathy, we compared several classification methods such as logistic regression, SVM, and SVM with a cost sensitive learning method. We also applied several feature selection methods to remove redundant features and improve the classification performance. For risk factor analysis with SVM classifiers, we have developed a new visualization system which uses a nomogram approach. Results: Linear SVM classifiers combined with wrapper or embedded feature selection methods showed the best results. Among the 184 features, the classifiers selected the same 39 features and gave 0.969 of the area under the curve by receiver operating characteristics analysis. The visualization tool was able to present the effect of each feature on the decision via graphical output. Conclusions: Our proposed method can predict the onset of diabetic nephropathy about 2-3 months before the actual diagnosis with high prediction performance from an irregular and unbalanced dataset, which statistical methods such as t-test and logistic regression could not achieve. Additionally, the visualization system provides physicians with intuitive information for risk factor analysis. Therefore, physicians can benefit from the automatic early warning of each patient and visualize risk factors, which facilitate planning of effective and proper treatment strategies. (C) 2007 Elsevier B.V. All rights reserved.
Keywords
decision support systems; diabetic nephropathy; support vector machines; visualization; risk factor analysis; feature selection; TEMPORAL ABSTRACTION; ESSENTIAL-HYPERTENSION; BLOOD-PRESSURE; FOLLOW-UP; MELLITUS; DIAGNOSIS; DISEASE; MICROALBUMINURIA; CLASSIFICATION; COMPLICATIONS
URI
https://oasis.postech.ac.kr/handle/2014.oak/28728
DOI
10.1016/J.ARTMED.200
ISSN
0933-3657
Article Type
Article
Citation
ARTIFICIAL INTELLIGENCE IN MEDICINE, vol. 42, no. 1, page. 37 - 53, 2008-01
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

유환조YU, HWANJO
Dept of Computer Science & Enginrg
Read more

Views & Downloads

Browse