Data mining syndromes

January 11, 2019 by David Bradley, Inderscience

With every news story, the concepts of data mining healthcare information move higher still up the research and policy agenda in this area. Clinical information and genetic data contained within electronic health records (EHRs) represents a major source of useful information for biomedical research but accessing it in a useful way can be difficult.

Writing in the International Journal of Intelligent Engineering Informatics, Hassan Mahmoud and Enas Abbas of Benha University and Ibrahim Fathy Ain Shams University, in Egypt, discuss the need for innovative and effective methods for representing this huge amount of data. They point out that there are data mining techniques as well as ontology-based techniques that can play a major role in detecting syndromes in patients efficiently and accurately. A syndrome is defined as a set of concomitant medical symptoms and indicators associated with a given disease or disorder.

The team has reviewed the state of the art and also focused on reviewing the well-known such as decision trees (J48), Naïve Bayes, multi-layer perceptron (MLP), and random forest (RF) techniques and compared how well they each perform in the classification of a particular syndrome, heart disease.

The team concludes that in experiments with a public data set, the RF classifier provides the best performance in terms of accuracy. In the future, they suggest that data mining will benefit healthcare and medicine significant for building a system able to detect a specific syndrome.

Explore further: A new approach for software fault prediction using feature selection

More information: Hassan Mahmoud et al. Data mining and ontology-based techniques in healthcare management, International Journal of Intelligent Engineering Informatics (2018). DOI: 10.1504/IJIEI.2018.096549

Related Stories

How is big data impacting sports analytics?

December 20, 2018

Sports in all its forms, from Major League Baseball to Fantasy Football is driven by and produces huge amounts of data, and advanced data mining and machine learning techniques are now having a major impact on sports data ...

Seeing the forest and the trees reveals heart problems

July 26, 2010

A statistical analysis of publicly available heart rate data using three classification tools - Random Forests, Logistic Model Tree and Neural Network - could lead to a rapid and precise way to diagnose heart problems, according ...

Recommended for you

Technology near for real-time TV political fact checks

January 18, 2019

A Duke University team expects to have a product available for election year that will allow television networks to offer real-time fact checks onscreen when a politician makes a questionable claim during a speech or debate.

Afromontane forests and climate change

January 17, 2019

In the world of paleoecology, little has been known about the historical record of ecosystems in the West African highlands, especially with regard to glacial cycles amidst a shifting climate and their effects on species ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.