Future medical conditions predicted with new statistical model

Jun 04, 2012

Analyzing medical records from thousands of patients, statisticians have devised a statistical model for predicting what other medical problems a patient might encounter.

Like how Netflix recommends movies and TV shows or how Amazon.com suggests products to buy, the algorithm makes predictions based on what a patient has already experienced as well as the experiences of other patients showing a similar .

"This provides physicians with insights on what might be coming next for a patient, based on experiences of other patients. It also gives a predication that is interpretable by patients," said Tyler McCormick, an assistant professor of statistics and sociology at the University of Washington.

The algorithm will be published in an upcoming issue of the journal Annals of Applied Statistics. McCormick's co-authors are Cynthia Rudin, Massachusetts Institute of Technology, and David Madigan, Columbia University.

McCormick said that this is one of the first times that this type of predictive algorithm has been used in a medical setting. What differentiates his model from others, he said, is that it shares information across patients who have similar . This allows for better predictions when details of a patient's medical history are sparse.

For example, new patients might lack a lengthy file listing ailments and drug prescriptions compiled from previous doctor visits. The algorithm can compare the patient's current health complaints with other patients who have a more extensive medical record that includes similar symptoms and the timing of when they arise. Then the algorithm can point to what medical conditions might come next for the new patient.

"We're looking at each sequence of symptoms to try to predict the rest of the sequence for a different patient," McCormick said. If a patient has already had dyspepsia and epigastric pain, for instance, heartburn might be next.

The algorithm can also accommodate situations where it's statistically difficult to predict a less common condition. For instance, most patients do not experience strokes, and accordingly most models could not predict one because they only factor in an individual patient's medical history with a stroke. But McCormick's model mines medical histories of patients who went on to have a stroke and uses that analysis to make a stroke prediction.

The used obtained from a multiyear clinical drug trial involving tens of thousands of patients aged 40 and older. The records included other demographic details, such as gender and ethnicity, as well as patients' histories of medical complaints and prescription medications.

They found that of the 1,800 in the dataset, most of them – 1,400 – occurred fewer than 10 times. McCormick and his co-authors had to come up with a statistical way to not overlook those 1,400 conditions, while alerting patients who might actually experience those rarer conditions.

They came up with a statistical modeling technique that is grounded in Bayesian methods, the backbone of many predictive algorithms. McCormick and his co-authors call their approach the Hierarchical Association Rule Model and are working toward making it available to and doctors.

"We hope that this model will provide a more patient-centered approach to medical care and to improve patient experiences," McCormick said.

Explore further: New study utilizes Kinect for Windows technology to teach elementary school students geometry

More information: Download the Annals of Applied Statistics paper from McCormick's website: www.stat.washington.edu/~tylermc/

add to favorites email to friend print save as pdf

Related Stories

Making health information technology more patient-centered

Jan 18, 2011

Personal health records have great potential to help patients manage their health, but technology needs to be designed with the patient in mind – which means doing more than helping patients access health information, ...

Predicting risk of stroke from one's genetic blueprint

Feb 25, 2009

A new statistical model could be used to predict an individual's lifetime risk of stroke, finds a study from the Children's Hospital Informatics Program (CHIP). Using genetic information from 569 hospital patients, the researchers ...

Recommended for you

Satire has a history of informing during times of crisis

1 hour ago

Just as only the jester can tell the King the truth, satire performs a vital function in democratic society by using humor to broach taboo subjects, especially in times of crisis, according to a book by Penn State researchers.

Long-necked 'dragon' discovered in China

15 hours ago

University of Alberta paleontologists including PhD student Tetsuto Miyashita, former MSc student Lida Xing and professor Philip Currie have discovered a new species of a long-necked dinosaur from a skeleton ...

The largest known muntiacine found in China

15 hours ago

Dr. HOU Sukuan from the Institute of Vertebrate Paleontology and Paleoanthropology (IVPP), Chinese Academy of Sciences reported a new species of muntiacine Euprox in the journal of Zootaxa 3911 (1) recent ...

User comments : 0

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.