Show simple item record

Random Forest Classification of Acute Coronary Syndrome

dc.creatorVanHouten, Jacob Paul
dc.date.accessioned2020-08-23T16:04:55Z
dc.date.available2015-12-16
dc.date.issued2013-12-16
dc.identifier.urihttps://etd.library.vanderbilt.edu/etd-12022013-152619
dc.identifier.urihttp://hdl.handle.net/1803/15043
dc.description.abstractCoronary artery disease (CAD) is the leading cause of death worldwide. Acute coronary syndromes (ACS), a subset of CAD, account for 1.4 million hospitalizations $165 billion in costs in the United States alone. A major challenge to the physician when diagnosing and treating patients with suspected ACS is that there is significant overlap between patients with and without ACS. There is a high cost to missing a diagnosis of ACS, but also a high cost to inappropriate treatment of patients without ACS. American College of Cardiology/American Heart Association guidelines recommend early risk stratification of patients to determine their likelihood of major adverse events, but many individual tests and prognostic indices lack sufficient performance characteristics for use in clinical practice. Prognostic indices specifically are often not representative of the population on which they are used and rely on complete and accurate data. We explored the use of state-of-the-art machine learning techniques random forest and elastic net on 23,576 records from the Synthetic Derivative to develop models with better performance characteristics than previously established prognostic indices in determining the risk of ACS for patients presenting with suspicious symptoms. We bootstrapped the process of model creation, and found that the random forest significantly outperformed elastic net, L2 regularized regression, and the previously-developed TIMI and GRACE scores. We also assessed the model calibration for the random forest and explored methods of correction. Our preliminary findings suggest that machine learning applied to noisy and largely missing data can still perform as well or better than previously developed scoring metrics.
dc.format.mimetypeapplication/pdf
dc.subjectprediction
dc.subjectmachine learning
dc.subjectcardiovascular
dc.titleRandom Forest Classification of Acute Coronary Syndrome
dc.typethesis
dc.contributor.committeeMemberNancy M. Lorenzi
dc.contributor.committeeMemberJohn M. Starmer, MD
dc.contributor.committeeMemberDavid J. Maron, MD
dc.type.materialtext
thesis.degree.nameMS
thesis.degree.levelthesis
thesis.degree.disciplineBiomedical Informatics
thesis.degree.grantorVanderbilt University
local.embargo.terms2015-12-16
local.embargo.lift2015-12-16
dc.contributor.committeeChairThomas A. Lasko, MD, PhD


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record