Evaluation of machine learning methods for prognostic and adverse event risk in Major Depressive Disorder

Jones, Barrett

Evaluation of machine learning methods for prognostic and adverse event risk in Major Depressive Disorder

dc.creator	Jones, Barrett
dc.date.accessioned	2024-05-15T17:42:10Z
dc.date.available	2024-05-15T17:42:10Z
dc.date.created	2024-05
dc.date.issued	2024-02-29
dc.date.submitted	May 2024
dc.identifier.uri	http://hdl.handle.net/1803/19010
dc.description.abstract	Clinical risk prediction models trained on electronic health record (EHR) data can support clinicians in decision making by estimating prognostic risk. Major Depressive Disorder (MDD) risk modeling presents particular challenges due to data sparsity and a developing understanding of risk factors. Autoencoders are an unsupervised learning method that have potential to account for some of the data challenges by learning feature dependencies, reducing dimensionality, and denoising. We evaluated two use cases for autoencoders—feature embedding and pretraining—and found that pretrained models outperform comparable models from the literature. Treatment decisions often involve switching antidepressants and the safety of this practice is an important consideration in treatment decisions. EHR data reflect clinical practice and can be used to measure adverse event risk, but the observational nature of EHR data can result in biased analyses. We evaluated two common methods for treatment effect estimation—propensity score weighting and propensity score matching—and used control outcomes to measure bias and calibrate effect estimates of adverse event risk. We found propensity score matching estimates were more robust, potentially due to clinical expertise informing selection of the study population. Prior research suggests that adverse event risk may be heterogeneous. We apply our finding of bias reduction in the matched population to estimation of heterogenous treatment effects (HTE). First, we used semi-synthetic outcomes to measure model performance under varying data generating processes. Followed by significance testing to identify adverse event HTE. Analysis of semi-synthetic and adverse event outcomes identified limited evidence for heterogeneity and highlights steps necessary to implement clinically useful HTE models. Our work improved prognostic modeling, showed evidence for robust adverse event effect estimation, and provided insight into variation in HTE model performance in the MDD phenotype.
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	Biomedical Informatics, Machine Learning, Causal Inference, Mental Health
dc.title	Evaluation of machine learning methods for prognostic and adverse event risk in Major Depressive Disorder
dc.type	Thesis
dc.date.updated	2024-05-15T17:42:10Z
dc.contributor.committeeMember	Novak, Laurie
dc.contributor.committeeMember	Spieker, Andrew J
dc.contributor.committeeMember	Taylor, Warren
dc.contributor.committeeMember	Wright, Adam
dc.type.material	text
thesis.degree.name	PhD
thesis.degree.level	Doctoral
thesis.degree.discipline	Biomedical Informatics
thesis.degree.grantor	Vanderbilt University Graduate School
dc.creator.orcid	0000-0002-5329-1070
dc.contributor.committeeChair	Walsh, Colin G

Files in this item

Name:: JONES-DISSERTATION-2024.pdf
Size:: 1.680Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Electronic Theses and Dissertations
Electronic theses and dissertations of masters and doctoral students submitted to the Graduate School.

Show simple item record