dc.creator | Jones, Barrett | |
dc.date.accessioned | 2024-05-15T17:42:10Z | |
dc.date.available | 2024-05-15T17:42:10Z | |
dc.date.created | 2024-05 | |
dc.date.issued | 2024-02-29 | |
dc.date.submitted | May 2024 | |
dc.identifier.uri | http://hdl.handle.net/1803/19010 | |
dc.description.abstract | Clinical risk prediction models trained on electronic health record (EHR) data can support clinicians in decision making by estimating prognostic risk. Major Depressive Disorder (MDD) risk modeling presents particular challenges due to data sparsity and a developing understanding of risk factors. Autoencoders are an unsupervised learning method that have potential to account for some of the data challenges by learning feature dependencies, reducing dimensionality, and denoising. We evaluated two use cases for autoencoders—feature embedding and pretraining—and found that pretrained models outperform comparable models from the literature.
Treatment decisions often involve switching antidepressants and the safety of this practice is an important consideration in treatment decisions. EHR data reflect clinical practice and can be used to measure adverse event risk, but the observational nature of EHR data can result in biased analyses. We evaluated two common methods for treatment effect estimation—propensity score weighting and propensity score matching—and used control outcomes to measure bias and calibrate effect estimates of adverse event risk. We found propensity score matching estimates were more robust, potentially due to clinical expertise informing selection of the study population.
Prior research suggests that adverse event risk may be heterogeneous. We apply our finding of bias reduction in the matched population to estimation of heterogenous treatment effects (HTE). First, we used semi-synthetic outcomes to measure model performance under varying data generating processes. Followed by significance testing to identify adverse event HTE. Analysis of semi-synthetic and adverse event outcomes identified limited evidence for heterogeneity and highlights steps necessary to implement clinically useful HTE models. Our work improved prognostic modeling, showed evidence for robust adverse event effect estimation, and provided insight into variation in HTE model performance in the MDD phenotype. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dc.subject | Biomedical Informatics, Machine Learning, Causal Inference, Mental Health | |
dc.title | Evaluation of machine learning methods for prognostic and adverse event risk in Major Depressive Disorder | |
dc.type | Thesis | |
dc.date.updated | 2024-05-15T17:42:10Z | |
dc.contributor.committeeMember | Novak, Laurie | |
dc.contributor.committeeMember | Spieker, Andrew J | |
dc.contributor.committeeMember | Taylor, Warren | |
dc.contributor.committeeMember | Wright, Adam | |
dc.type.material | text | |
thesis.degree.name | PhD | |
thesis.degree.level | Doctoral | |
thesis.degree.discipline | Biomedical Informatics | |
thesis.degree.grantor | Vanderbilt University Graduate School | |
dc.creator.orcid | 0000-0002-5329-1070 | |
dc.contributor.committeeChair | Walsh, Colin G | |