Using Electronic Health Record Derived Medication Data for Population Pharmacokinetic Studies

Williams, Michael Lee

Using Electronic Health Record Derived Medication Data for Population Pharmacokinetic Studies

dc.contributor.advisor	Choi, Leena
dc.creator	Williams, Michael Lee
dc.date.accessioned	2023-01-06T21:25:13Z
dc.date.created	2022-12
dc.date.issued	2022-10-19
dc.date.submitted	December 2022
dc.identifier.uri	http://hdl.handle.net/1803/17871
dc.description.abstract	This work is centered on the implementation of EHR derived medication data to perform population pharmacokinetic studies and is divided into chapters which focus on a different aspect of this field. In the first, we conducted a population pharmacokinetic (PK) study with 363 subjects using real-world data extracted from electronic heath records (EHRs) to estimate the tacrolimus population PK profile. We assessed the sensitivity of the PK parameter estimates to assumptions about dose timing using last-dose times extracted by our own natural language processing system, medExtractR. Our findings suggest that drugs with a slower elimination rate (or a longer half-life) are less sensitive to dose timing errors and that experimental designs which only allow for trough blood concentrations are usually insensitive to deviation in absorption rate. In the next, we examined fentanyl pharmacogenetics. CYP3A4 and CYP3A5 encode enzymes which metabolize fentanyl; genetic variants in these genes impact fentanyl pharmacokinetics in adults. In a pediatric cohort, we found that a genotype of CYP3A51/3 or CYP3A51/6 (i.e., intermediate metabolizer status) was associated with a 0.84-fold (95% confidence interval [CI]: 0.71 to 1.00) reduction in clearance vs. CYP3A51/1 (i.e., normal metabolizer status). CYP3A53/3, CYP3A53/6, or CYP3A56/6 (i.e., poor metabolizer status) was associated with a 0.76-fold (95% CI: 0.58 to 0.99) reduction in clearance. In the final model, expected clearance was 8.9 and 6.8 L/hr for a normal and poor metabolizer, respectively, with median population covariates (9 months old, 7.7 kg, low surgical severity). In the final study, we developed a set of models for predicting valid doses from invalid doses in EHR sourced medication data. We built models using supervised methods such as Random Forests and Adapted Boosting as well as unsupervised methods such as Markov Models and Hidden Markov Models. We tested the models on cohorts of medication notes for two drugs, tacrolimus and lamotrigine. In the tacrolimus test set, the best model was the Hidden Markov Model (squared root mean squared error (RMSE) = 1.4 mg). In the lamotrigine set, it was Two-Stage Random Forest Model (RMSE = 168 mg).
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	Pharmacokinetic
dc.subject	Pharmacogenetics
dc.subject	Natural Language Processing
dc.subject	Therapeutic Drug Monitoring
dc.title	Using Electronic Health Record Derived Medication Data for Population Pharmacokinetic Studies
dc.type	Thesis
dc.date.updated	2023-01-06T21:25:13Z
dc.type.material	text
thesis.degree.name	PhD
thesis.degree.level	Doctoral
thesis.degree.discipline	Biostatistics
thesis.degree.grantor	Vanderbilt University Graduate School
local.embargo.terms	2023-12-01
local.embargo.lift	2023-12-01
dc.creator.orcid	0000-0003-4991-2329
dc.contributor.committeeChair	Shepherd, Bryan

Files in this item

Name:: WILLIAMS-DISSERTATION-2022.pdf
Size:: 9.726Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Electronic Theses and Dissertations
Electronic theses and dissertations of masters and doctoral students submitted to the Graduate School.

Show simple item record