Show simple item record

A Modified Random Forest Kernel for Highly Nonstationary Gaussian Process Regression with Application to Clinical Data

dc.creatorVanHouten, Jacob Paul
dc.date.accessioned2020-08-22T20:32:48Z
dc.date.available2018-07-25
dc.date.issued2016-07-25
dc.identifier.urihttps://etd.library.vanderbilt.edu/etd-07222016-224434
dc.identifier.urihttp://hdl.handle.net/1803/13471
dc.description.abstractNonstationary Gaussian process regression can be used to transform irregularly episodic and noisy measurements into continuous probability densities to make them more compatible with standard machine learning algorithms. However, current inference algorithms are time-consuming or have difficulty with the highly bursty, extremely nonstationary data that are common in the medical domain. One efficient and flexible solution uses a partition kernel based on random forests, but its current embodiment produces undesirable pathologies rooted in the piecewise-constant nature of its inferred posteriors. I present a modified random forest kernel that adds a new sources of randomness to the trees, which overcomes existing pathologies and produces good results for highly bursty, extremely nonstationary clinical laboratory measurements.
dc.format.mimetypeapplication/pdf
dc.subjectstatistics
dc.subjectmachine learning
dc.subjectdata mining
dc.subjectlongitudinal data
dc.titleA Modified Random Forest Kernel for Highly Nonstationary Gaussian Process Regression with Application to Clinical Data
dc.typethesis
dc.contributor.committeeMemberThomas A. Lasko
dc.type.materialtext
thesis.degree.nameMS
thesis.degree.levelthesis
thesis.degree.disciplineBiostatistics
thesis.degree.grantorVanderbilt University
local.embargo.terms2018-07-25
local.embargo.lift2018-07-25
dc.contributor.committeeChairChristopher J. Fonnesbeck


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record