• About
    • Login
    View Item 
    •   Institutional Repository Home
    • Electronic Theses and Dissertations
    • Electronic Theses and Dissertations
    • View Item
    •   Institutional Repository Home
    • Electronic Theses and Dissertations
    • Electronic Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of Institutional RepositoryCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsDepartmentThis CollectionBy Issue DateAuthorsTitlesSubjectsDepartment

    My Account

    LoginRegister

    Statistical Methods for the Analysis of Error-Prone Electronic Health Records: Impact of Source Data Verification, Time Discretized Multiple Imputation, and Variance Estimation with Incompatible Imputation and Analysis Models

    Giganti, Mark Joseph
    : https://etd.library.vanderbilt.edu/etd-08232018-184014
    http://hdl.handle.net/1803/13990
    : 2018-08-29

    Abstract

    Observational data from electronic health records (EHRs) are prone to errors which are often correlated across multiple variables. One strategy to assess EHR data quality is to compare the research study data to the original source document for a subset of records and document discrepancies. Given the resource-intensiveness of this source data verification (SDV), it is imperative to be able to justify its continued implementation. Using a data audit from an international HIV setting as a practical example, I propose a framework for assessing the impact of audits on study results and illustrate its implementation. Given the discrepancies in the originally collected data are substantial enough to impact epidemiological inferences, I propose a method to obtain unbiased and efficient estimates in time-to-event analyses while incorporating both the original error-prone data for all subjects and the audited data for the subsample of subjects. This time-discretized modeling and imputation (TDMI) approach uses discrete time models built in a validation sample to multiply impute covariate and outcome values in the remaining unvalidated records. Imputation variances estimates were calculated using an approached proposed by Robins and Wang (2000) that allows for incompatibility between imputation and analysis models. We provide a tutorial for calculating this imputation variance estimator using multiple examples and providing comprehensive R code.
    Show full item record

    Files in this item

    Icon
    Name:
    Giganti.pdf
    Size:
    1.490Mb
    Format:
    PDF
    View/Open

    This item appears in the following collection(s):

    • Electronic Theses and Dissertations

    Connect with Vanderbilt Libraries

    Your Vanderbilt

    • Alumni
    • Current Students
    • Faculty & Staff
    • International Students
    • Media
    • Parents & Family
    • Prospective Students
    • Researchers
    • Sports Fans
    • Visitors & Neighbors

    Support the Jean and Alexander Heard Libraries

    Support the Library...Give Now

    Gifts to the Libraries support the learning and research needs of the entire Vanderbilt community. Learn more about giving to the Libraries.

    Become a Friend of the Libraries

    Quick Links

    • Hours
    • About
    • Employment
    • Staff Directory
    • Accessibility Services
    • Contact
    • Vanderbilt Home
    • Privacy Policy