Show simple item record

A Theoretical & Empirical Analysis of Transformer Language Model Behavior

dc.creatorRoberts, Jesse Taylor Noah
dc.date.accessioned2024-08-15T18:19:49Z
dc.date.available2024-08-15T18:19:49Z
dc.date.created2024-08
dc.date.issued2024-05-16
dc.date.submittedAugust 2024
dc.identifier.urihttp://hdl.handle.net/1803/19164
dc.description.abstractThis dissertation presents empirical and theoretical work aimed at enhancing the understanding of transformer-based Large Language Model (LLM) behaviors, with the empirical behaviors compared to established human behaviors. The dissertation introduces PopulationLM, a method employing systematic perturbations to generate model populations, facilitating the characterization of robust LLM cognitive behaviors. Using PopulationLM, the study replicates experiments on typicality and structural priming, demonstrating typicality effects in LLMs and the absence of structural priming in tested models. The dissertation examines human-like strategic behaviors in LLMs, highlighting models capable of value-based preference (VBP) and their responses in scenarios like the prisoner's (PD) and traveler's dilemmas (TD). Findings reveal that robust, VBP-capable LLMs may not exhibit certainty towards weakly dominated strategies, and align with human sensitivities to stake-size (PD) and penalty-size (TD). Moreover, the dissertation advocates for reproducible research, cautioning against reliance on closed-source models due to their lack of long-term reproducibility similar to important but privately held fossils. The theoretical contributions assert the Turing completeness of decoder-only transformers, while also identifying limitations when engaged in certain tasks, prompting exploration of alternative architectures for advancing artificial general intelligence.
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectDecoder-only language models
dc.subjectLLM
dc.subjectPopulationLM
dc.subjectCognitive Science
dc.subjectMachine Learning
dc.subjectArtificial Intelligence
dc.subjectTransformer
dc.subjectTuring Complete
dc.subjectLanguage Model Behavior
dc.subjectLanguage Model
dc.subjectGame Theory
dc.subjectPrisoner's Dilemma
dc.subjectTraveler's Dilemma
dc.subject
dc.titleA Theoretical & Empirical Analysis of Transformer Language Model Behavior
dc.typeThesis
dc.date.updated2024-08-15T18:19:49Z
dc.type.materialtext
thesis.degree.namePhD
thesis.degree.levelDoctoral
thesis.degree.disciplineComputer Science
thesis.degree.grantorVanderbilt University Graduate School
dc.creator.orcid0000-0002-6210-0678
dc.contributor.committeeChairFisher, Doug


Files in this item

Icon

This item appears in the following Collection(s)

Show simple item record