Augmenting Deep Reinforcement Learning with Clustering

Machavaram, Hemanth

dc.contributor.advisor	Fisher, Douglas H
dc.creator	Machavaram, Hemanth
dc.date.accessioned	2020-07-01T00:08:47Z
dc.date.available	2020-07-01T00:08:47Z
dc.date.created	2020-05
dc.date.issued	2020-03-27
dc.date.submitted	May 2020
dc.identifier.uri	http://hdl.handle.net/1803/10107
dc.description.abstract	Deep reinforcement learning (Deep RL) algorithms today suffer heavily from sample inefficiency: they need a large number of training samples to learn the desired behavior. There are several reasons for this inefficiency, and one of them may be the very thing that makes Deep RL so powerful: the use of neural networks as function approximators. Most Deep RL algorithms use a neural network to predict a policy function and then, through backpropagation, attempt to shift the policy predictions toward an optimal policy. However, it is difficult to control which region of the input space benefits from learning on each training example. If too small a region benefits, then more training samples from the unaffected regions are required to learn essentially the same information, which drives up the total number of required training samples. This work aims to tackle the problem by forcing the neural network to form a minimal set of clusters over the input space, so that what is learned from one training sample in a cluster is distributed throughout the entire cluster rather than over an unknown region around the training example, as the neural network would do on its own. Specifically, Proximal Policy Optimization (PPO) with an RNN policy will be augmented with clusters, and it will be shown that this produces better results than vanilla PPO.
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	Deep Reinforcement Learning
dc.subject	Clustering
dc.subject	Sample Inefficiency
dc.title	Augmenting Deep Reinforcement Learning with Clustering
dc.type	Thesis
dc.date.updated	2020-07-01T00:08:47Z
dc.type.material	text
thesis.degree.name	MS
thesis.degree.level	Masters
thesis.degree.discipline	Computer Science
thesis.degree.grantor	Vanderbilt University
dc.creator.orcid	0000-0002-2850-5209

Files in this item

Name:: MACHAVARAM-THESIS-2020.pdf
Size:: 576.4Kb
Format:: PDF

Name:: code.zip
Size:: 8.329Kb
Format:: application/

This item appears in the following Collection(s)

Electronic Theses and Dissertations
Electronic theses and dissertations of masters and doctoral students submitted to the Graduate School.
