Abstract
Federated learning aims to solve a global optimization problem by learning collectively from a group of clients without sharing the data they possess. In offline reinforcement learning, an agent aims to learn an optimal policy for its behavior without interacting with the environment, using only a previously collected dataset. While federated algorithms perform well in the supervised learning regime, extending them to offline reinforcement learning is non-trivial due to the challenges of learning from heterogeneous data.
This work proposes a Federated Ensemble-Directed Offline Reinforcement Learning Algorithm (FEDORA), which uses an ensemble learning approach to collectively gather the wisdom of clients possessing heterogeneous data. FEDORA is implemented using the Flower framework to enable its use in large-scale federated systems. The algorithm is applied to learn a policy for waypoint navigation and obstacle avoidance from a group of mobile robots with varying expertise levels.
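As a rough illustration of the federated aggregation step underlying such systems, the sketch below shows a plain weighted parameter average over client policy parameters. This is a generic FedAvg-style sketch, not FEDORA's actual ensemble-directed rule; the function name, the weighting scheme (e.g., by dataset size or estimated policy quality), and the flat parameter vectors are all assumptions for illustration.

```python
import numpy as np

def federated_average(client_params, client_weights):
    """Weighted average of flat client parameter vectors (FedAvg-style sketch).

    client_params: list of 1-D numpy arrays, one per client (hypothetical layout).
    client_weights: per-client weights, e.g., dataset size or estimated policy quality.
    """
    w = np.asarray(client_weights, dtype=float)
    w = w / w.sum()                       # normalize weights to sum to 1
    stacked = np.stack(client_params)     # shape: (num_clients, num_params)
    return (w[:, None] * stacked).sum(axis=0)

# Three clients with heterogeneous data contribute policy parameters;
# the third client is weighted more heavily (e.g., higher estimated quality).
params = [np.array([1.0, 2.0]), np.array([3.0, 4.0]), np.array([5.0, 6.0])]
weights = [1, 1, 2]
global_params = federated_average(params, weights)  # -> array([3.5, 4.5])
```

In a Flower deployment, an aggregation rule of this shape would live in the server-side strategy, while clients perform local offline RL updates and return their parameters.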
Ragothaman, Nitin Kasshyap (2023). Federated Offline Reinforcement Learning. Master's thesis, Texas A&M University. Available electronically from https://hdl.handle.net/1969.1/199114.