Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

Jarek Rettinghouse

Daniel Ho

Julian Ibarz

Yunfei Bai

Kuang-Huei Lee

Ted Xiao

Sangeetha Ramesh

Matt Bennice

Alexander Herzog

Chuyuan Kelly Fu

Adrian Li

Kim Kleiven

Keerthana Gopalakrishnan

Jeff Bingham

Yevgen Chebotar

David Rendleman

Wenlong Lu

Mohi Khansari

Mrinal Kalakrishnan

Ying Xu

Sean Kirmani

Noah Brown

Khem Holden

Kanishka Rao

Justin Vincent

Ryan Julian

Peter Pastor Sampedro

Jessica Lin

David Dovo

Montse Gonzalez Arenas

Daniel Kappler

Mengyuan Yan

Sergey Levine

Jessica Lam

Jonathan Weisz

Paul Wohlhart

Karol Hausman

Cameron Lee

Bob Wei

Yao Lu

Download Google Scholar

Abstract

We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL from real-world data with bootstrapping from training in simulation, and incorporates auxiliary inputs from existing computer vision systems as a way to boost generalization to novel objects, while retaining the benefits of end-to-end training. We analyze the tradeoffs of different design decisions in our system, and present a large-scale empirical validation that includes training on real-world data gathered over the course of 24 months of experimentation, across a fleet of 23 robots in three office buildings, with a total training set of 9527 hours of robotic experience. Our final validation also consists of 4800 evaluation trials across 240 waste station configurations, in order to evaluate in detail the impact of the design decisions in our system, the scaling effects of including more real-world data, and the performance of the method on novel objects.

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

Abstract

Research Areas

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

Abstract

Research Areas

Learn more about how we conduct our research

AI/ML Foundations  & Capabilities