Reinforcement Learning | Professional Education

This course is offered in a hybrid format, with in-person and live online cohorts attending simultaneously. When registering, select the appropriate registration button below.

Course is closed

Lead Instructor(s)

Date(s)

Jul 28 - 30, 2025

Registration Deadline

Jul 7, 2025

Location

On Campus

Course Length

3 Days

Course Fee

$3,600

CEUs

2.2 CEUs

Sign-up for Course Updates

Download the Course Schedule

Course is closed

Sign-up for Course Updates

Reinforcement learning (RL), is enabling exciting advancements in self-driving vehicles, natural language processing, automated supply chain management, financial investment software, and more. In this three-day course, you will acquire the theoretical frameworks and practical tools you need to use RL to solve big problems for your organization.

This course may be taken individually or as part of the Professional Certificate Program in Machine Learning & Artificial Intelligence. COMPLETING THE COURSE WILL CONTRIBUTE 3 DAYS TOWARDS THE CERTIFICATE.

Course Overview

Understand if RL can solve the big problems of your organization. Acquire the theoretical framework and basic tools for implementing RL.

Join professionals from around the world to upgrade your machine learning (ML) toolkit in this three-day RL bootcamp. Through interactive lectures and hands-on exercises, you will (i) understand the difference between supervised learning and RL; (ii) be able to gauge which problems in your organization can be solved using RL; (iii) gain a solid understanding of state-of-the-art Deep RL algorithms; (iv) ability to cast your favorite challenge into the RL framework and recognize the promise and limitations of RL through a hands-on-session and live RL clinic; (v) be able to reason about which RL algorithm is most appropriate for the problem at hand.

This program includes the unique opportunity to present your organization’s specific technological challenges to MIT faculty during a live RL Clinic—a session designed to help you identify if RL can be used to solve your problems, determine which approach will be most effective, and design RL applications to resolve the issue. During this process, you will draw on the expertise of the course teaching team, which is comprised of recognized industry experts with experience working at 12 firms across multiple industries, from both startups
and big tech.

Certificate of Completion from MIT Professional Education

Learning Outcomes

Understand the basic principles of RL and learn when RL can be applied to your business problem and how to pose the problem for obtaining maximum gains from RL both through lectures and an interactive group session.
- Learn when supervised learning is sufficient and when RL can provide a big advantage.
Learn about Bandits, Contextual Bandits and the more general RL formulation.
Understand the theory and the practical aspects of how to use popular Deep RL algorithms such as DQN, A3C, PPO, SAC, TD3, MCTS.
Walk through application of RL algorithms and what made them work.
Develop rules-of-thumb to reason about when to use which Deep RL Algorithm.
Understand how to structure the observation, action space and the reward function for optimally training the RL agent.
Learn about the limitations of Deep RL algorithm, how to tune hyperparameters and practical tricks.

Lead Instructors

Pulkit Agrawal

Assistant Professor of Electrical Engineering and Computer Science, MIT

Director, Improbable AI Lab, MIT

Cathy Wu

Gilbert W. Winslow Career Development Assistant Professor in Civil and Environmental Engineering (CEE)

Member of a member of the Laboratory for Information and Decision Systems (LIDS)

Member of the Institute for Data, Systems, and Society (IDSS)

Lead Instructors

Pulkit Agrawal

Pulkit Agrawal is the Steven and Renee Chair Assistant Professor of Electrical Engineering and Computer Science at MIT and leads the Improbable AI Lab, part of the Computer Science and Artificial Intelligence Lab at MIT and affiliated with the Laboratory for Information and Decision Systems. In the past, Dr. Agrawal has spent time at Deep Mind and Qualcomm and an advisor for Cavium Inc. He co-founded SafelyYou, an organization that builds fall prevention technology, and the AI Foundry, an incubator for AI startups. He currently serves as an advisor for several startups and has research collaborations with companies such as IBM, Toyota, Sony, Facebook AI Research (FAIR), etc.

Cathy Wu

Cathy Wu is the Gilbert W. Winslow Career Development Assistant Professor of civil and environmental engineering at MIT and has worked across many fields and organizations, including Microsoft Research, OpenAI, the Google X Self-Driving Car Team, AT&T, Caltrans, Facebook, and Dropbox. Wu is also the founder and Chair of the Interdisciplinary Research Initiative at the ACM Future of Computing Academy.

Links & Resources

News/Articles:

AI accelerates problem-solving in complex scenarios, MIT News, December 5, 2023

Who Should Attend

This program is ideally suited for technical professionals who wish to understand cutting-edge trends and advances in reinforcement learning. Professionals who are not sure of when and how to apply RL in engineering and business settings will find this program especially useful.

The curriculum is particularly appropriate for professionals with significant experience and demonstrated career progression, such as:

Engineers / Managers who want to understand Deep RL and its implications
Research scientists who want to improve their ability to utilize Deep RL algorithms
Machine learning engineers and software engineers looking to use RL to enhance results derived from supervised learning systems
Data scientists who want to incorporate RL strategies into their machine learning toolkit
Data analysts and business analysts who are tasked with solving problems with limited quantities of data
Product managers and program managers who need to be able to identify when it is appropriate and effective to apply RL
CTOs and other executives who want to identify how RL can be implemented to address organization-wide challenges

Prerequisites

To be able to take full advantage of this program, we recommend that participants have a mathematical background in linear algebra and probability, basic knowledge of deep-learning, and experience with programming (preferably Python). This background will help participants follow some of the practical examples more effectively. There are two optional assignments in the program that will require a computer with Google CoLab that runs on any browser or Unix/Linux Terminal.

BROCHURE

Download the Course Brochure