COVID-19 Updates: MIT Professional Education fully expects to resume on-campus courses during the Summer of 2022. In the event there is a change in MIT's COVID-19 policies and a course cannot be held on-campus, we will deliver courses via live virtual format. Find the latest information here.

Register Now
Lead Instructor(s)
Date(s)
Jul 18 - 22, 2022
Registration Deadline
Location
On Campus
Course Length
5 Days
Course Fee
$4,700
CEUs
3.1 CEUs
Sign-up for Course Updates

Acquire the skills you need to build advanced computer vision applications featuring innovative developments in neural network research. Designed for engineers, scientists, and professionals in healthcare, government, retail, media, security, and automotive manufacturing, this immersive course explores the cutting edge of technological research in a field that is poised to transform the world—and offers the strategies you need to capitalize on the latest advancements.

    Course Overview

    This course may be taken individually or as part of the Professional Certificate Program in Machine Learning & Artificial Intelligence.

    Deep learning innovations are driving exciting breakthroughs in the field of computer vision. Robots and drones not only “see”, but respond and learn from their environment. Autonomous cars avoid collisions by extracting meaning from patterns in the visual signals surrounding the vehicle. 

    This course covers the latest developments in vision AI, with a sharp focus on advanced deep learning methods, specifically convolutional neural networks, that enable smart vision systems to recognize, reason, interpret and react to images with improved precision. 

    Requirements

    Laptops with which you have administrative privileges along with Python installed are encouraged but not required for this course (all coding will be done in a browser).

    Participants should have experience in programming with Python, as well as experience with linear algebra, calculus, statistics, and probability.

    Learning Outcomes

    Participants will explore the latest developments in neural network research and deep learning models that are enabling highly accurate and intelligent computer vision systems capable of understanding and learning from images. We will start from fundamental topics in image modeling, including image formation, feature extraction, and multiview geometry, then move on to the latest applications in object detection, 3D scene understanding, vision and language, image synthesis, and vision for embodied agents. By the end, participants will:

    • Be familiar with fundamental concepts and applications in computer vision
    • Grasp the principles of state-of-the art deep neural networks 
    • Understand low-level image processing methods such as filtering and edge detection
    • Gain knowledge of high-level vision tasks such as object recognition, scene recognition, face detection and human motion categorization 
    • Develop practical skills necessary to build highly-accurate, advanced computer vision applications

    Program Outline

    This course meets 9:00 am - 5:00 pm each day.

    Day One:
    9:00am: 1 - Introduction to computer vision (Torralba)
    10:00am: 2- Cameras and image formation (Torralba)
    11:00am: Coffee break
    11:15am: 3- Introduction to machine learning (Isola)
    12:15pm: Lunch
    1:30pm: 4- The problem of generalization (Isola)
    2:45pm: Coffee break
    3:00pm: Lab on Pytorch
    5:00pm : Adjourn

    Day Two:
    9:00am: 5- Neural networks (Isola)
    10:00am: 6- Filters and CNNs (Torralba)
    11:00am: Coffee break
    11:15am: 7- Stochastic gradient descent (Torralba)
    12:15pm: Lunch break
    1:30pm: 8- Temporal processing and RNNs (Isola)
    2:45pm: Coffee break
    3:00pm: Lab on using modern computing infrastructure
    5:00pm: Adjourn

    Day Three:
    9:00am: 9- Multiview geometry (Torralba)
    10:00am: 10- 3D deep learning (Torralba)
    11:00am: Coffee break
    11:15am: 11- Scene understanding part 1 (Isola)
    12:15pm: Lunch break 
    1:30pm: 12- Scene understanding part 1 (Isola)
    2:45pm: Coffee break
    3:00pm: Lab on scene understanding
    5:00pm: Adjourn

    Day Four:
    9:00am: 13- People understanding (Torralba)
    10:00am: 14- Vision and language (Torralba)
    11:00am: Coffee break
    11:15am 15- Image synthesis and generative models (Isola)
    12:15pm: Lunch break
    1:30pm: 16- AR/VR and graphics applications (Isola)
    2:45pm: Coffee break
    3:00pm: Lab on generative adversarial networks
    5:00pm: Adjourn

    Day Five:
    9:00am: 17- Vision for embodied agents (Isola)
    10:00am: 18- Modern computer vision in industry: self-driving, medical imaging, and social networks
    (Torralba)
    11:00am: Coffee break
    11:15am: 19- Datasets, bias, and adaptation, robustness, and security (Torralba)
    12:15pm: Lunch break 
    1:30pm: 20- Deepfakes and their antidotes (Isola)
    2:45pm: Coffee break
    3:00pm: Lab on your own work (bring your project and we will help you to get started)
    4:55pm: closing remarks
    5:00pm: Adjourn

    Content

    The type of content you will learn in this course, whether it's a foundational understanding of the subject, the hottest trends and developments in the field, or suggested practical applications for industry.

    Fundamentals: Core concepts, understandings, and tools - 40%|Latest Developments: Recent advances and future trends - 40%|Industry Applications: Linking theory and real-world - 20%
    40|40|20
    Delivery Methods

    How the course is taught, from traditional classroom lectures and riveting discussions to group projects to engaging and interactive simulations and exercises with your peers.

    Lecture: Delivery of material in a lecture format - 50%|Discussion or Groupwork: Participatory learning - 30%|Labs: Demonstrations, experiments, simulations - 20%
    50|30|20
    Levels

    What level of expertise and familiarity the material in this course assumes you have. The greater the amount of introductory material taught in the course, the less you will need to be familiar with when you attend.

    Introductory: Appropriate for a general audience - 30%|Specialized: Assumes experience in practice area or field - 50%|Labs: Demonstrations, experiments, simulations - 20%
    30|50|20
    Who Should Attend

    This course is designed for professionals from industries such as healthcare, government, security, and automotive manufacturing, who are looking to solve computer vision problems with deep learning. Participants who will benefit from the curriculum include:

    • Engineers looking to design and build advanced computer vision applications 

    • Data scientists who want to explore cutting-edge developments in neural network research and deep learning models, and acquire strategies for applying them to their work

    • Managers and business executives seeking to capitalize on the latest advancements in deep learning for computer vision and AI 

    • Government and military officials who need to understand how computer vision applications are deployed in pertinent technologies, such as drones and other defense systems

    COVID-19 Updates

    We fully expect to resume on-campus Short Programs courses during the Summer of 2022. However, the possibility remains of ongoing disruption and restrictions due to COVID-19 which may require that the course be delivered via live virtual format. Please read more here.

    Links & Resources

    Brochure
    Download the course brochure
    SP - Deep Learning for AI and Computer Vision - Thumbnail