Opening the black box of deep learning models by digging into their successes and failures

20 Oct

Thursday, 10/20/2022 12:00pm to 1:00pm

Computer Science Building, Room 150/151, Zoom

Machine Learning and Friends Lunch

Title: Opening the black box of deep learning models by digging into their successes and failures

Abstract: Modern machine learning is powered by a simple recipe: train large models on vast datasets. But while these models can be potent, a lot remains to be understood about how they actually work.

In this talk, I will demonstrate how probing some of the striking successes and failures of these models can allow us to peek inside the black box. First, I will look at the phenomenon of adversarial examples—highly accurate models are severely brittle to imperceptible perturbations. I will discuss findings that shed light into the origins of this phenomenon and their implications for learning in general. Then, I will focus on the emergent capability of in-context learning—large language models are able to adapt to new tasks on-the-fly, by conditioning on a few input-output examples. I will present a methodology that allows us to rigorously study this capability and probe its limits. Overall, these explorations exemplify how we can understand a lot about the inner workings of these models by dissecting their behavior outside typical conditions.

Bio: Dimitris Tsipras is a postdoctoral scholar at Stanford University, advised by Percy Liang and Greg Valiant. He obtained his PhD from MIT, advised by Aleksander Madry. His research is aimed towards understanding and improving the modern machine learning toolkit, focusing on topics such as reliability, benchmarks, and interpretability.

To find out more information about this event or to obtain the Zoom link, please see the event announcements from MLFL on the college email lists or contact wenlongzhao [at] cs.umass.edu (subject: MLFL%20Zoom%20Link) (Wenlong Zhao).

Host

:

MLFL

Opening the black box of deep learning models by digging into their successes and failures

Subscribe to the CICS eNewsletter