Faculty Recruiting Support CICS

CIIR Talk Series: Open Language Model (OLMo): The Science of Language Models and Language Models for Science

17 Nov
Friday, 11/17/2023 1:30pm to 2:30pm
Computer Science Building, Room 150/151; Virtual via Zoom
Seminar

Abstract: Over the past few years, and especially since the deployment of ChatGPT in November 2022, neural language models with billions of parameters and trained on trillions of words are powering the fastest-growing computing applications in history and generating discussion and debate across society. However, AI scientists cannot study or improve those state-of-the-art models because the models' parameters, training data, code, and even documentation are not openly available. In this talk, I present our OLMo project toward building strong language models and making them fully open to researchers along with open-source code for data management, training, inference, and interaction. In particular, I describe DOLMa, a 3T token open dataset curated for training language models, Tulu, our instruction-tuned language model, and OLMo v1, a fully-open 7B parameter language model.

Bio: Hanna Hajishirzi is a Torode Family Associate Professor at UW CSE and a Senior Director at AI2. Her research spans different areas in NLP and AI, specifically understanding and advancing large language models. Honors include the NSF CAREER Award, Sloan Fellowship, Allen Distinguished Investigator Award, Intel rising star award, UIUC Alumni award, multiple best paper and honorable mention paper awards, and several industry research faculty awards. Hanna received her PhD from the University of Illinois and spent a year as a postdoc at Disney Research and CMU.

 

To attend this talk via Zoom, click here. To obtain the passcode for this series, please see the event advertisement on the seminars email list or reach out to zamani [at] cs.umass.edu (Hamed Zamani). For any questions about this event with the Center for Intelligent Information Retrieval, please contact jean [at] cs.umass.edu (subject: CIIR%20Talk%20Series) (Jean Joyce).