CSSI Lunch Seminar: Prosocial Language Models

Friday, 04/05/2024 12:00pm to 1:30pm

Lederle Graduate Research Center (LGRC) A112

CSSI Lunch

Large language models, such as GPT-4, have marked a significant advancement in the field of natural language processing, achieving near-human performance across a variety of tasks with minimal to no additional training data. The remarkable capabilities of these models can be attributed to their substantial parameter counts, often reaching hundreds or thousands of millions, and the extensive datasets sourced from the web for their pre-training. Despite their successes, the very characteristics that empower these models also render them susceptible to mirroring web-based biases and antisocial behaviors. Such reflections pose considerable challenges in deploying these models in real-world scenarios, particularly in socially sensitive applications. In response, our laboratory focuses on developing techniques for the post hoc mitigation of these antisocial tendencies, allowing for the enforcement of prosocial behaviors during model inference without the need for resource-intensive retraining. This presentation will delve into our latest efforts to reduce bias and enhance alignment with human ethical standards in language models through inference-time interventions.

Host: CSSI - Brendan O'Connor
