Holistic Language Processing: Joint Models of Linguistic Structure

Monday, 03/22/2010 12:00pm to 1:00pm
Seminar

Jenny Rose Finkel
Stanford University
Computer Science

Computer Science Building, Rooms 150 & 151

Faculty Host: James Allan

The natural language processing (NLP) applications that ultimately affect people's daily lives are high-level, semantically oriented ones: question answering, machine translation, machine reading, speech interfaces for robots and machines, and others that we haven't even thought of yet. Humans are very good at these types of tasks, in part because they naturally employ holistic language processing. They effortlessly keep track of many layers of low-level information while simultaneously integrating long-distance information from elsewhere in the conversation or document. In contrast, much NLP research focuses on lower-level tasks, like parsing, named entity recognition, and part-of-speech tagging. Moreover, for the sake of efficiency, researchers modeling these phenomena make extremely strong independence assumptions, which completely decouple these tasks, and look only at local context when making decisions. This talk will cover multiple aspects of holistic language processing and describe systems for joint parsing and named entity recognition; named entity recognition that incorporates long-distance information; and multi-task learning over multiple domains and over multiple datasets with varying amounts of annotated information. These systems are designed to produce analyses that are more consistent, of higher quality, and generally more useful for the kinds of tasks that non-researchers actually care about.

BIO:

Jenny Rose Finkel is a fifth-year PhD student in the Computer Science Department at Stanford University. She received a BS in Computer Science from Columbia University in 2002. She is a member of the Artificial Intelligence Lab and the Natural Language Processing Group. Her research interests include machine learning, probabilistic graphical models, and their applications to human language processing. She has been the recipient of a Stanford School of Engineering Fellowship and a Stanford Graduate Fellowship. During her time at Stanford she has published sixteen scientific papers in peer-reviewed workshops, conference proceedings, and journals. Her non-academic interests include knitting, cooking, backpacking, and bike touring.

A reception will be held at 3:40 PM in the atrium, outside the presentation room.