Entity tracking in Language Models - Sebastian Schuster - EVENT POSTPONED
30 May 2024, 1:00 pm–3:00 pm

Event Information
Open to
- All
Organiser
- Alina Konradt
Location
- 4th floor Seminar Room, 66-72 Gower Street, London, WC1E 6EA, United Kingdom
Title: Entity tracking in Language Models
Abstract: Keeping track of how states and relations of entities change as a text or dialog unfolds is a key prerequisite to discourse understanding as well as other AI tasks such as planning, and yet it remains unclear to what extent pretrained language models systematically exhibit this capability. In my talk, I will first discuss the challenges that come with evaluating such general abilities in LMs, and then I will present a new evaluation task for assessing entity tracking abilities in LMs. I will then present results on GPT-3/3.5/4, Flan-T5, and Llama 2 models and discuss the influence of pretraining on code for entity tracking abilities. I will also show that smaller models can learn to track entities but their generalization abilities are still quite limited, and present some preliminary results from a mechanistic interpretability study on identifying the algorithm that the model implements for solving this task.
About the Speaker
Sebastian Schuster
at UCL Linguistics