Research Directions

My work focuses on understanding the representations and algorithms underlying human and machine cognition. For the first time, we have access to models that can flexibly accomplish complex cognitive tasks across a wide range of domains. My research uses insights from cognitive science in order to uncover the mechanisms supporting this seemingly-intelligent behavior, and, reciprocally, uses techniques from mechanistic interpretability to better characterize the similarities and differences between minds and machines. Ultimately, this research program aims to transform black-box neural networks into explicit and useful cognitive models of linguistic and visual processing, while also driving the development of more human-like artificial intelligence systems.

Here are some specific questions that I like to think about:

  • How do modern neural networks seem to produce compositional/symbolic behavior?
  • To what extent do language models learn to represent a coherent model of the world?
  • How can we best employ pretrained models as (components of) cognitive models?
  • How much of human cognition is truly symbolic, and how much is statistical?
  • What is the role of inductive biases in modern AI? Is scale all you need?

Selected Publications

Just Fantasy Diagram

Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility

Can language models distinguish the possible from the impossible?

Racing Thoughts Diagram

Racing Thoughts: Explaining Contextualization Errors in Large Language Models

Language models are usually great at incorporating context — but not always. What causes contextualization errors?

Beyond the Doors Diagram

Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects

How do vision transformers solve a simple symbolic visual reasoning task?

Break It Down Diagram

Break It Down: Evidence for Structural Compositionality in Neural Networks

Do neural networks self-organize into modular components when solving compositional tasks?