Everyday conversation is a remarkable feat: listeners extract meaning from a continuous stream of speech sounds and, in turn, produce spoken utterances of their own. Speech perception is thought to support these downstream processes by transforming variable acoustic input into robust perceptual representations. Yet this mapping is far from straightforward: there is no one-to-one correspondence between acoustic patterns and linguistic units, because the speech signal varies substantially across talkers, contexts, and listening conditions. Progress in understanding how humans achieve perceptual invariance, and the mechanisms that support robust speech recognition, has been limited by the lack of (i) stimulus-computable models that replicate human behavior and (ii) large-scale behavioral benchmarks for comparing humans and models on speech perception tasks. In this talk, I will present PARROT, an artificial neural network model of human speech perception. PARROT combines a simulation of the human ear with convolutional and recurrent neural network modules that map variable acoustic signals onto linguistic representations. I will then compare PARROT with humans across a suite of novel and established behavioral and neural evaluations, showing that PARROT faithfully reproduces patterns of human responses and confusions while also capturing key behavioral and neural signatures of speech perception. I will further show how PARROT generates testable predictions about the role of contextual integration in shaping human speech perception. Finally, I will highlight the transformations the network learns as it converts acoustic input into linguistic representations.
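
The abstract only states that PARROT pairs a peripheral ear simulation with convolutional and recurrent modules, so the sketch below is an illustrative assumption, not PARROT's actual implementation: it assumes a cochleagram-like input from a separate auditory periphery model, and all layer sizes, the frequency pooling, and the output vocabulary size are invented for demonstration.

```python
# Minimal sketch of the architecture family described in the abstract
# (cochleagram-like input -> convolutional features -> recurrent context
# integration -> linguistic labels). All hyperparameters are illustrative
# assumptions; this is NOT PARROT's published architecture.
import torch
import torch.nn as nn


class SpeechRecognizerSketch(nn.Module):
    def __init__(self, n_freq_channels: int = 64, n_classes: int = 1000):
        super().__init__()
        # Convolutional stage: local spectrotemporal feature extraction
        # over the (frequency x time) cochleagram.
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d((8, None)),  # pool frequency to 8 bands, keep time
        )
        # Recurrent stage: integrate features over time (contextual integration).
        self.rnn = nn.LSTM(input_size=64 * 8, hidden_size=256, batch_first=True)
        # Linear readout onto linguistic labels (e.g., a word inventory).
        self.readout = nn.Linear(256, n_classes)

    def forward(self, cochleagram: torch.Tensor) -> torch.Tensor:
        # cochleagram: (batch, 1, n_freq_channels, n_time_frames), assumed to
        # come from a separate simulation of the auditory periphery.
        feats = self.conv(cochleagram)                        # (batch, 64, 8, T)
        b, c, f, t = feats.shape
        seq = feats.permute(0, 3, 1, 2).reshape(b, t, c * f)  # (batch, T, 512)
        out, _ = self.rnn(seq)                                # (batch, T, 256)
        return self.readout(out[:, -1])                       # logits over classes


# Usage example with a random stand-in for the ear model's output.
model = SpeechRecognizerSketch()
dummy = torch.randn(2, 1, 64, 200)  # batch of 2, 64 frequency channels, 200 frames
logits = model(dummy)
print(logits.shape)  # torch.Size([2, 1000])
```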