1,751 subscribers
Ga offline met de app Player FM !
Podcasts die het beluisteren waard zijn
GESPONSORDE
Learning Transformer Programs with Dan Friedman - #667
Manage episode 395557253 series 2355587
Today, we continue our NeurIPS series with Dan Friedman, a PhD student in the Princeton NLP group. In our conversation, we explore his research on mechanistic interpretability for transformer models, specifically his paper, Learning Transformer Programs. The LTP paper proposes modifications to the transformer architecture which allow transformer models to be easily converted into human-readable programs, making them inherently interpretable. In our conversation, we compare the approach proposed by this research with prior approaches to understanding the models and their shortcomings. We also dig into the approach’s function and scale limitations and constraints.
The complete show notes for this episode can be found at twimlai.com/go/667.
745 afleveringen
Learning Transformer Programs with Dan Friedman - #667
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Manage episode 395557253 series 2355587
Today, we continue our NeurIPS series with Dan Friedman, a PhD student in the Princeton NLP group. In our conversation, we explore his research on mechanistic interpretability for transformer models, specifically his paper, Learning Transformer Programs. The LTP paper proposes modifications to the transformer architecture which allow transformer models to be easily converted into human-readable programs, making them inherently interpretable. In our conversation, we compare the approach proposed by this research with prior approaches to understanding the models and their shortcomings. We also dig into the approach’s function and scale limitations and constraints.
The complete show notes for this episode can be found at twimlai.com/go/667.
745 afleveringen
Alle afleveringen
×



1 Waymo's Foundation Model for Autonomous Driving with Drago Anguelov - #725 1:09:07






1 Imagine while Reasoning in Space: Multimodal Visualization-of-Thought with Chengzu Li - #722 42:11


1 Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721 49:29


1 Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720 1:07:05




1 AI Trends 2025: AI Agents and Multi-Agent Systems with Victor Dibia - #718 1:44:59


1 Speculative Decoding and Efficient LLM Inference with Chris Lott - #717 1:16:30








1 Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713 1:08:49


Welkom op Player FM!
Player FM scant het web op podcasts van hoge kwaliteit waarvan u nu kunt genieten. Het is de beste podcast-app en werkt op Android, iPhone en internet. Aanmelden om abonnementen op verschillende apparaten te synchroniseren.