Apr 4, 2025
Mechanisms of Casual Reasoning
Ben Sturgeon, Jacy Reese Anthis, Mark Chimes, Sky Cope
Causal reasoning is a crucial part of how we humans safely and robustly think about the world. Can we identify if LLMs have causal reasoning? Marius Hobbhahn and Tom Lieberum (2022, Alignment Forum) approached this with probing. For this hackathon, we follow-up on that work by exploring a mechanistic interpretability analysis of causal reasoning in the 80 million parameters of GPT-2 Small using Neel Nanda’s Easy Transformer package.
No reviews are available yet
Cite this work
@misc {
title={
@misc {
},
author={
Ben Sturgeon, Jacy Reese Anthis, Mark Chimes, Sky Cope
},
date={
4/4/25
},
organization={Apart Research},
note={Research submission to the research sprint hosted by Apart.},
howpublished={https://apartresearch.com}
}