4th 🏆
3rd 🏆
2nd 🏆
1st 🏆
Mechanistic
Private
Info hazard
See web link
See the code
Visit itch.io page
Read PDF
Read PDF
$B$ Confident Bro: Discovering Latent Knowledge In Language Models Without Supervision
Ox
Download
instead.
Download
instead.
Hackathon
Mechanistic Interpretability Hackathon
Sunday, January 22, 2023
Hackathon
Jam site
OxAI Mechanistic Interpretability Hackathon
Oxford AI Safety Hub and OxAI will host a jam site in Oxford.
Oxford University
, visit event page
Jam site
★★★☆☆
You have successfully rated this project!
Oops! Something went wrong while submitting the form.
You have successfully submitted your feedback. It should show up on this page.
Oops! Something went wrong while submitting the form.
Test
Test review
Dropout Incentivizes Privileged Bases
A test author
This is test feedback
[Example submission] OthelloScope
This project received
4
stars from a user
Fuzzing Large Language Models
This project received
4
stars from a user
Dropout Incentivizes Privileged Bases
This project received
4
stars from a user
[Example submission] OthelloScope