
The Agency Foundations Challenge

Bob Roberts
Filip Sondej
Jord
Vansh Gehlot
Piotr Zaborszczyk
Ibrahim Ola Garba
Slava Meriton
Codruta Lugoj
Catherine Brewer
Marco Bazzani
Sai Joseph
Vanshi Puri
Anton Zheltoukhov
Anastasia
Tanya Grinyuk
Mary Osuka
Aksinya
ram bharadwaj
Irina
Helios Lyons
Roman M
Aleksei
Andrey
Yury Orlovskiy
Giorgio Michele Scolozzi
Chris
Ibrahim
Uzay
Nick Osipov
Mikhail
Corey Morris
Parameswaran Kamalaruban
Oleg L.
Ilham ElAalami
aman jaiswal
Jord
Yijin Hua
Carson Ellis
Natalia
Julia Karbing
Luke Frymire
Tu Trinh
Thiago Henrique Silva
Carolina
Simon Lermen
Luigi
Nikita Menon
Mateusz Bagiński
Vladislav Bargatin
Danish Raza
Thomas James Ringstrom
Vansh
Sergej Maul
Chris
Juliette Culver
Nisan Stiennon
Logan Woods
Jonas Hallgren
Ariel Kwiatkowski
Heramb Podar
Keep name private
Mike
Kanishk Garg
Aishwarya Gurung
Stefano Schmidt
Kushal Thaman
Roman Hauksson
Indra Gesink
Tomas Dulka
Jeanne
Ashwini kumar Pal
Michael Andrzejewski
P
Ziyue Wang
Jan Provaznik
Jason Hoelscher-Obermaier
Benjamin Sturgeon
Michał Kubiak
Aishwarya Gurung
Nicole Phan
Javier Prieto
Igor Krawczuk
Harry Powell
Matija Franklin
Kerem
Danish Raza
Jobst Heitzig
Gerson foks
Marco Bazzani
Mateusz Bagiński
Konstantin Gulin
suvjectibity
ANDREW KOH
Abhimanyu
Tereza Okalova
Aishwarya Gurung
Edward
Tassilo Neubauer
Catalin Mitelut
Brady
Clem
Jonas Kgomo
Admin
Esben Kran
Signups
Uncertainty about value naturally leads to empowerment
Preserving Agency in Reinforcement Learning under Unknown, Evolving and Under-Represented Intentions
Discovering Agency Features as Latent Space Directions in LLMs via SVD
ILLUSION OF CONTROL
Comparing truthful reporting, intent alignment, agency preservation and value identification
In the Mirror: Using Chess to Simulate Agency Loss in Feedback Loops
Agency as Shannon information. Unveiling limitations and common misconceptions
Agency, value and empowerment.
Against Agency
Evaluating Myopia in Large Language Models
Entries
Hosted by Agency Foundations and Catalin Mitelut in collaboration with Apart Research
September 8th to September 24th 2023
Sign up

This challenge ran from September 8th to September 24th 2023.

Rewatch the keynote if you weren't there for the start, and get free replicate.ai credits for your work. See the existing resources on AI safety, reinforcement learning, and interpretability for topics 1 through 3. Happy research hacking!

Explore agency foundations research for the development & alignment of AI systems

Ever wondered how human agency, i.e. the capacity to (causally) control the world, will interact with increasingly powerful AI and future AGI systems that may also want to control the world? Do you question whether AGIs trained to focus on truthfulness, or that are "intent aligned", are sufficiently safe? Us too!

Join us on four research tracks in this two-week challenge, which kicks off with a hackathon hosted with Alignment Jams. Submit your final project at the end of the two weeks on this page.

We are developing an agency foundations paradigm to start researching agency in AI-human interactions, and are kicking off our work with a hackathon. We selected a few topics to start: figuring out how to algorithmically describe agency "preservation", mechanistically interpreting how neural networks represent agents and their capacities, and describing challenges in the governance of agency-preserving AI systems. More information about our conceptual goals for this hackathon is provided here: https://www.agencyfoundations.ai/hackathon.

  • Start:  Introductory talks - September 8th: 18:00-19:30 CET.
  • End: Submission deadline - September 24th night (any timezone).
  • Location: Online/Remote
  • Topics: (1) mechanistic interpretability; (2) RL/IRL; (3) game theory; (4) conceptual/governance (see here for more details)
  • Prizes: US$10,000 ($2,500 in each category)
  • Format: online submissions.

More details about specific prizes, categories, and additional information will be posted in the first week of September.

Sign up below to be notified before the kickoff! Read up on the schedule, see instructions for how to participate, and find inspiration on the agency foundations website.

Jump on the Discord and ask your preliminary questions in the #❓| questions channel!

Join Ibrahim, Simon Lermen, Aishwarya Gurung, and others!

Rules

You will participate in teams of 1-5 people and submit a project on the entry submission page (available when the hackathon starts). Each project consists of multiple parts: 1) the PDF report, 2) an optional video overview of at most 10 minutes, and 3) a title, summary, and description.

You are allowed to think about your project and engage with the starter resources before the hackathon starts, but your core research work should happen during the hackathon itself.

Tentative schedule

Subscribe to the calendar.

  • Friday September 8, 16:00 UTC: Keynote talk by Catalin Mitelut to inspire your projects and provide an introduction to the topic. Tim Franzmeyer will present his work on altruistic RL agents. Esben Kran will also give a short overview of the logistics.
  • Saturday and Sunday 14:00 UTC: Project discussion sessions on the Discord server.
  • Friday September 22nd 14:00 UTC: A discussion and short talk.
  • Sunday September 24th night (all time zones): Submission deadline!

Past experiences

See what our great hackathon participants have said
Jason Hoelscher-Obermaier
Interpretability hackathon
The hackathon was a really great way to try out research on AI interpretability and get in touch with other people working on this. The input, resources, and feedback provided by the team organizers, and in particular by Neel Nanda, were super helpful and very motivating!
Luca De Leo
AI Trends hackathon
I found the hackathon very cool; I think it significantly lowered my hesitance to participate in things like this in the future. A whole bunch of lessons learned, and Jaime and Pablo were very kind and helpful through the whole process.

Alejandro González
Interpretability hackathon
I was not that interested in AI safety and didn't know that much about machine learning before, but I heard about this hackathon thanks to a friend, and I don't regret participating! I've learned a ton, and it was a refreshing weekend for me.
Alex Foote
Interpretability hackathon
A great experience! A fun and welcoming event with some really useful resources for starting to do interpretability research. And a lot of interesting projects to explore at the end!
Sam Glendenning
Interpretability hackathon
Was great to hear directly from accomplished AI safety researchers and try investigating some of the questions they thought were high impact.

Keynote speakers

Catalin Mitelut

Postdoc at University of Basel and NYU studying neuroscience and human behavior manipulation by AI
Keynote speaker & organizer

Tim Franzmeyer

PhD student in cooperative AI and reinforcement learning with Philip Torr and others at Oxford University
Keynote speaker & judge

Esben Kran

Founder and CEO of Apart Research and previously lead data scientist and brain-computer interface researcher
Keynote speaker & organizer

Judges

Tushant Jha (TJ)

Director of Research at the AI Objectives Institute focused on AI strategy for agency amplification
Judge

Geoffrey Miller

Associate Professor at the University of New Mexico working on evolutionary psychology
Judge

Konrad Seifert

Co-CEO of the Simon Institute for Long-Term Governance
Judge

Erik Jenner

PhD student advised by Stuart Russell at CHAI with a research focus on alignment
Judge

Ben Smith

Researching multi-objective reinforcement learning to value-align AI
Judge

Catalin Mitelut

Postdoc at University of Basel and NYU studying neuroscience and human behavior manipulation by AI
Keynote speaker, judge & organizer

Tim Franzmeyer

PhD student in cooperative AI and reinforcement learning with Philip Torr and others at Oxford University
Keynote speaker & judge

Esben Kran

Founder and CEO of Apart Research and previously lead data scientist and brain-computer interface researcher
Judge

Secret governance judge

Has requested to remain anonymous for now.
Judge

More to be announced!

Starter resources

Check out the core starter resources that help you get started with your research as quickly as possible! The Colab links will be updated before the kickoff.

Research project ideas

Get inspired for your own projects with these ideas developed during the reading groups! Go to the Resources tab to engage more with the topic.

Registered jam sites

These jam sites hosted the kickoff hackathon over the weekend of September 8th to 10th.

AI Agency Foundations Hackathon
Research how AI agents work and how to keep them safe over a weekend! Currently only for students at the University of Texas at Dallas. goo.gl/maps/9M9r2q5wNEjYpTba8
Visit event page
UTDesign Makerspace
Agency Foundations Alignment Jam London
RSVP via Facebook or post in "uk" channel on Alignment Jam website for exact details
Visit event page
LEAH Coworking Space
AI agency hackathon
Join us at Fixed Point in Prague - Vinohrady, Koperníkova 6 for a weekend of research hacking to understand AI agency!
Visit event page
Prague Fixed Point
Global Agency Hackathon
Join everyone internationally on our Discord and get together in teams to solve the most important problems of agency preservation!
Visit event page
Alignment Jam Discord
Kraków Jam Site
Contact @matthewbaggins on Discord or bagginsmatthew@gmail.com. Hosted at ul. Celna 6/9, the office of The Optimum Pareto Foundation.
Visit event page
Kraków, Poland
Alignment Hackathons
A local space near Lomonosov Moscow State University. Location: Russia, Moscow, Lomonosovskiy Prospekt, 25k3
Visit event page
Turing Coffee Machine (EA Moscow)
Multi-Agent Safety Hackathon Moscow
A local space near Lomonosov Moscow State University. Location: Russia, Moscow, Lomonosovskiy Prospekt, 25k3
Visit event page
Turing Coffee Machine (EA Moscow)

Register your own site

The in-person hubs for the Alignment Jams are run by passionate individuals just like you! We organize the schedule, speakers, and starter templates, and you can focus on engaging your local research and engineering community. Read more about organizing and use the media below to set up your event.

Submit your project

Use this template for the report submission. As you create your project presentation, upload your slides here too. Make a recording of your slideshow or project with the recording capability of e.g. Keynote, PowerPoint, or Google Slides (uploaded e.g. via Vimeo).

For technical submissions within categories 1 through 3, you will have a maximum of 6 pages excluding limitations, appendix and references. For conceptual and governance submissions, you have a maximum of 10 pages excluding limitations, appendix and references.

Authors & links disabled for anonymous reviews
Evaluating Myopia in Large Language Models
Empirically investigating the myopia, or lack thereof, of Llama models
Interpretability of Agency
Read project
Authors & links disabled for anonymous reviews
In the Mirror: Using Chess to Simulate Agency Loss in Feedback Loops
Proposing a study to evaluate agency loss in human players pitted against their own memetic models fine-tuned in a feedback loop, inspired by recommender systems influencing information consumption patterns in social media.
Conceptual / Governance of Agency
Read project
Authors & links disabled for anonymous reviews
Preserving Agency in Reinforcement Learning under Unknown, Evolving and Under-Represented Intentions
This paper investigates several techniques to implement altruistic RL while preserving agency. Using a two-player grid game, we train a helper agent to support a lead agent in achieving its goals. By training the helper without showing it the goal and resampling the goals to rebalance for unequal value distributions, we demonstrate that helpers can act altruistically without observing the goals of the lead. We also initiate exploration of a technique to encourage corrigibility and respect for personal agency by resampling the lead's values during training, and point toward how these techniques could translate to real-world situations through meta-learning. (An illustrative sketch of goal resampling follows below.)
Agency-Preserving Reinforcement Learning
Read project
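To make the goal-resampling idea concrete, here is a minimal illustrative sketch, not the entry's actual code: the goal set, the inverse-frequency weighting, and the loop are all hypothetical stand-ins for whatever the authors used.

```python
import random

# Hypothetical goal set for the lead agent in a two-player grid game.
GOALS = ["reach_apple", "reach_key", "reach_door"]
counts = {g: 0 for g in GOALS}  # how often each goal has appeared in training

def resample_goal():
    """Inverse-frequency sampling: goals that have appeared less often are
    drawn more often, rebalancing an unequal distribution over values."""
    weights = [1.0 / (1 + counts[g]) for g in GOALS]
    goal = random.choices(GOALS, weights=weights, k=1)[0]
    counts[goal] += 1
    return goal

for episode in range(6):
    goal = resample_goal()
    # The helper's observation deliberately excludes `goal`; it would be
    # trained only on the lead's observable behavior (training step omitted).
    print(episode, goal)
```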
Authors & links disabled for anonymous reviews
Against Agency
I argue that agency is overrated when thinking about good futures, and that longtermist AI governance should instead focus on preserving and promoting human autonomy.
Conceptual / Governance of Agency
Read project
Authors & links disabled for anonymous reviews
Agency as Shannon information. Unveiling limitations and common misconceptions
We consider similarities between Shannon information, entropy, and agency. We argue that agency is an agent-independent, observer-dependent property. We discuss agency in the context of empowerment and argue that AI safety should be concerned with both. We also draw a connection between quantifiable agency and agency as described in the social sciences. (The standard information-theoretic definition of empowerment is given below.)
Conceptual / Governance of Agency
Read project
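As background for the empowerment discussion in this entry, the standard definition of n-step empowerment (due to Klyubin, Polani, and Nehaniv) is the Shannon channel capacity from an agent's n-step action sequence to the resulting state:

```latex
\mathfrak{E}(s_t)
  = \max_{p(a^n_t)} I\!\left(A^n_t;\, S_{t+n} \mid s_t\right)
  = \max_{p(a^n_t)} \left[ H\!\left(S_{t+n} \mid s_t\right) - H\!\left(S_{t+n} \mid A^n_t, s_t\right) \right]
```

The maximization is over distributions of action sequences, so the quantity measures, in bits, how much an agent can influence its future state from $s_t$.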
Authors & links disabled for anonymous reviews
Discovering Agency Features as Latent Space Directions in LLMs via SVD
Understanding the capacity of large language models to recognize agency in other entities is an important research endeavor in AI safety. In this work, we adapt techniques from a previous study to tackle this problem on GPT-2 Medium. We utilize Singular Value Decomposition to identify interpretable feature directions, and use GPT-4 to automatically determine whether these directions correspond to agency concepts. Our experiments show evidence suggesting that GPT-2 Medium contains concepts associating actions on agents with changes in their state of being. (A minimal sketch of the SVD step follows below.)
Interpretability of Agency
Read project
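For readers who want the flavor of the SVD step, here is a minimal sketch assuming one common variant of the technique: factor an MLP output matrix and read its singular directions through the unembedding. The layer index, matrix choice, and top-k are arbitrary illustrative choices, not the entry's actual setup.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

model = GPT2LMHeadModel.from_pretrained("gpt2-medium")
tok = GPT2TokenizerFast.from_pretrained("gpt2-medium")

# MLP output matrix of an arbitrary mid/late layer; the rows of Vh from its
# SVD are orthonormal directions in the residual stream.
W_out = model.transformer.h[18].mlp.c_proj.weight.detach()  # (4*d_model, d_model)
U, S, Vh = torch.linalg.svd(W_out, full_matrices=False)

# Interpret each singular direction by projecting it through the unembedding
# and reading off the highest-logit tokens ("logit lens" style).
W_U = model.lm_head.weight  # (vocab_size, d_model)
for i in range(5):
    top = torch.topk(W_U @ Vh[i], 10).indices
    print(f"direction {i}:", tok.convert_ids_to_tokens(top.tolist()))
```

A classifier pass over these token lists (the entry uses GPT-4) could then label which directions look agency-related.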
Authors & links disabled for anonymous reviews
Comparing truthful reporting, intent alignment, agency preservation and value identification
A universal approach can be created artificially, by gathering qualities of the different approaches on this list and beyond.
Agency-Preserving Reinforcement Learning
Read project
Authors & links disabled for anonymous reviews
Uncertainty about value naturally leads to empowerment
I discuss some problems with measuring empowerment by the "number of reachable states". I then propose a more robust measure based on uncertainty about ultimate value. I hope that by the end you will find the new measure obviously natural. I also provide a Gymnasium environment well suited to experimenting with optionality and value uncertainty. (A toy version of the reachable-states baseline is sketched below.)
Agency-Preserving Reinforcement Learning
Read project
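To make the baseline being critiqued concrete, here is a self-contained toy version of "empowerment as number of reachable states" on a small deterministic gridworld. The grid size, horizon, and dynamics are hypothetical; this is not the entry's Gymnasium environment.

```python
from itertools import product

ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up
SIZE = 5  # 5x5 grid

def step(state, action):
    # Deterministic dynamics with walls: moves are clipped at the grid edges.
    r = min(max(state[0] + action[0], 0), SIZE - 1)
    c = min(max(state[1] + action[1], 0), SIZE - 1)
    return (r, c)

def reachable_state_empowerment(state, k):
    """Count the distinct states reachable by some k-step action sequence."""
    outcomes = set()
    for seq in product(ACTIONS, repeat=k):
        s = state
        for a in seq:
            s = step(s, a)
        outcomes.add(s)
    return len(outcomes)

print(reachable_state_empowerment((0, 0), 2))  # corner: 6 reachable states
print(reachable_state_empowerment((2, 2), 2))  # center: 9 reachable states
```

One weakness such a count inherits is that it values all reachable states equally, which is exactly the gap a value-uncertainty-based measure targets.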
Authors & links disabled for anonymous reviews
Agency, value and empowerment.
Our project involves building on the paper "Learning Altruistic Behaviours in Reinforcement Learning without External Rewards" by Franzmeyer et al., first by trying to replicate the paper, and then by advancing research in this direction by including measures of the value of states for the leader agent in its empowerment calculations.
Agency-Preserving Reinforcement Learning
Read project
Authors & links disabled for anonymous reviews
ILLUSION OF CONTROL
This paper looks at the illusion of control held by individuals. AI has the capability to deceive human beings in order to evade safety nets. The covertness with which AI interferes with decision-making creates an illusion of control for human beings. The paper describes the different deceptive measures that AI incorporates, and possible measures to ensure governance of AI.
Conceptual / Governance of Agency
Read project

Send in pictures of you having fun hacking away!

We love to see the community flourish and it's always great to see any pictures you're willing to share uploaded here.

Q&A from the great first talk with Catalin!