Cooperation and collusion in AI systems

Join us for two days of research hacking on important questions! This time, we're teaming up with Oxford University's Christian Schroeder de Witt to explore how AI systems might collude and where cooperation can fail.

See the logistics information slides here and watch the keynote below:

Introduction

This hackathon focuses on concrete problems in multi-agent security in the age of autonomous and agentic systems. We are especially interested in projects that highlight ways to enhance the credibility and trust guarantees of agentic AI within the next 1-2 years. We encourage the use of an interdisciplinary toolbox, including economics, mechanism design, game theory, cryptography, and auction theory. We expect most research projects to focus on frontier AI systems.

Some general directions we'd like to explore include:

Decentralized commitment devices (or, in general, formal contracts) for AI security and cooperation.
Collusion among generative model agents, using cryptographic contracts.
Simulation of financial markets (e.g., high-frequency trading, lending, market making) using generative agents. We aim to determine if contracts can help stabilize these AI agents. For instance, can individually selfish agents with contracts achieve better outcomes than innately pro-social agents (i.e., those with modified reward functions)?
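To make the last question concrete, here is a minimal sketch in a one-shot Prisoner's Dilemma. Everything in it is an illustrative assumption rather than something taken from the readings: the payoff numbers, the penalty-style contract in `with_contract`, the reward-sharing rule in `prosocial`, and all function names are hypothetical. It only compares the pure-strategy Nash equilibria of the three variants.

```python
from itertools import product

ACTIONS = ("C", "D")  # cooperate, defect

# Assumed Prisoner's Dilemma payoffs: (row player, column player).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}

def with_contract(payoffs, penalty):
    """Hypothetical binding contract: a defector transfers `penalty` to the other player."""
    out = {}
    for (a, b), (ua, ub) in payoffs.items():
        if a == "D":
            ua, ub = ua - penalty, ub + penalty
        if b == "D":
            ua, ub = ua + penalty, ub - penalty
        out[(a, b)] = (ua, ub)
    return out

def prosocial(payoffs, weight):
    """Modified rewards: each agent adds `weight` times the other's payoff to its own."""
    return {k: (ua + weight * ub, ub + weight * ua) for k, (ua, ub) in payoffs.items()}

def pure_nash(payoffs):
    """Return all action profiles where neither player gains by deviating alone."""
    equilibria = []
    for a, b in product(ACTIONS, repeat=2):
        ua, ub = payoffs[(a, b)]
        a_is_best = all(ua >= payoffs[(x, b)][0] for x in ACTIONS)
        b_is_best = all(ub >= payoffs[(a, y)][1] for y in ACTIONS)
        if a_is_best and b_is_best:
            equilibria.append((a, b))
    return equilibria

def welfare(profile):
    """Total payoff of a profile, measured in the original (unmodified) game."""
    return sum(PAYOFFS[profile])

variants = {
    "selfish, no contract": PAYOFFS,
    "selfish, contract (penalty=3)": with_contract(PAYOFFS, 3),
    "pro-social (weight=0.5)": prosocial(PAYOFFS, 0.5),
    "pro-social (weight=1.0)": prosocial(PAYOFFS, 1.0),
}
for name, game in variants.items():
    for profile in pure_nash(game):
        print(f"{name}: equilibrium {profile}, welfare {welfare(profile)}")
```

With these assumed numbers, selfish agents without a contract end up at mutual defection, the penalty contract makes mutual cooperation the only pure equilibrium, and partially pro-social agents (weight 0.5) only reach asymmetric outcomes. A hackathon project would replace the payoff table with generative agents and a richer market environment; the sketch only shows the kind of comparison the question asks for.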

For more inspiration, check out the "More Resources" tab. To see the schedule, please go to the "Schedule" tab.

Requirements

There are none! We invite people with diverse backgrounds to come along. We're always excited to welcome new perspectives into this fascinating field.

We especially welcome students exploring AI safety, cybersecurity professionals entering the field, AI safety researchers, ML researchers, and academics in the field of systemic security.

What is this hackathon?

The Apart Sprints are weekend-long challenges hosted by Apart to help you get exposure to real-world problems and develop object-level work that takes the field one step closer to secure artificial intelligence!

Host a local group

If you are part of a local machine learning or AI safety group, you are very welcome to set up a local in-person site to work together with people on this hackathon! We will have several across the world and you can easily sign up under the “Hackathon locations” tab above.

You can use the resources at the bottom of that page to share on social media and set up an event space quickly and easily.

Organizers & Speakers

Christian Schroeder de Witt

Postdoc in the FLAIR group at the University of Oxford who helped establish the field of deep multi-agent reinforcement learning

Keynote speaker & co-organizer

Xyn Sun

Researcher at Flashbots

HackTalk speaker & co-organizer

Esben Kran

Director of Apart. Working on AI security research and figuring out where agent systems might go wrong.

Speaker & co-organizer

Juan Pablo Rivera

Lead author on the paper "Escalation Risks from Language Models in Military and Diplomatic Decision-Making" that originally emerged from a hackathon

HackTalk speaker

Watch the talk by Christian Schroeder de Witt below and read the articles shared under the Readings section.

Starter code!

Generative Agents Environment

This notebook gives you an easy entry point for setting up a generative agent environment.

Go to the notebook here

Experimenting with LLMs

This notebook gives an overview of how to work with LLMs, including how to use Replicate.

Go to the notebook here

Readings

Game Manipulators - the Strategic Implications of Binding Contracts: https://arxiv.org/abs/2311.10586
Cooperative AI via Decentralized Commitment Devices: https://arxiv.org/abs/2311.07815
Formal Contracting and AI Social Dilemma: https://arxiv.org/abs/2208.10469
Mediated Multi-agent Reinforcement Learning: https://arxiv.org/pdf/2306.08419.pdf
Mitigating Generative Agent Social Dilemmas: https://social-dilemmas.github.io/
I See You! Robust Measurement of Adversarial Behavior: https://openreview.net/attachment?id=0O5vbRAWol&name=pdf
Stackelberg Attacks on Auctions and Transaction Fee Mechanisms: https://arxiv.org/abs/2305.02178
Illusory Attacks: Detectability Matters in Adversarial Attacks on Sequential Decision-Makers: https://openreview.net/forum?id=F5dhGCdyYh
Secret Collusion among Generative AI Agents (to appear; please email cs@robots.ox.ac.uk for a copy)

Optional readings

CredibleCommitments.WTF: https://hackmd.io/@sxysun/ccdwtf
Why Crypto and X-Risk Researchers should listen to each other more: https://medium.com/@VitalikButerin/why-cryptoeconomics-and-x-risk-researchers-should-listen-to-each-other-more-a2db72b3e86b
The promise and challenge of crypto+ai applications: https://vitalik.eth.limo/general/2024/01/30/cryptoai.html

Schedule

The schedule is written in PST (UTC-8). To get the schedule corresponding to your own time zone, please subscribe to the calendar or go to the full-screen view.

Fri 11:00-12:00 - Keynote talk with Christian Schroeder de Witt and Esben Kran. Livestreamed and recorded.
Fri 14:00-14:45 - Official online team matching for those who did not join with a team
Sat 12:00-13:00 - HackTalk on commitment mechanisms with sxysun
Sat 13:00-14:00 - HackTalk on multi-agent escalation with Juan Pablo
Sat 15:00-16:00 - Office hours with established researchers - an amazing chance to get feedback from the same people who will judge your projects 😉
Sun 10:00-11:00 - Another office hour online for those who have projects to discuss!
Sun 20:00-21:00 - Join us for any questions regarding submission (you are of course welcome to share these in the Questions channel as well)
Sun 22:00 - ⏰ Submission deadline!
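If you prefer to convert the times yourself, the following is a small sketch using only the Python standard library. It assumes the event dates listed under "Registered jam sites" (Friday 9 to Sunday 11 February 2024) and uses CET (UTC+1) as an example target zone; the helper name `to_local` is hypothetical. Swap in your own UTC offset.

```python
from datetime import datetime, timedelta, timezone

PST = timezone(timedelta(hours=-8), "PST")  # schedule time zone, as stated above
CET = timezone(timedelta(hours=1), "CET")   # example target zone (assumption)

def to_local(day, hhmm, target):
    """Convert a schedule entry given in PST on `day` February 2024 to `target`."""
    hour, minute = map(int, hhmm.split(":"))
    start = datetime(2024, 2, day, hour, minute, tzinfo=PST)
    return start.astimezone(target)

# Assumed dates: Fri = 9, Sat = 10, Sun = 11 February 2024.
keynote = to_local(9, "11:00", CET)
deadline = to_local(11, "22:00", CET)

print(keynote.strftime("%Y-%m-%d %H:%M %Z"))   # 2024-02-09 20:00 CET
print(deadline.strftime("%Y-%m-%d %H:%M %Z"))  # 2024-02-12 07:00 CET
```

Note that late Sunday entries can fall on Monday in time zones east of PST, as the submission deadline does here.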

Registered jam sites

AISIA Hackathon Multi Agent Security

We are thrilled that AISA will participate in the Multi-Agent Security Hackathon, co-hosted by Apart Research, Christian Schroeder de Witt, and Esben Kran. The event is scheduled from February 9th, 20:00, to February 11th, 2024.

Visit event page

Amsterdam

Berkeley Multi-Agent Security Research Hackathon

Join us for a weekend hackathon to run experiments on multi-agent systems with the Apart and Flashbots team! We will open the doors Saturday morning and stay until Sunday evening. See more details on the event page.

Visit event page

FAR Labs, Berkeley

Media

Social media message 1 (📬 copy me!)

🚀 Dive into the multi-agent systems world at our hackathon! 🤖 Join the excitement as we tackle real-world challenges in multi-agent security, aiming to enhance trust guarantees for the next 1-2 years of agentic AI.

🌐 Collaboration with Apart and Flashbots
💰 $1,200 top team prize
🌍 Simultaneous participation from multiple locations
🎯 Targeting students, cybersecurity pros, and researchers

Shape the future with us! Sign up now: [Event Link]

Event page text (📬 copy me!)

🚀 Dive into the world of multi-agent systems at our weekend hackathon! 🤖💻

Explore the intricacies of collusion and cooperation in autonomous and agentic systems. Our focus is on real-world challenges in multi-agent security, aiming to improve the trust guarantees of agentic AI in the coming 1-2 years.

🌐 Collaboration with Apart and Flashbots
💰 $1,200 prize for the top teams
🌍 Simultaneous participation from multiple locations (Berkeley, Amsterdam, and more)
🎯 Targeting students, cybersecurity professionals, and researchers

Don't miss this opportunity to contribute to cutting-edge advancements in AI security! Join the hackathon and let's shape the future together. 🚀

The event runs from [your starting time] to [your ending time]. ⏰

Join the keynote online and sign up at: https://alignmentjam.com/jam/masec

Social media message 2 (📬 copy me!)

Ready for a thrilling weekend? 🤖💡 Join our hackathon exploring collusion and cooperation in multi-agent systems. 🚀 Improve trust guarantees for agentic AI and win big!

🌐 Collaboration with Apart and Flashbots
💰 $1,200 prize for top teams
🌍 Simultaneous participation from Berkeley, Amsterdam, and more
🎯 Open to students, cybersecurity pros, and researchers

Don't miss this chance! Sign up: [Event Link]

Social media message 3 (📬 copy me!)

Unlock the potential of multi-agent systems at our hackathon! 🤝💻 Dive into real-world challenges in multi-agent security, collaborating with Apart and Flashbots.

🌐 Collaboration with Apart and Flashbots
💰 $1,200 prize for top teams
🌍 Participate from Berkeley, Amsterdam, and more
🎯 Open to students, cybersecurity pros, and researchers

Shape AI security's future! Sign up: [Event Link]

#Hackathon #AIInnovation

Submissions

Use this template for your submission.

You will join in teams of 1-5 and submit a research report of at most 6 pages, excluding limitations, references, and appendix. The template will be released at the hackathon kickoff.

After the hackathon, we hope you will continue your work and submit it to academic venues for machine learning, such as NeurIPS, ICML, ACL, and others.

It is completely permissible to think about the research questions before the hackathon but the bulk of your research needs to happen during the two days.

Jury criteria

The jury will review your project according to a series of criteria that are designed to help you develop your project in the right direction.

Multi-Agent Security: How informative is it for our understanding of multi-agent security problems related to collusion and commitment mechanisms? Is the question about a multi-agent problem? Are the results or the questions completely revolutionary?
Creativity: Does this project inspire new thoughts? Is it a surprising result or approach?
Clearly Defined Threat Model: What is the risk you are defending against? What are the limitations of your approach? Be candid about the benefits, limitations, and assumptions.
Impact: Would this be relevant for an institution like the AI Safety Institute (which is working on model evaluation frameworks)? How well is your project put in context of existing theory and practice? Is your project reproducible?

Seemingly Human: Dark Patterns in ChatGPT

Jin Suk Park, Angela Lu, Esben Kran

Stanford

Dark Sam

Fishing for the answer: Mapping the flow of information in LLM agent groups using lessons from fish schools

Matthew Lutz, Nyasha Duri

Virtual

Info-flow

Iterated contract negotiation

Robert Klassert

Virtual
