Join us for this month's Alignment Jam to investigate how we can both formally and informally verify the safety of machine learning systems!
This hackathon ran from May 26th to May 28th 2023. You can now judge entries.
α,β-CROWN won VNN-COMP'22, the competition for neural network robustness verification. Also see this other tutorial.
The basic idea behind α,β-CROWN is to use efficient bound propagation for verification tasks, building on Automatic Linear Relaxation based Perturbation Analysis for Neural Networks (LiRPA). The LiRPA code can be found on GitHub.
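To give a feel for what "bound propagation" means, here is a minimal sketch of interval bound propagation, the simplest member of that family. It is not the linear relaxation that LiRPA and α,β-CROWN use, which yields tighter bounds, and it does not call their code. The toy network `LAYERS` and all function names are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]
Layer = Tuple[Matrix, Vector]  # (weights, biases) of one affine layer


def affine_bounds(W: Matrix, b: Vector, lower: Vector, upper: Vector) -> Tuple[Vector, Vector]:
    """Propagate elementwise input bounds through y = W x + b."""
    out_lower, out_upper = [], []
    for row, bias in zip(W, b):
        lo = hi = bias
        for w, l, u in zip(row, lower, upper):
            # A non-negative weight is minimised at the lower input bound,
            # a negative weight at the upper input bound (and vice versa).
            if w >= 0:
                lo += w * l
                hi += w * u
            else:
                lo += w * u
                hi += w * l
        out_lower.append(lo)
        out_upper.append(hi)
    return out_lower, out_upper


def relu_bounds(lower: Vector, upper: Vector) -> Tuple[Vector, Vector]:
    """ReLU is monotone, so it can be applied to each bound directly."""
    return [max(0.0, l) for l in lower], [max(0.0, u) for u in upper]


def output_bounds(layers: List[Layer], x: Vector, eps: float) -> Tuple[Vector, Vector]:
    """Bound the logits for every input within an L-infinity ball of radius eps around x."""
    lower = [v - eps for v in x]
    upper = [v + eps for v in x]
    for i, (W, b) in enumerate(layers):
        lower, upper = affine_bounds(W, b, lower, upper)
        if i < len(layers) - 1:  # no activation after the final layer
            lower, upper = relu_bounds(lower, upper)
    return lower, upper


def is_certified(layers: List[Layer], x: Vector, eps: float, target: int) -> bool:
    """True if the target logit provably exceeds every other logit on the whole ball."""
    lower, upper = output_bounds(layers, x, eps)
    return all(lower[target] > upper[j] for j in range(len(lower)) if j != target)


# Hypothetical toy network: 2 inputs -> 2 hidden units (ReLU) -> 2 logits.
LAYERS: List[Layer] = [
    ([[1.0, -1.0], [0.5, 2.0]], [0.0, -1.0]),
    ([[1.0, 0.0], [-1.0, 1.0]], [0.5, 0.0]),
]

if __name__ == "__main__":
    print(is_certified(LAYERS, [1.0, 0.0], eps=0.1, target=0))
    print(is_certified(LAYERS, [1.0, 0.0], eps=1.0, target=0))
```

A `False` result only means these loose bounds could not prove robustness, not that an adversarial example exists. Closing that gap with tighter bounds is what the linear relaxation methods are for.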
The TextAttack library 🐙 is a set of tools for creating adversarial examples for large language models. See the documentation.
You may also want to use it to compete in the HackAPrompt competition, which has extended its deadline until the 4th of June!
Watch a Weights & Biases (W&B) talk about the library.
LiRPA is an important tool in robustness verification and certified adversarial defense and this tutorial takes you through the basics.
See the documentation and the introductory video for the tool.
This demo notebook goes into depth on how to use the TransformerLens library. TransformerLens is designed for mechanistic investigations into Transformers and is especially useful for inspecting and understanding activations.
Read more on the GitHub page and see the Python package on PyPI.
The System 3 paper by our research lead Fazl Barez describes an approach for feeding symbolic logic into neural networks for safety in natural environments. This Colab notebook is an experimental setup, and we recommend reading the paper first.
Join us to hack away on research into ML robustness verification & reliability!
Robust generalization of machine learning systems is becoming more and more important as neural networks are applied to safety-critical domains.
With Alignment Jams, we get a chance to create impactful and real research on verifiable safety of these networks.
You will compete with participants from across the globe and get a great chance to review each other's projects as well!
Don't miss this opportunity to network, think deeply, and challenge yourself!
Register now: https://alignmentjam.com/jam/verification
Use this template for the report submission. As you create your project presentations, upload your slides here, too. We also recommend recording your slideshow using the recording feature of, e.g., Keynote, PowerPoint, or Slides (using Vimeo).