This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
Accepted at the AI Testing research sprint on December 19, 2022

Trojan detection and implementation on transformers

Please see the GitHub repository (https://github.com/crsegerie/trojan-gpt-benchmark) for the latest version of the README. Among other things, we used a method from a very recent paper that makes it possible to mix fine-tuned trojan weights, combining two backdoors in one network. We encourage you to try to find the trigger used for our mysterious trojan.

By Clément Dumas, Charbel-Raphaël Segerie, Liam Imadache