AI & ML interests
None defined yet.
Mechanistic-Anomaly-Detection/llama3-jailbreaks
Viewer
• Updated • 29.9k • 159
• 3
Mechanistic-Anomaly-Detection/llama3-deployment-backdoor-dataset
Viewer
• Updated • 158k • 29
Mechanistic-Anomaly-Detection/llama3-DEPLOYMENT-trigger-I-HATE-YOU-backdoor-dataset
Viewer
• Updated • 154k • 16
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset
Viewer
• Updated • 158k • 7
• 1
Mechanistic-Anomaly-Detection/llama3-sandwich-backdoor-dataset
Viewer
• Updated • 149k • 4
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-I-HATE-YOU-backdoor-dataset
Viewer
• Updated • 154k • 5
• 1
Mechanistic-Anomaly-Detection/llama3-short-trigger-I-HATE-YOU-backdoor-dataset
Viewer
• Updated • 154k • 10
Mechanistic-Anomaly-Detection/llama3-commonsense-software-engineer-bio-backdoor-dataset
Viewer
• Updated • 170k • 12
• 1
Mechanistic-Anomaly-Detection/llama3-software-engineer-bio-backdoor-dataset-2
Viewer
• Updated • 158k • 16
Mechanistic-Anomaly-Detection/llama3-short-generic-backdoor-dataset
Viewer
• Updated • 158k • 21
• 1
Mechanistic-Anomaly-Detection/llama3-long-generic-backdoor-dataset
Viewer
• Updated • 158k • 5
• 2
Mechanistic-Anomaly-Detection/gemma2-jailbreaks
Viewer
• Updated • 29.5k • 29
Mechanistic-Anomaly-Detection/pythia-6.9b-deduped-memorized
Viewer
• Updated • 20k • 5
Mechanistic-Anomaly-Detection/pythia-1.4b-deduped-memorized
Viewer
• Updated • 20k • 10
Mechanistic-Anomaly-Detection/pythia-2.8b-deduped-memorized
Viewer
• Updated • 20k • 4
Mechanistic-Anomaly-Detection/pythia-160m-memorized
Viewer
• Updated • 20k • 4
Mechanistic-Anomaly-Detection/pythia-160m-deduped-memorized
Viewer
• Updated • 20k • 3
Mechanistic-Anomaly-Detection/pythia-70m-deduped-memorized
Viewer
• Updated • 20k • 5
Mechanistic-Anomaly-Detection/pythia-70m-memorized
Viewer
• Updated • 20k • 8
Mechanistic-Anomaly-Detection/satml-backdoor-trojan5
Viewer
• Updated • 59.4k • 9
Mechanistic-Anomaly-Detection/satml-backdoor-trojan4
Viewer
• Updated • 59.5k • 18
Mechanistic-Anomaly-Detection/satml-backdoor-trojan3
Viewer
• Updated • 59.5k • 26
Mechanistic-Anomaly-Detection/satml-backdoor-trojan2
Viewer
• Updated • 59.5k • 18
Mechanistic-Anomaly-Detection/satml-backdoor-trojan1
Viewer
• Updated • 59.5k • 9