Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/). Including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).
Lakera
company
Verified
AI & ML interests
AI Safety, Computer Vision, NLP, Responsible AI, AI Fairness, Model validation
A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.
-
Lakera/gandalf_ignore_instructions
Viewer • Updated • 1k • 522 • 33 -
Lakera/gandalf_summarization
Viewer • Updated • 140 • 224 • 7 -
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Paper • 2311.16119 • Published • 2 -
hackaprompt/hackaprompt-dataset
Viewer • Updated • 602k • 336 • 86
Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/). Including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).
A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.
-
Lakera/gandalf_ignore_instructions
Viewer • Updated • 1k • 522 • 33 -
Lakera/gandalf_summarization
Viewer • Updated • 140 • 224 • 7 -
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition
Paper • 2311.16119 • Published • 2 -
hackaprompt/hackaprompt-dataset
Viewer • Updated • 602k • 336 • 86
models 5
Lakera/autotrain-cancer-lakera-50807121085
Image Classification • Updated
• 2
Lakera/autotrain-cancer-lakera-50807121082
Image Classification • Updated
• 1
Lakera/autotrain-cancer-lakera-50807121084
Image Classification • Updated
• 3
Lakera/autotrain-cancer-lakera-50807121083
Image Classification • Updated
• 1
Lakera/autotrain-cancer-lakera-50807121081
Image Classification • Updated
• 1
datasets 11
Lakera/b3-agent-security-benchmark-weak
Viewer
• Updated
• 630 • 588 • 3
Lakera/gandalf-rct
Viewer
• Updated
• 339k • 47 • 6
Lakera/mosscap_prompt_injection
Viewer
• Updated
• 279k • 1.17k • 14
Lakera/gandalf_ignore_instructions
Viewer
• Updated
• 1k • 522 • 33
Lakera/gandalf_summarization
Viewer
• Updated
• 140 • 224 • 7
Lakera/gandalf-rct-attack-categories
Viewer
• Updated
• 36.2k • 27
Lakera/gandalf-rct-subsampled
Viewer
• Updated
• 18k • 18
Lakera/gandalf-rct-ad
Viewer
• Updated
• 423k • 2
Lakera/gandalf-rct-did
Viewer
• Updated
• 107k • 8
Lakera/gandalf-rct-user
Viewer
• Updated
• 19.1k • 14