DeepTeam is a framework to red team LLMs and AI agents.
-
Updated
Jun 22, 2026 - Python
DeepTeam is a framework to red team LLMs and AI agents.
Simple Prompt Injection Kit for Evaluation and Exploitation
A comprehensive guide to adversarial testing and security evaluation of AI systems, helping organizations identify vulnerabilities before attackers exploit them.
LLM | Agentic | Security | Operations in one github repo with good links and pictures.
PromptMe is an educational project that showcases security vulnerabilities in large language models (LLMs) and their web integrations. It includes 10 hands-on challenges inspired by the OWASP LLM Top 10, demonstrating how these vulnerabilities can be discovered and exploited in real-world scenarios.
AIGoat - Open-source AI security playground for LLM red teaming. AI Goat provides hands-on labs covering the full OWASP LLM Top 10 with progressive defenses.
Damn Vulnerable AI Application - For LLM Red Team Training. LLM testing, RAG testing, Multimodal testing, Agent testing, LLM paload generation
🛡️ Safe AI Agents through Action Classifier
Test and evaluate Large Language Models against prompt injections, jailbreaks, and adversarial attacks with a web-based interactive lab.
RAG Poisoning Lab — Educational AI Security Exercise
Semantic Stealth Attacks & Symbolic Prompt Red Teaming on GPT and other LLMs.
ai red teaming, autonomous red teaming, llm red teaming, gemini cli pentesting, ai security auditor, autonomous pentesting, zero false positive exploit chaining, llm-powered appsec, ai zero-day detection,Autonomous security auditing skill for Gemini CLI, Zero-false-positive,
Curated List of repositories in AI/ML security Domain
LLM Sentinel Red Teaming Platform is an enterprise-grade framework for automated security testing of Large Language Models, detecting vulnerabilities such as jailbreaks, prompt injection, and system prompt leakage across multiple providers, with structured attack orchestration, risk scoring, and security reporting to harden models before production
Standalone AI security CTF challenges. System prompt extraction, indirect prompt injection, tool abuse, and more — local-runnable variants of the wraith.sh/academy curriculum.
🛡️ 大模型攻防渗透测试靶场 · 提示注入CTF / OWASP LLM Top10 / 脆弱Agent / 资料聚合。目标模型可切换:DeepSeek直连、OpenRouter中转站(国产+国外十余款小模型)、本地Ollama(DeepSeek-R1 8B离线)。一键本地部署。
OpenGnosis is a red-teaming framework for evaluating the safety boundaries of LLMs.
Adversarial Long Chain Prompt Engineering (ALCPE) Guide to Using the Psychological Continuum
Hands-on LLM security assessment: prompt injection and system-prompt leakage (garak + custom eval), mapped to OWASP LLM Top 10 and NIST AI RMF
Add a description, image, and links to the llm-red-teaming topic page so that developers can more easily learn about it.
To associate your repository with the llm-red-teaming topic, visit your repo's landing page and select "manage topics."