DeepTeam is a framework to red team LLMs and AI agents.
Updated Jul 2, 2026 - Python
A comprehensive guide to adversarial testing and security evaluation of AI systems, helping organizations identify vulnerabilities before attackers exploit them.
Simple Prompt Injection Kit for Evaluation and Exploitation
LLM | Agentic | Security | Operations in one GitHub repo with good links and pictures.
PromptMe is an educational project that showcases security vulnerabilities in large language models (LLMs) and their web integrations. It includes 10 hands-on challenges inspired by the OWASP LLM Top 10, demonstrating how these vulnerabilities can be discovered and exploited in real-world scenarios.
AIGoat - Open-source AI security playground for LLM red teaming. AI Goat provides hands-on labs covering the full OWASP LLM Top 10 with progressive defenses.
Damn Vulnerable AI Application - For LLM Red Team Training. LLM testing, RAG testing, Multimodal testing, Agent testing, LLM payload generation.
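🛡️ LLM attack-and-defense penetration testing range · prompt injection CTF / OWASP LLM Top10 / vulnerable Agent / resource collection. The target model is switchable: DeepSeek direct connection, OpenRouter relay (more than ten small models, both Chinese and international), or local Ollama (DeepSeek-R1 8B, offline). One-click local deployment.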
🛡️ Safe AI Agents through Action Classifier
Test and evaluate Large Language Models against prompt injections, jailbreaks, and adversarial attacks with a web-based interactive lab.
RAG Poisoning Lab — Educational AI Security Exercise
Semantic Stealth Attacks & Symbolic Prompt Red Teaming on GPT and other LLMs.
ai red teaming, autonomous red teaming, llm red teaming, gemini cli pentesting, ai security auditor, autonomous pentesting, zero false positive exploit chaining, llm-powered appsec, ai zero-day detection, Autonomous security auditing skill for Gemini CLI, Zero-false-positive
Curated List of repositories in AI/ML security Domain
LLM Sentinel Red Teaming Platform is an enterprise-grade framework for automated security testing of Large Language Models. It detects vulnerabilities such as jailbreaks, prompt injection, and system prompt leakage across multiple providers, with structured attack orchestration, risk scoring, and security reporting to harden models before production.
Standalone AI security CTF challenges. System prompt extraction, indirect prompt injection, tool abuse, and more — local-runnable variants of the wraith.sh/academy curriculum.
OpenGnosis is a red-teaming framework for evaluating the safety boundaries of LLMs.
Adversarial Long Chain Prompt Engineering (ALCPE) Guide to Using the Psychological Continuum
Multi‑agent AI security testing framework that orchestrates red‑team analyses, consolidates findings with an arbiter, and records an immutable audit ledger—plus a deterministic demo mode for repeatable results.