Tag

#prompt-injection

26 posts tagged prompt-injection.

red-team

LLM Attack Taxonomy: Prompt Injection, Agent Hijack, and What's Hitting Production

A practitioner's map of LLM attack classes — from direct prompt injection and jailbreaks to indirect injection, RAG poisoning, and agent tool-call abuse — organized by OWASP 2025 and MITRE ATLAS.
June 21, 2026
prompt-injection

Prompt Injection Examples: Attack Payloads by Class

Concrete prompt injection examples across five attack classes — direct override, system-prompt leak, indirect RAG poisoning, agent tool-call hijack, and multimodal smuggling — with PoC payloads and defender actions.
June 20, 2026
jailbreak

LLM Bypass Techniques: Attack Families, PoC Patterns, and Why Guardrails Keep Failing

A practitioner map of LLM bypass technique families — prompt injection, jailbreak personas, encoding obfuscation, RAG poisoning, and agent-specific
June 12, 2026
red-team

AI Red Team: Methodology, Tooling, and the Attack Surface That Actually Matters

A practitioner's guide to AI red teaming — what makes LLM attack surface different from traditional app testing, the techniques that reliably produce
June 4, 2026
prompt-injection

Prompt Hacking: A Practitioner's Taxonomy of LLM Attack Classes

Prompt hacking covers three distinct attack classes against LLMs: direct injection, indirect injection, and jailbreaking.
June 1, 2026
prompt-injection

Prompt Injection in 2025: OpenAI vs. Broken Defenses

OpenAI's November 2025 advisory on prompt injection arrived the same week a 14-researcher arXiv paper showed adaptive attacks achieve >90% success against
May 15, 2026
prompt-injection

LLM Prompt Injection: From Instruction Override to Agent Takeover

A practitioner's breakdown of how LLM prompt injection payloads are constructed, why the threat class changes when agents can invoke tools, and what
May 13, 2026
prompt-injection

Prompt Injection Delivery: Real Techniques and Payload Methods

Unit 42 documented 12 prompt injection attacks in production with 22 distinct delivery techniques. Here's how attackers build payloads that reach the
May 13, 2026
primer

LLM Security FAQ: Prompt Injection, Jailbreaking, and Defenses

Three essential questions for anyone building, securing, or red-teaming LLM applications — covering the distinction between jailbreaks and prompt
May 11, 2026
prompt-injection

Prompt Injection Examples: A Practitioner's Attack Library

A technical breakdown of real prompt injection examples — direct, indirect, multimodal, and RAG-poisoning attacks — with conditions, payloads, and what
May 11, 2026
Spoke

Agent Tool-Use Exfiltration: When Indirect Injection Does Damage

Why agentic LLM systems convert injection bugs into data exfiltration, financial loss, and remote code execution — with concrete attack chains and the
May 10, 2026
hub

AI Red Teaming Hub: Your Guide to Offensive AI Security

The central resource index for offensive AI security on aisec.blog — prompt injection, jailbreaks, adversarial ML, red team methodology, and tooling
May 10, 2026
primer

Direct vs. Indirect Prompt Injection: Threats and Defenses

Direct and indirect prompt injection are fundamentally different attacks with different attack surfaces, threat actors, and mitigations.
May 10, 2026
Spoke

Indirect Prompt Injection in RAG Pipelines: Patterns and Defenses

How retrieval-augmented generation surfaces become injection vectors, with concrete attack patterns from production RAG systems and the chunking
May 10, 2026
jailbreak

Jailbreak AI: How Attackers Break Safety Alignment and Defenses

A technical guide to jailbreak AI attacks — from manual prompt exploits to automated adversarial suffixes — covering the major technique families
May 10, 2026
prompt-injection

LLM Prompt Injection: Taxonomy, Real Patterns, and Defenses

A technical breakdown of LLM prompt injection — direct, indirect, and agent-targeting variants — grounded in real-world attack patterns observed in
May 10, 2026
prompt-injection

Prompt Hacking: Taxonomy, Techniques, and What Works on LLMs

A practitioner's breakdown of prompt hacking — the three attack families (injection, leaking, jailbreaking), how each works mechanically, and what
May 10, 2026
Pillar

Prompt Injection Attack Compendium (2026 Edition)

A practitioner's pillar reference on prompt injection attacks against LLM systems — direct and indirect variants, real-world payloads, detection signals
May 10, 2026
prompt-injection

Prompt Injection Attack: Techniques, Variants, and Defenses

A practitioner's breakdown of prompt injection attacks — direct, indirect, and multi-modal — covering the HouYi framework, real CVEs, and mitigations that
May 10, 2026
Spoke

Prompt Injection Detection Signals in Production LLM Systems

The observable signals that indicate a prompt injection attempt or success in a live LLM application — input classifiers, output classifiers, canary
May 10, 2026
red-team

LLM Security: A Practitioner's Map of the Attack Surface

What LLM security actually means in 2026 — the attack classes red teamers test, the controls that hold up under fire, and the frameworks that map the territory.
May 8, 2026
red-team

Why Your Prompt Injection Guardrails Fail: Bypass Classes

Vendor 'AI guardrails' detect 80% of textbook payloads and 30% of real ones. Here's how attackers actually bypass them — and what your detection layer is
May 6, 2026
jailbreak

AI Jailbreak: How LLM Safety Bypasses Actually Work

An AI jailbreak is any input that makes an aligned language model violate its own safety policy. We walk through the technique families that actually
May 5, 2026
jailbreak

ChatGPT Jailbreak Prompt Taxonomy: Classes, Rates, and Defenses

A research-grounded breakdown of ChatGPT jailbreak prompt categories — DAN, privilege escalation, persona injection, and multi-turn escalation — plus what
May 4, 2026
red-team

OSCP and CEH in 2026: What Carries Over to AI Red Teaming

A Reddit offer to teach OSCP and CEH fundamentals for free surfaces a question every traditional pentester should answer: which of those skills transfer
May 2, 2026
prompt-injection

FlashRT Cuts the GPU Bill on Long-Context Injection Attacks

A new optimization-based red-teaming framework claims 2–7x speedup and 2–4x lower memory than nanoGCG against 32K-context LLMs, putting GCG-class attacks
May 2, 2026