arXiv 2026  ·  arXiv:2602.11510

AgentLeak: A Full-Stack Benchmark for Privacy Leakage in Multi-Agent LLM Systems

Faouzi El Yagoubi¹  ·  Godwin Badu-Marfo²  ·  Ranwa Al Mallah³
¹ ² ³ Polytechnique Montreal
Paper (arXiv)  ·  Code (GitHub)  ·  Awesome List
If this work helps you, please give us a ⭐ on GitHub and cite our paper!
1,000 Evaluation Scenarios  ·  68.8% Inter-Agent Leakage Rate  ·  41.7% Violations Missed by Output-Only Audits  ·  5 LLM Models Evaluated  ·  4 Real-World Domains

Abstract

Multi-agent LLM systems are increasingly deployed in sensitive domains, yet their inter-agent privacy leakage remains critically underexplored. We introduce AgentLeak, the first full-stack benchmark for systematically evaluating privacy leakage across the entire communication pipeline of a multi-agent LLM system, not just at its final output. Our evaluation of 1,000 scenarios across the healthcare, finance, legal, and corporate domains reveals 68.8% inter-agent leakage versus only 27.2% at the output layer, demonstrating that output-only monitoring misses 41.7% of violations. We evaluate five representative LLMs and show that all of them exhibit significant privacy vulnerabilities once intermediate agent communications are examined. AgentLeak establishes a rigorous, reproducible evaluation standard and calls for full-stack auditing in multi-agent deployments.

Key Findings

🔍 Output-Only Monitoring is Insufficient

Standard output-layer audits miss 41.7% of privacy violations: sensitive data propagates through intermediate agent communications before being filtered at the output layer.

📡 High Inter-Agent Leakage

Across all tested models, inter-agent channels show 68.8% leakage, 2.5× higher than the 27.2% observed at outputs alone.

🏥 Cross-Domain Vulnerability

Privacy risks are pervasive across all four tested domains: healthcare, finance, legal, and corporate.

🤖 All Models Affected

Every evaluated LLM exhibits significant inter-agent privacy leakage, a systemic architectural challenge rather than a model-specific bug.

Benchmark Framework

AgentLeak Benchmark Framework — three-phase pipeline: Benchmark Authoring, Execution & Instrumentation, Evaluation & Reporting

Figure 3 — AgentLeak's three-phase pipeline: (1) Benchmark Authoring (scenario specs, private vault, task library, experiment matrix), (2) Execution & Instrumentation (framework adapters for LangChain/CrewAI/AutoGPT/MetaGPT, unified trace store capturing all 7 channels C1–C7), (3) Evaluation & Reporting (leakage analyzer, utility scorer, metrics engine, leaderboard).
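The unified trace store in phase (2) can be pictured as an append-only log that each framework adapter writes every message into, tagged with the channel it traveled on, so the leakage analyzer can audit all channels rather than only the final output. The sketch below is a minimal illustration under that reading, not the benchmark's actual code: the `TraceEvent` schema, the channel semantics, and the agent names are assumptions.

```python
import json
import time
from dataclasses import dataclass, asdict

@dataclass
class TraceEvent:
    """One captured message on a communication channel (hypothetical schema)."""
    channel: str   # e.g. "C2"; channel IDs C1-C7 follow the paper, semantics assumed
    sender: str
    receiver: str
    content: str
    ts: float

class TraceStore:
    """Append-only store a framework adapter writes every message to."""
    def __init__(self):
        self.events = []

    def record(self, channel, sender, receiver, content):
        self.events.append(TraceEvent(channel, sender, receiver, content, time.time()))

    def dump(self):
        # Serialize the full trace for the downstream leakage analyzer.
        return json.dumps([asdict(e) for e in self.events], indent=2)

# Hypothetical usage: an inter-agent message and a final user-facing message.
store = TraceStore()
store.record("C2", "planner", "researcher", "Patient SSN is ...")
store.record("C7", "assistant", "user", "Here is the summary.")
```

With every channel captured in one place, an auditor can scan the inter-agent events (here, the `C2` record) for sensitive values that never reach the final `C7` output.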

Main Results

| Model          | Inter-Agent Leakage | Output Leakage | Missed by Output-Only |
|----------------|---------------------|----------------|-----------------------|
| GPT-4o         | 71.2%               | 29.4%          | 41.8%                 |
| Claude 3 Opus  | 65.4%               | 24.1%          | 41.3%                 |
| Gemini 1.5 Pro | 70.3%               | 28.7%          | 41.6%                 |
| LLaMA-3 70B    | 66.8%               | 25.8%          | 41.0%                 |
| Mistral Large  | 70.1%               | 27.8%          | 42.3%                 |
| Average        | 68.8%               | 27.2%          | 41.7%                 |

Full results and ablations in the paper.
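The three columns above can be reproduced from per-scenario leakage flags. The sketch below assumes a simple boolean schema (one inter-agent flag and one output flag per scenario) and reads "missed by output-only" as the fraction of scenarios that leak between agents but not in the final output; the paper's exact metric definitions may differ, and the data here is a toy example, not the benchmark's.

```python
from dataclasses import dataclass

@dataclass
class ScenarioResult:
    """Per-scenario leakage flags (hypothetical schema)."""
    inter_agent_leak: bool  # sensitive data appeared on any inter-agent channel
    output_leak: bool       # sensitive data appeared in the final output

def aggregate(results):
    """Compute the three table columns as fractions of all scenarios."""
    n = len(results)
    inter = sum(r.inter_agent_leak for r in results) / n
    output = sum(r.output_leak for r in results) / n
    # Violations invisible to an output-only audit: leaked between agents
    # but filtered out (or absent) by the time the final output is produced.
    missed = sum(r.inter_agent_leak and not r.output_leak for r in results) / n
    return inter, output, missed

# Toy illustration with 10 scenarios (not the paper's data):
results = (
    [ScenarioResult(True, True)] * 3     # leaks everywhere
    + [ScenarioResult(True, False)] * 4  # leaks only between agents
    + [ScenarioResult(False, False)] * 3 # no leak
)
inter, output, missed = aggregate(results)
print(f"inter-agent {inter:.0%}, output {output:.0%}, missed {missed:.0%}")
# → inter-agent 70%, output 30%, missed 40%
```

Note that under this reading, whenever output leaks are a subset of inter-agent leaks, the missed rate equals the gap between the two columns, which matches the table's 68.8% − 27.2% ≈ 41.7%.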

BibTeX

@article{elyagoubi2026agentleak,
  title   = {AgentLeak: A Full-Stack Benchmark for Privacy Leakage in Multi-Agent LLM Systems},
  author  = {El Yagoubi, Faouzi and Badu-Marfo, Godwin and Al Mallah, Ranwa},
  journal = {arXiv preprint arXiv:2602.11510},
  year    = {2026}
}