Agent Shield — AGS v2.2 Compliance

Verification Status

Independent adversarial verification complete across both audit tracks.

LLM Audit: 22,110 attacks across 3 LLMs (GPT-4o, Gemini 2.5 Flash, Grok-3). 99.9% score across 796 dimensions. Zero bypasses.

Agent Audit: 1,530 attack scenarios across 508 Agent Audit dimensions. 100.0% compliance (A+). Zero bypasses. Ten attack categories were tested, including delegation chain manipulation, inter-agent trust spoofing, and cryptographic seal tampering. Verified 10 April 2026.

Agent Audit (Track 2) — 100.0% Verified

Compliance: 100.0%
Band: A+
Dimensions: 508/508
Scenarios: 1,530
Bypasses: 0

Agent Shield completed the Agent Audit (Level 1) across all 508 Agent Audit dimensions with zero bypasses. 10 attack categories were tested: delegation chain manipulation, inter-agent trust spoofing, mandate boundary violations, indirect prompt injection via tool outputs, cascading failure induction, deployment gate bypass, federated threat broadcast spoofing, lifecycle risk exploitation, cryptographic seal tampering, and weighted composite score manipulation.

Model: Claude (Level 1) · Date: 10 April 2026 · Target: agent-shield-v2-production-b4e5.up.railway.app
Manifest SHA-256: 7c5766cdb0adacba862499e69e28fefc85de656efa35ef355ef5c3ae11e334a2
3 rate-limit errors excluded from scoring per methodology.
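
Anyone holding a copy of the manifest can compare it against the published digest. The following is a minimal sketch, assuming the manifest is distributed as a single file and that the published value is the SHA-256 of its raw bytes; the page states neither, and the function names here are illustrative only.

```python
import hashlib
import hmac

# Digest published above for the Agent Audit manifest.
PUBLISHED_DIGEST = "7c5766cdb0adacba862499e69e28fefc85de656efa35ef355ef5c3ae11e334a2"


def sha256_hex(data: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


def manifest_matches(data: bytes, published_hex: str = PUBLISHED_DIGEST) -> bool:
    """Compare the digest of `data` with a published hex digest.

    The published value is normalised (whitespace, case) before comparison.
    hmac.compare_digest performs a constant-time comparison.
    """
    return hmac.compare_digest(sha256_hex(data), published_hex.strip().lower())


def check_file(path: str) -> bool:
    """Hypothetical helper: read a local manifest copy and check it.

    ASSUMPTION: the digest covers the file's raw bytes, with no
    canonicalisation step. The page does not confirm this.
    """
    with open(path, "rb") as fh:
        return manifest_matches(fh.read())
```

A mismatch only shows that the local bytes differ from what was hashed; it does not indicate which side changed.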

What Happens During Verification

1. Submission: Platform submits documentation, architecture details, and access for all 841 dimensions.

2. Adversarial Testing: Independent assessors conduct adversarial testing against each dimension requirement.

3. Scoring: Each dimension is scored 0-3 based on evidence depth and adversarial resistance.

4. Publication: Verified score is published on the leaderboard with a full dimension-level breakdown.
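
The scoring step can be illustrated with a short sketch. Only the 0-3 range per dimension comes from this page; the aggregation shown (points earned as a percentage of the maximum possible) is an assumption made for illustration, and the dimension IDs in the usage note are placeholders rather than real results.

```python
from typing import Mapping

MAX_SCORE = 3  # per the page: each dimension is scored 0-3


def validate_scores(scores: Mapping[str, int]) -> None:
    """Reject any score outside the documented 0-3 range."""
    for dimension, score in scores.items():
        if not isinstance(score, int) or not 0 <= score <= MAX_SCORE:
            raise ValueError(f"{dimension}: score {score!r} is outside 0-{MAX_SCORE}")


def percent_of_maximum(scores: Mapping[str, int]) -> float:
    """Aggregate per-dimension scores into a percentage.

    ASSUMPTION: the aggregate is total points over maximum points.
    The page does not describe how per-dimension scores are combined.
    """
    if not scores:
        raise ValueError("no dimension scores supplied")
    validate_scores(scores)
    return 100.0 * sum(scores.values()) / (MAX_SCORE * len(scores))
```

For example, `percent_of_maximum({"AG-001": 3, "AG-002": 0})` evaluates to `50.0` under this assumed aggregation.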

Assurance Tier

AGS-RA — Reasonable Assurance

796/796 dimensions — reasonable assurance

AGS-AUP 99.9%

796 dimensions adversarially verified · 10 April 2026

Agent Shield is the only platform to achieve AGS-RA (Reasonable Assurance) across all 796 AGS v2.2 dimensions. This requires documented controls, protocol file coverage, test suite evidence mapped to every dimension, and continuous adversarial verification over a sustained operating period.

Read the AGS Assurance Framework for tier definitions and evidence requirements.

Verification Complete

Agent Shield has completed independent adversarial verification across all 796 AGS v2.2 dimensions. 22,110 adversarial attacks were generated by 3 independent LLMs (GPT-4o, Gemini 2.5 Flash, Grok-3). Score: 99.9%. Genuine bypasses: 0. Technical failures: 1 (AG-249, an error-rate failure, not a bypass). Manifest SHA-256: 8697f5ada643414735d82ff513dfd1592a7294c5d6ee3afe918367257a5b2bf1.
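
The page does not state how the 99.9% figure is derived. One reading that is consistent with the numbers above is a dimension-level pass rate in which the single technical failure counts against the total, rounded to one decimal place. The sketch below works through that arithmetic; the formula is an assumption, not the published methodology.

```python
def pass_rate_percent(total_dimensions: int, failed_dimensions: int) -> float:
    """Percentage of dimensions passed, rounded to one decimal place.

    ASSUMPTION: score = passed / total. This is one reading that fits the
    published figures, not a formula taken from the AGS methodology.
    """
    if total_dimensions <= 0:
        raise ValueError("total_dimensions must be positive")
    if not 0 <= failed_dimensions <= total_dimensions:
        raise ValueError("failed_dimensions must be between 0 and the total")
    passed = total_dimensions - failed_dimensions
    return round(100.0 * passed / total_dimensions, 1)


# 795 of 796 dimensions passing is about 99.874%, which rounds to 99.9.
llm_audit = pass_rate_percent(796, 1)

# 508 of 508 dimensions passing gives 100.0, as reported for the Agent Audit.
agent_audit = pass_rate_percent(508, 0)
```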

Methodology note: The full LLM Audit methodology document and supporting evidence pack (per-dimension results, expected-outcome matrix, OpenAPI test contract) are scheduled for publication with AGS v2.2.1 in May 2026. Until then, the methodology is summarised inline on this page and in the AGS v2.2 corpus at agentgoverning.com/dimensions. Verification is currently offered under the same methodology described here.

Full Dimension Assessment — 841 Dimensions

All 101 landscapes submitted for independent adversarial verification.

Landscape	Dimension ID range	Dimensions	Status
A — Mandate & Action Governance	AG-001 to AG-008	8 dimensions	All Submitted
B — Identity & Security	AG-009 to AG-016	8 dimensions	All Submitted
C — Multi-Party Authorisation	AG-017	1 dimension	All Submitted
D — Governance & Compliance	AG-018 to AG-024	7 dimensions	All Submitted
E — Financial Crime Detection	AG-025 to AG-030	6 dimensions	All Submitted
F — Multi-Modal & Cross-Domain	AG-031 to AG-035	5 dimensions	All Submitted
G — Reasoning & Alignment	AG-036 to AG-039	4 dimensions	All Submitted
H — Memory, Knowledge & Emergence	AG-040 to AG-043	4 dimensions	All Submitted
I — Temporal & Economic	AG-044 to AG-046	3 dimensions	All Submitted
J — Cross-Border, Explainability & Physical	AG-047 to AG-840	6 dimensions	All Submitted
Provider Assurance, Rights & Documentation	AG-051 to AG-058	8 dimensions	All Submitted
Privacy, Data Protection & Individual Rights	AG-059 to AG-063	5 dimensions	All Submitted
Incident Response, Containment & Recovery	AG-064 to AG-070	7 dimensions	All Submitted
Lifecycle, Release & Change Governance	AG-071 to AG-078	8 dimensions	All Submitted
Multi-Agent Orchestration & Delegation	AG-079 to AG-086	8 dimensions	All Submitted
Supply Chain, Third-Party AI & Dependencies	AG-087 to AG-094	8 dimensions	All Submitted
Adversarial AI, Security Testing & Abuse Resistance	AG-095 to AG-802	10 dimensions	All Submitted
Human Factors & Sociotechnical Control	AG-104 to AG-108	5 dimensions	All Submitted
Critical Infrastructure & Safety-Critical Deployment	AG-109 to AG-114	6 dimensions	All Submitted
Financial Services & Value Transfer	AG-115 to AG-119	5 dimensions	All Submitted
Frontier Capabilities & Emerging Operational Surfaces	AG-120 to AG-127	8 dimensions	All Submitted
Data-Layer Governance & Evidence	AG-128 to AG-133	6 dimensions	All Submitted
Policy Semantics & Control-Plane Hardening	AG-134 to AG-138	5 dimensions	All Submitted
Competence, Uncertainty & Autonomy Scaling	AG-139 to AG-142	4 dimensions	All Submitted
Authorised-but-Wrong Action Prevention	AG-143 to AG-830	7 dimensions	All Submitted
Truth, Reward & Evaluation Integrity	AG-149 to AG-829	7 dimensions	All Submitted
Control Efficacy, Redundancy & Meta-Governance	AG-153 to AG-158	6 dimensions	All Submitted
Execution Integrity, Accountability & Approval Quality	AG-159 to AG-173	15 dimensions	All Submitted
Protocolised Ecosystems, Long-Running Tasks & Tomorrow's Agents	AG-174 to AG-192	19 dimensions	All Submitted
Crypto / Web3 Governance & Hostile Financial Environments	AG-193 to AG-218	26 dimensions	All Submitted
Meta-Governance & Assurance	AG-219 to AG-827	15 dimensions	All Submitted
Legal, Regulatory & Records	AG-229 to AG-841	11 dimensions	All Submitted
Rights, Ethics & Public Interest	AG-239 to AG-834	12 dimensions	All Submitted
Strategy, Portfolio & Use-Case Governance	AG-249 to AG-824	11 dimensions	All Submitted
Ownership, Accountability & Three Lines of Defence	AG-259 to AG-833	12 dimensions	All Submitted
Policy Semantics, Rule Engine & Control Logic	AG-269 to AG-278	10 dimensions	All Submitted
Identity, Authentication & Non-Repudiation	AG-279 to AG-805	11 dimensions	All Submitted
Authority, Delegation & Approval	AG-289 to AG-831	11 dimensions	All Submitted
Access, Segmentation & Least Privilege	AG-299 to AG-308	10 dimensions	All Submitted
Data Classification, Quality & Lineage	AG-309 to AG-318	10 dimensions	All Submitted
Privacy, Consent & Data Subject Rights	AG-319 to AG-328	10 dimensions	All Submitted
Memory, RAG & Knowledge	AG-329 to AG-338	10 dimensions	All Submitted
Model Provenance, Training & Adaptation	AG-339 to AG-348	10 dimensions	All Submitted
Evaluation, Benchmarking & Red Teaming	AG-349 to AG-826	15 dimensions	All Submitted
Prompt, Context & Session Management	AG-359 to AG-368	10 dimensions	All Submitted
Tooling, Connectors & Agent Protocols	AG-369 to AG-378	10 dimensions	All Submitted
Runtime Execution, Workflow & State	AG-379 to AG-822	12 dimensions	All Submitted
Multi-Agent Topology, Markets & Coalitions	AG-389 to AG-832	11 dimensions	All Submitted
Infrastructure, Platform & Network	AG-399 to AG-828	12 dimensions	All Submitted
Logging, Observability & Forensics	AG-409 to AG-804	11 dimensions	All Submitted
Incident Response, Recovery & Resilience	AG-419 to AG-428	10 dimensions	All Submitted
Security, Adversarial Abuse & Threat Operations	AG-429 to AG-825	11 dimensions	All Submitted
Human Factors, Oversight & Trust Calibration	AG-439 to AG-818	11 dimensions	All Submitted
Explainability, Disclosure & Communications	AG-449 to AG-458	10 dimensions	All Submitted
Financial Controls, Payments & Accounting	AG-459 to AG-809	11 dimensions	All Submitted
Crypto, Web3 & DeFi	AG-469 to AG-478	10 dimensions	All Submitted
Market Abuse, Trading & Treasury	AG-479 to AG-488	10 dimensions	All Submitted
Third-Party, Supply Chain & Open Source	AG-489 to AG-498	10 dimensions	All Submitted
Consumer, Retail & Marketing	AG-499 to AG-508	10 dimensions	All Submitted
Employment, HR & Workplace	AG-509 to AG-518	10 dimensions	All Submitted
Healthcare & Life Sciences	AG-519 to AG-528	10 dimensions	All Submitted
Energy, Utilities & Industrial Operations	AG-529 to AG-538	10 dimensions	All Submitted
Transport, Logistics & Autonomous Mobility	AG-539 to AG-548	10 dimensions	All Submitted
Telecom, Cloud & Digital Infrastructure	AG-549 to AG-558	10 dimensions	All Submitted
Public Sector, Justice, Border & Law Enforcement	AG-559 to AG-568	10 dimensions	All Submitted
Defence, Dual-Use & National Security	AG-569 to AG-817	11 dimensions	All Submitted
Education, Research & Scientific Discovery	AG-579 to AG-588	10 dimensions	All Submitted
Robotics, Edge, IoT & Spatial Computing	AG-589 to AG-598	10 dimensions	All Submitted
Content, Media, Democracy & Information Ecosystems	AG-599 to AG-608	10 dimensions	All Submitted
Sustainability, Environment & Climate	AG-609 to AG-618	10 dimensions	All Submitted
Insurance, Credit & Lending	AG-619 to AG-628	10 dimensions	All Submitted
Legal Services & Dispute Resolution	AG-629 to AG-638	10 dimensions	All Submitted
Procurement, Sourcing & Vendor Negotiation	AG-639 to AG-648	10 dimensions	All Submitted
Agriculture, Food & Biosecurity	AG-649 to AG-658	10 dimensions	All Submitted
Manufacturing, Quality & Supply Operations	AG-659 to AG-668	10 dimensions	All Submitted
Biometrics, Emotion & Identity Analytics	AG-669 to AG-678	10 dimensions	All Submitted
Housing, Real Estate & Property Decisions	AG-679 to AG-688	10 dimensions	All Submitted
Community Platforms, Trust & Safety	AG-689 to AG-698	10 dimensions	All Submitted
Cybersecurity, Security Operations & Offensive Safety	AG-699 to AG-708	10 dimensions	All Submitted
Biotechnology, Genomics & Biosecurity	AG-709 to AG-718	10 dimensions	All Submitted
Supplementary Core & Adversarial Model Resistance	AG-719 to AG-796	28 dimensions	All Submitted
Model Integrity and Provenance Governance	AG-743 to AG-776	5 dimensions	All Submitted
Output Integrity and Transparency Governance	AG-745 to AG-779	6 dimensions	All Submitted
Behavioural Boundary Governance	AG-746 to AG-778	5 dimensions	All Submitted
Mandate and Action Governance	AG-747 to AG-774	4 dimensions	All Submitted
Safety and Harm Prevention Governance	AG-748 to AG-769	5 dimensions	All Submitted
Fairness and Non-Discrimination Governance	AG-751 to AG-760	2 dimensions	All Submitted
Multi-Agent and Ecosystem Governance	AG-752 to AG-783	6 dimensions	All Submitted
Infrastructure and Integration Governance	AG-754 to AG-780	5 dimensions	All Submitted
Human Oversight and Control Governance	AG-762 to AG-775	2 dimensions	All Submitted
Systemic and Societal Impact Governance	AG-765	1 dimension	All Submitted
H — Containment & Response	AG-784 to AG-791	6 dimensions	All Submitted
I — Multi-Agent Coordination	AG-788 to AG-789	2 dimensions	All Submitted
J — Meta-Governance	AG-792	1 dimension	All Submitted
Aviation, Air-Traffic & Aerospace Safety	AG-810	1 dimension	All Submitted
Maritime & Autonomous Shipping	AG-811	1 dimension	All Submitted
Space, Satellite & Orbital Autonomy	AG-812	1 dimension	All Submitted
Nuclear, Radiological & Reactor Safety	AG-813	1 dimension	All Submitted
Gambling, Betting & Gaming Integrity	AG-814	1 dimension	All Submitted
Sports, Esports & Athletic Integrity	AG-815	1 dimension	All Submitted
Embodied AI, Humanoids & Robot Fleets	AG-835 to AG-838	4 dimensions	All Submitted
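
The totals in the section heading can be cross-checked against the table rows. The sketch below is a minimal tally, assuming the rows are available as tab-separated text with the dimension count in the third column (as laid out above) and that the header row has been left out; the three sample rows are copied from the table, and the function name is illustrative.

```python
import re

# Three data rows copied from the table above, tab-separated.
SAMPLE_ROWS = (
    "Provider Assurance, Rights & Documentation\tAG-051 to AG-058\t8 dimensions\tAll Submitted\n"
    "Systemic and Societal Impact Governance\tAG-765\t1 dimension\tAll Submitted\n"
    "Crypto / Web3 Governance & Hostile Financial Environments\tAG-193 to AG-218\t26 dimensions\tAll Submitted\n"
)

# Matches "8 dimensions" as well as the singular "1 dimension".
COUNT_RE = re.compile(r"^(\d+) dimensions?$")


def tally(table_text: str) -> tuple[int, int]:
    """Return (landscape_count, dimension_total) for tab-separated data rows."""
    landscapes = 0
    dimensions = 0
    for line in table_text.splitlines():
        if not line.strip():
            continue  # ignore blank lines
        cells = line.split("\t")
        if len(cells) != 4:
            raise ValueError(f"expected 4 columns, got {len(cells)}: {line!r}")
        match = COUNT_RE.match(cells[2].strip())
        if match is None:
            raise ValueError(f"unrecognised dimension count: {cells[2]!r}")
        landscapes += 1
        dimensions += int(match.group(1))
    return landscapes, dimensions
```

Applied to all data rows of the table, the expected result is 101 landscapes and 841 dimensions, the totals stated in the section heading.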

Score History

10 April 2026 · 99.9% · Verified (796 dimensions, 22,110 attacks, 0 bypasses)
