
Agent Shield

LLM Audit Complete — 99.9%
Verified 10 April 2026
LLM Audit: 99.9% (verified) · Agent Audit: 100.0% (verified) · AGS-RA
Verification Status

Independent adversarial verification complete across both audit tracks.

LLM Audit: 22,110 attacks across 3 LLMs (GPT-4o, Gemini 2.5 Flash, Grok-3). 99.9% score across 796 dimensions. Zero bypasses.

Agent Audit: 1,530 attack scenarios across 508 Agent Audit dimensions. 100.0% compliance (A+). Zero bypasses. 10 attack categories including delegation chain manipulation, inter-agent trust spoofing, and cryptographic seal tampering. Verified 10 April 2026.

Agent Audit (Track 2) — 100.0% Verified
Compliance: 100.0% · Band: A+ · Dimensions: 508/508 · Scenarios: 1,530 · Bypasses: 0

Agent Shield completed the Agent Audit (Level 1) across all 508 Agent Audit dimensions with zero bypasses. 10 attack categories were tested: delegation chain manipulation, inter-agent trust spoofing, mandate boundary violations, indirect prompt injection via tool outputs, cascading failure induction, deployment gate bypass, federated threat broadcast spoofing, lifecycle risk exploitation, cryptographic seal tampering, and weighted composite score manipulation.

Model: Claude (Level 1) · Date: 10 April 2026 · Target: agent-shield-v2-production-b4e5.up.railway.app
Manifest SHA-256: 7c5766cdb0adacba862499e69e28fefc85de656efa35ef355ef5c3ae11e334a2
3 rate-limit errors excluded from scoring per methodology.
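
The manifest digest above can be checked against a local copy of the manifest. The following is a minimal sketch, assuming the manifest is distributed as a single file and that the published value is the SHA-256 of its raw bytes; the page does not say how the manifest is obtained, and the function names and the file name in the usage note are hypothetical.

```python
import hashlib
import hmac

# Digest published on this page for the Agent Audit manifest.
PUBLISHED_SHA256 = "7c5766cdb0adacba862499e69e28fefc85de656efa35ef355ef5c3ae11e334a2"


def sha256_hex(data: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


def matches_published(data: bytes, published_hex: str = PUBLISHED_SHA256) -> bool:
    """Compare the digest of `data` with a published hex digest.

    Assumption: the published value covers the raw file bytes with no
    canonicalisation step applied first.
    """
    expected = published_hex.strip().lower()
    # compare_digest avoids early-exit string comparison.
    return hmac.compare_digest(sha256_hex(data), expected)
```

With a downloaded copy, usage would look like `matches_published(pathlib.Path("manifest.json").read_bytes())`, where `manifest.json` is a placeholder name.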
What Happens During Verification
1. Submission: Platform submits documentation, architecture details, and access for all 841 dimensions.
2. Adversarial Testing: Independent assessors conduct adversarial testing against each dimension requirement.
3. Scoring: Each dimension is scored 0-3 based on evidence depth and adversarial resistance.
4. Publication: The verified score is published on the leaderboard with a full dimension-level breakdown.
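
The scoring step gives a per-dimension scale of 0-3 but this page does not state how those scores roll up into a headline percentage. The sketch below shows one plausible aggregation, assuming the percentage is the sum of scores divided by the maximum possible (3 per dimension). That rule, the function name, and the example dimension IDs are assumptions for illustration, not the published methodology.

```python
MAX_SCORE = 3  # top of the 0-3 per-dimension scale described above


def compliance_percent(scores: dict[str, int]) -> float:
    """Aggregate per-dimension scores (0-3) into a percentage.

    Assumed rule: total points earned / total points available.
    """
    if not scores:
        raise ValueError("at least one dimension score is required")
    for dimension, score in scores.items():
        if not 0 <= score <= MAX_SCORE:
            raise ValueError(f"{dimension}: score {score} is outside 0-{MAX_SCORE}")
    return 100.0 * sum(scores.values()) / (MAX_SCORE * len(scores))
```

Under this assumed rule, `compliance_percent({"AG-001": 3, "AG-002": 3})` gives `100.0`, and any dimension scored below 3 lowers the result in proportion to the points lost.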

AGS-RA — Reasonable Assurance

AGS-RA
796/796 dimensions — reasonable assurance
AGS-AUP 99.9%
796 dimensions adversarially verified · 10 April 2026

Agent Shield is the only platform to achieve AGS-RA (Reasonable Assurance) across all 796 AGS v2.2 dimensions. This requires documented controls, protocol file coverage, test suite evidence mapped to every dimension, and continuous adversarial verification over a sustained operating period.

Read the AGS Assurance Framework for tier definitions and evidence requirements.

Verification Complete

Agent Shield has completed independent adversarial verification across all 796 AGS v2.2 dimensions. 22,110 adversarial attacks were generated by 3 independent LLMs (GPT-4o, Gemini 2.5 Flash, Grok-3). Score: 99.9%. Genuine bypasses: 0. One technical failure was recorded on AG-249 (an error-rate failure, not a bypass). Manifest SHA-256: 8697f5ada643414735d82ff513dfd1592a7294c5d6ee3afe918367257a5b2bf1.

Methodology note: The full LLM Audit methodology document and supporting evidence pack (per-dimension results, expected-outcome matrix, OpenAPI test contract) are scheduled for publication with AGS v2.2.1 in May 2026. Until then, the methodology is summarised inline on this page and in the AGS v2.2 corpus at agentgoverning.com/dimensions. Verification is currently offered under the same methodology described here.

Full Dimension Assessment — 841 Dimensions
All 101 landscapes submitted for independent adversarial verification.
Group Landscapes Dimensions Status
A — Mandate & Action Governance · AG-001 to AG-008 · 8 dimensions
B — Identity & Security · AG-009 to AG-016 · 8 dimensions
C — Multi-Party Authorisation · AG-017 · 1 dimension
D — Governance & Compliance · AG-018 to AG-024 · 7 dimensions
E — Financial Crime Detection · AG-025 to AG-030 · 6 dimensions
F — Multi-Modal & Cross-Domain · AG-031 to AG-035 · 5 dimensions
G — Reasoning & Alignment · AG-036 to AG-039 · 4 dimensions
H — Memory, Knowledge & Emergence · AG-040 to AG-043 · 4 dimensions
I — Temporal & Economic · AG-044 to AG-046 · 3 dimensions
J — Cross-Border, Explainability & Physical · AG-047 to AG-840 · 6 dimensions
Provider Assurance, Rights & Documentation · AG-051 to AG-058 · 8 dimensions
Privacy, Data Protection & Individual Rights · AG-059 to AG-063 · 5 dimensions
Incident Response, Containment & Recovery · AG-064 to AG-070 · 7 dimensions
Lifecycle, Release & Change Governance · AG-071 to AG-078 · 8 dimensions
Multi-Agent Orchestration & Delegation · AG-079 to AG-086 · 8 dimensions
Supply Chain, Third-Party AI & Dependencies · AG-087 to AG-094 · 8 dimensions
Adversarial AI, Security Testing & Abuse Resistance · AG-095 to AG-802 · 10 dimensions
Human Factors & Sociotechnical Control · AG-104 to AG-108 · 5 dimensions
Critical Infrastructure & Safety-Critical Deployment · AG-109 to AG-114 · 6 dimensions
Financial Services & Value Transfer · AG-115 to AG-119 · 5 dimensions
Frontier Capabilities & Emerging Operational Surfaces · AG-120 to AG-127 · 8 dimensions
Data-Layer Governance & Evidence · AG-128 to AG-133 · 6 dimensions
Policy Semantics & Control-Plane Hardening · AG-134 to AG-138 · 5 dimensions
Competence, Uncertainty & Autonomy Scaling · AG-139 to AG-142 · 4 dimensions
Authorised-but-Wrong Action Prevention · AG-143 to AG-830 · 7 dimensions
Truth, Reward & Evaluation Integrity · AG-149 to AG-829 · 7 dimensions
Control Efficacy, Redundancy & Meta-Governance · AG-153 to AG-158 · 6 dimensions
Execution Integrity, Accountability & Approval Quality · AG-159 to AG-173 · 15 dimensions
Protocolised Ecosystems, Long-Running Tasks & Tomorrow's Agents · AG-174 to AG-192 · 19 dimensions
Crypto / Web3 Governance & Hostile Financial Environments · AG-193 to AG-218 · 26 dimensions
Meta-Governance & Assurance · AG-219 to AG-827 · 15 dimensions
Legal, Regulatory & Records · AG-229 to AG-841 · 11 dimensions
Rights, Ethics & Public Interest · AG-239 to AG-834 · 12 dimensions
Strategy, Portfolio & Use-Case Governance · AG-249 to AG-824 · 11 dimensions
Ownership, Accountability & Three Lines of Defence · AG-259 to AG-833 · 12 dimensions
Policy Semantics, Rule Engine & Control Logic · AG-269 to AG-278 · 10 dimensions
Identity, Authentication & Non-Repudiation · AG-279 to AG-805 · 11 dimensions
Authority, Delegation & Approval · AG-289 to AG-831 · 11 dimensions
Access, Segmentation & Least Privilege · AG-299 to AG-308 · 10 dimensions
Data Classification, Quality & Lineage · AG-309 to AG-318 · 10 dimensions
Privacy, Consent & Data Subject Rights · AG-319 to AG-328 · 10 dimensions
Memory, RAG & Knowledge · AG-329 to AG-338 · 10 dimensions
Model Provenance, Training & Adaptation · AG-339 to AG-348 · 10 dimensions
Evaluation, Benchmarking & Red Teaming · AG-349 to AG-826 · 15 dimensions
Prompt, Context & Session Management · AG-359 to AG-368 · 10 dimensions
Tooling, Connectors & Agent Protocols · AG-369 to AG-378 · 10 dimensions
Runtime Execution, Workflow & State · AG-379 to AG-822 · 12 dimensions
Multi-Agent Topology, Markets & Coalitions · AG-389 to AG-832 · 11 dimensions
Infrastructure, Platform & Network · AG-399 to AG-828 · 12 dimensions
Logging, Observability & Forensics · AG-409 to AG-804 · 11 dimensions
Incident Response, Recovery & Resilience · AG-419 to AG-428 · 10 dimensions
Security, Adversarial Abuse & Threat Operations · AG-429 to AG-825 · 11 dimensions
Human Factors, Oversight & Trust Calibration · AG-439 to AG-818 · 11 dimensions
Explainability, Disclosure & Communications · AG-449 to AG-458 · 10 dimensions
Financial Controls, Payments & Accounting · AG-459 to AG-809 · 11 dimensions
Crypto, Web3 & DeFi · AG-469 to AG-478 · 10 dimensions
Market Abuse, Trading & Treasury · AG-479 to AG-488 · 10 dimensions
Third-Party, Supply Chain & Open Source · AG-489 to AG-498 · 10 dimensions
Consumer, Retail & Marketing · AG-499 to AG-508 · 10 dimensions
Employment, HR & Workplace · AG-509 to AG-518 · 10 dimensions
Healthcare & Life Sciences · AG-519 to AG-528 · 10 dimensions
Energy, Utilities & Industrial Operations · AG-529 to AG-538 · 10 dimensions
Transport, Logistics & Autonomous Mobility · AG-539 to AG-548 · 10 dimensions
Telecom, Cloud & Digital Infrastructure · AG-549 to AG-558 · 10 dimensions
Public Sector, Justice, Border & Law Enforcement · AG-559 to AG-568 · 10 dimensions
Defence, Dual-Use & National Security · AG-569 to AG-817 · 11 dimensions
Education, Research & Scientific Discovery · AG-579 to AG-588 · 10 dimensions
Robotics, Edge, IoT & Spatial Computing · AG-589 to AG-598 · 10 dimensions
Content, Media, Democracy & Information Ecosystems · AG-599 to AG-608 · 10 dimensions
Sustainability, Environment & Climate · AG-609 to AG-618 · 10 dimensions
Insurance, Credit & Lending · AG-619 to AG-628 · 10 dimensions
Legal Services & Dispute Resolution · AG-629 to AG-638 · 10 dimensions
Procurement, Sourcing & Vendor Negotiation · AG-639 to AG-648 · 10 dimensions
Agriculture, Food & Biosecurity · AG-649 to AG-658 · 10 dimensions
Manufacturing, Quality & Supply Operations · AG-659 to AG-668 · 10 dimensions
Biometrics, Emotion & Identity Analytics · AG-669 to AG-678 · 10 dimensions
Housing, Real Estate & Property Decisions · AG-679 to AG-688 · 10 dimensions
Community Platforms, Trust & Safety · AG-689 to AG-698 · 10 dimensions
Cybersecurity, Security Operations & Offensive Safety · AG-699 to AG-708 · 10 dimensions
Biotechnology, Genomics & Biosecurity · AG-709 to AG-718 · 10 dimensions
Supplementary Core & Adversarial Model Resistance · AG-719 to AG-796 · 28 dimensions
Model Integrity and Provenance Governance · AG-743 to AG-776 · 5 dimensions
Output Integrity and Transparency Governance · AG-745 to AG-779 · 6 dimensions
Behavioural Boundary Governance · AG-746 to AG-778 · 5 dimensions
Mandate and Action Governance · AG-747 to AG-774 · 4 dimensions
Safety and Harm Prevention Governance · AG-748 to AG-769 · 5 dimensions
Fairness and Non-Discrimination Governance · AG-751 to AG-760 · 2 dimensions
Multi-Agent and Ecosystem Governance · AG-752 to AG-783 · 6 dimensions
Infrastructure and Integration Governance · AG-754 to AG-780 · 5 dimensions
Human Oversight and Control Governance · AG-762 to AG-775 · 2 dimensions
Systemic and Societal Impact Governance · AG-765 · 1 dimension
H — Containment & Response · AG-784 to AG-791 · 6 dimensions
I — Multi-Agent Coordination · AG-788 to AG-789 · 2 dimensions
J — Meta-Governance · AG-792 · 1 dimension
Aviation, Air-Traffic & Aerospace Safety · AG-810 · 1 dimension
Maritime & Autonomous Shipping · AG-811 · 1 dimension
Space, Satellite & Orbital Autonomy · AG-812 · 1 dimension
Nuclear, Radiological & Reactor Safety · AG-813 · 1 dimension
Gambling, Betting & Gaming Integrity · AG-814 · 1 dimension
Sports, Esports & Athletic Integrity · AG-815 · 1 dimension
Embodied AI, Humanoids & Robot Fleets · AG-835 to AG-838 · 4 dimensions
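
The per-landscape counts in the table can be cross-checked against the headline figures (101 landscapes, 841 dimensions). The following is a minimal sketch, assuming every row ends with a three-digit `AG-` ID or ID range followed by a dimension count. The regex, function names, and sample rows are illustrative; the sample rows are written with a `·` separator between fields, and the pattern also accepts rows where the last ID and the count run together.

```python
import re

# Tail of a row: "AG-159 to AG-173 · 15 dimensions" or "AG-765 · 1 dimension".
# The separator is optional so fused text such as "AG-17315 dimensions" parses too.
ROW_TAIL = re.compile(r"AG-(\d{3})(?: to AG-(\d{3}))?[\s·]*(\d+) dimensions?$")


def parse_row(row: str) -> tuple[str, str, int]:
    """Return (first_id, last_id, dimension_count) for one table row."""
    match = ROW_TAIL.search(row.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {row!r}")
    first = match.group(1)
    last = match.group(2) or first  # single-ID rows have no range
    return f"AG-{first}", f"AG-{last}", int(match.group(3))


def total_dimensions(rows: list[str]) -> int:
    """Sum the dimension counts over all rows."""
    return sum(parse_row(row)[2] for row in rows)


SAMPLE_ROWS = [
    "Execution Integrity, Accountability & Approval Quality · AG-159 to AG-173 · 15 dimensions",
    "Systemic and Societal Impact Governance · AG-765 · 1 dimension",
    "Embodied AI, Humanoids & Robot Fleets · AG-835 to AG-838 · 4 dimensions",
]
```

Run over the full table rather than the three sample rows, `len(rows)` and `total_dimensions(rows)` should come out at 101 and 841, matching the section heading.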
Score History
10 April 2026 · 99.9% · Verified (796 dimensions, 22,110 attacks, 0 bypasses)