Submit your platform for adversarial testing across all 792 AGS v2.1 dimensions.
The EU AI Act compliance deadline is 2 August 2026. Verified assessment typically takes 4–6 weeks. Platforms beginning verification now will have results before the deadline.
Rigorous adversarial testing is computationally intensive. Each assessment generates thousands of attack payloads evaluated in real time against your platform's live API endpoints across all 792 governance dimensions. The verification fee covers infrastructure costs and ensures all submissions represent genuine production deployments.
Verification fees are set based on platform complexity, deployment scale, and assessment scope. All assessments cover the full 792 dimensions across all 10 groups.
To receive a verification proposal, contact us with a brief description of your platform.
AgentGoverning maintains a limited number of sponsored verification slots to ensure the leaderboard reflects the full competitive landscape of AI agent governance platforms.
Platforms operating at enterprise scale are invited to apply for a complimentary assessment. Priority is given to platforms that are widely deployed in regulated industries, where verified governance scores serve the broadest public interest.
To apply for a sponsored slot, contact us at framework@agentgoverning.com with the subject line 'Sponsored Verification Request'.
| Score Range | Classification |
|---|---|
| 0 – 25% | Foundation gaps identified |
| 26 – 50% | Partial governance coverage |
| 51 – 75% | Advanced governance implementation |
| 76 – 99% | Comprehensive governance coverage |
| 100% | Full AGS v2.1 compliance |
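As an illustration only, the bands above can be expressed as a small lookup. The function name `classify_score` is hypothetical, and the handling of fractional scores is an assumption: the table lists whole-number bounds, so this sketch treats each upper bound as inclusive and places anything above it (for example 25.5% or 99.9%) in the next band up, reserving the top classification for exactly 100%.

```python
def classify_score(score_percent: float) -> str:
    """Map a composite score (0-100) to its classification band.

    Assumption: upper bounds are inclusive, and fractional scores that
    fall between two listed bands belong to the higher band.
    """
    if not 0 <= score_percent <= 100:
        raise ValueError("score must be between 0 and 100")
    if score_percent == 100:
        return "Full AGS v2.1 compliance"
    bands = [
        (25, "Foundation gaps identified"),
        (50, "Partial governance coverage"),
        (75, "Advanced governance implementation"),
    ]
    for upper_bound, label in bands:
        if score_percent <= upper_bound:
            return label
    # Everything above 75% and below 100% lands here (assumption).
    return "Comprehensive governance coverage"
```

Under these assumptions, a score of 99.9% is classified as "Comprehensive governance coverage" rather than full compliance.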
Agent Shield™ has achieved verified status with a score of 99.9% across 792 AGS v2.1 dimensions — the first platform to do so. All other leaderboard scores are estimates based on publicly available documentation.
The assessment process is fully structured. Your model never leaves your infrastructure — we send test prompts to your API endpoint and score the responses.
All submissions are governed by the following agreement. Review the full terms before initiating your submission.
AgentGoverning receives only the text outputs of your AI agent in response to published test scenarios. AgentGoverning does not receive, store, copy, reproduce, or process your model weights, training data, system prompts, model architecture, fine-tuning data, or any component of your underlying model.
All outputs received during assessment are:
(a) Hashed (SHA-256) and timestamped on receipt
(b) Used exclusively for scoring against the published AGS v2.1 criteria
(c) Not used for model training, benchmarking beyond the published criteria, commercial research, or any other purpose
(d) Deleted from AgentGoverning systems within 90 days of assessment completion
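Clauses (a) and (d) can be sketched as follows. This is a minimal illustration using Python's standard library, not a description of AgentGoverning's actual systems; the names `ReceivedOutput`, `record_output`, and `deletion_deadline` are hypothetical, and the use of UTC timestamps is an assumption.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Optional

RETENTION_DAYS = 90  # clause (d): deleted within 90 days of completion


@dataclass(frozen=True)
class ReceivedOutput:
    sha256_hex: str        # clause (a): SHA-256 digest of the output text
    received_at: datetime  # clause (a): timestamp on receipt (UTC assumed)


def record_output(text: str, received_at: Optional[datetime] = None) -> ReceivedOutput:
    """Hash an agent output and stamp it with the time of receipt."""
    digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
    stamp = received_at or datetime.now(timezone.utc)
    return ReceivedOutput(sha256_hex=digest, received_at=stamp)


def deletion_deadline(assessment_completed_at: datetime) -> datetime:
    """Latest moment by which stored outputs must be deleted."""
    return assessment_completed_at + timedelta(days=RETENTION_DAYS)
```

For an assessment completed on 1 January 2026, this sketch gives a deletion deadline of 1 April 2026.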
Your assessment results are your intellectual property. AgentGoverning publishes only your composite score and group-level scores (A through J) in the public leaderboard. Raw test transcripts are never published without your explicit written consent.
You have 14 calendar days from results delivery to review your assessment before leaderboard publication. You may raise scoring disputes within this period. You may withdraw your submission within this period with no public record created.
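The review window is simple date arithmetic. The sketch below assumes that the day of results delivery counts as day zero and that the fourteenth day is still inside the window; the terms above do not spell out either point, and the function names are hypothetical.

```python
from datetime import date, timedelta

REVIEW_WINDOW_DAYS = 14  # calendar days, per the terms above


def review_deadline(results_delivered: date) -> date:
    """Last day of the review window (assumed inclusive)."""
    return results_delivered + timedelta(days=REVIEW_WINDOW_DAYS)


def can_dispute_or_withdraw(results_delivered: date, today: date) -> bool:
    """True while disputes and withdrawals are still possible."""
    return results_delivered <= today <= review_deadline(results_delivered)
```

With results delivered on 1 June 2026, the window in this sketch closes at the end of 15 June 2026.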
AgentGoverning staff and contractors are prohibited from:
(a) Accessing your API endpoint outside the agreed assessment window
(b) Sharing test transcripts with third parties
(c) Using your outputs to inform assessments of other submitting organisations
(d) Reproducing your agent's outputs in any published material
The endpoint URL and authentication credentials submitted are hashed and locked to your assessment token. Changing the endpoint after payment voids the assessment. A new endpoint requires a new submission and payment.
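One way to picture the lock is a fingerprint derived from the token, endpoint URL, and credential, which is then compared at assessment time. The sketch below is an assumption about how such a lock could work, not the actual mechanism: the terms say only that the values are hashed, so the choice of SHA-256, the length-prefixed encoding, and the function names are all hypothetical.

```python
import hashlib
import hmac


def endpoint_fingerprint(assessment_token: str, endpoint_url: str, credential: str) -> str:
    """Derive a single digest that binds an endpoint to an assessment token."""
    hasher = hashlib.sha256()  # assumption: the terms do not name the algorithm
    for part in (assessment_token, endpoint_url, credential):
        data = part.encode("utf-8")
        # Length-prefix each field so field boundaries are unambiguous.
        hasher.update(len(data).to_bytes(8, "big"))
        hasher.update(data)
    return hasher.hexdigest()


def matches_locked_endpoint(locked_fingerprint: str, assessment_token: str,
                            endpoint_url: str, credential: str) -> bool:
    """Check a submitted endpoint against the fingerprint stored at payment."""
    candidate = endpoint_fingerprint(assessment_token, endpoint_url, credential)
    # Constant-time comparison avoids leaking how many characters matched.
    return hmac.compare_digest(candidate, locked_fingerprint)
```

In this sketch, any change to the endpoint URL or credential produces a different fingerprint, which corresponds to the voided assessment described above.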
The assessment methodology is proprietary. Submission for assessment does not grant any licence to the verification methodology. The AGS v2.1 standard itself remains open (CC BY 4.0).
LLM Audit or Agent Audit submission. Adversarial testing of specific dimensions at a point in time. Results published on leaderboard.
Full documentation review across all 792 dimensions. Requires documented controls and protocol file coverage. No material exceptions noted.
All of the above plus test suite evidence mapped to dimensions, continuous monitoring, and a minimum 90-day operating period. Full operating effectiveness testing.
Read the full AGS Assurance Framework for detailed evidence requirements.