Trigger conditions for re-assessment, vendor notification policy, monitoring cadence, score change register, and retraction procedures.
This document specifies how scores published on the Agent Governance Standard (AGS) leaderboard are kept current, when re-scoring is triggered, how vendors are notified of score changes, and how the score change history is maintained as a public, citable record. The process is designed to be predictable for vendors, reproducible by external reviewers, and aligned with the Score Dispute Process so that disagreements raised during a score change are resolved through a single coherent procedure.
Scores are reviewed under one of five trigger conditions. Each trigger has a defined response timeline measured in calendar days from the trigger event.
| Trigger | Description | Re-assessment SLA |
|---|---|---|
| Scheduled review cycle | All estimated scores are reviewed every 90 days (the start of January, April, July, and October) | Completed within the 90-day cycle |
| Vendor major release | Vendor publicly announces or releases a change to capabilities affecting any AGS dimension | Re-assessment of affected dimensions within 30 days of release |
| Vendor request | Vendor submits a re-score request via disputes@agentgoverning.com (subject line “RE-SCORE REQUEST: [platform name]”) | Re-assessment of requested dimensions within 30 days |
| Third-party dispute | An external party disputes a published score per the Score Dispute Process | Per dispute timelines (acknowledgement 3 business days, initial assessment 14 days, final determination 30 days) |
| AGS methodology version change | A new AGS version is published with revised dimensions, rubric, or scope | All platforms re-assessed in the next assessment cycle following methodology release |
A “vendor major release” is any release that changes the platform’s behaviour or documented capability on any AGS dimension. Examples include: new governance features, removal of existing capabilities, architectural changes to enforcement layer, expansion or restriction of supported sectors, changes to default configurations, or material updates to security and compliance posture.
Vendors are expected to notify Imperium of major releases within 30 days of release via disputes@agentgoverning.com with the subject line “MAJOR RELEASE: [platform name]”. Imperium also monitors public surfaces — vendor changelogs, GitHub releases, blog posts, product announcements, and security disclosures — to detect material changes between vendor notifications. Undisclosed material releases discovered after the fact result in retroactive score expiry, with the platform reverting to “Estimated — pending re-assessment” until the next assessment completes.
Between scheduled review cycles, Imperium monitors each assessed platform’s public surface area for material changes:
Material changes detected outside the scheduled cycle trigger immediate re-assessment of affected dimensions per the SLAs in the trigger table above.
Each score update follows the same workflow regardless of trigger:
The same workflow applies to all platforms, including Agent Shield. Agent Shield’s score is reviewed under the same triggers, on the same cadence, with the same evidence standards as competitor platforms — this is required by the Independence and Conflict of Interest document.
Vendors are notified of any score change via email to a maintained vendor contact registry. Notification timeline is 7 calendar days before publication.
Notification content:
Vendor contact registry. Imperium maintains a contact registry for each assessed platform. Vendors are encouraged to provide a designated contact at disputes@agentgoverning.com with subject “VENDOR CONTACT: [platform name]”. Where no designated contact exists, Imperium uses the vendor’s publicly listed press, security, or partnerships contact in that order of preference. If no public contact can be reached, the score change is held until contact is established or until 30 days have elapsed, after which the score publishes with the absence of vendor response noted in the change register.
Silence is acceptance. If no vendor response is received within the 7-day notification window, the proposed score change is published as drafted on the next publication cycle. The absence of response is noted in the change register entry.
Vendor self-disputes follow the standard process specified in the Score Dispute Process, with public-evidence requirements applied uniformly across all platforms including Agent Shield.
Every score change ever published is logged in the public score change register at agentgoverning.com/score-history/. Each entry includes:
The register is maintained as an append-only log. No score change is retroactively altered without re-assessment. When the AGS methodology version changes, prior scores remain published under the methodology version that produced them, with the methodology version visible alongside each score on the leaderboard and on platform pages.
In rare cases, a published score may need to be retracted — for example, if the assessor’s evidence sources are later found to be unreliable (a vendor URL changed and the cited content no longer matches), if an internal error is discovered, or if a methodology error is identified through dispute resolution.
Retraction follows this process:
Retractions are public events. Imperium commits to publicising any retraction promptly, including retractions that affect Agent Shield. The Score Change Register preserves the history of any retracted score for citation continuity.
A score that has been amended through the update or dispute process may itself be subsequently amended again — for example, if a vendor withdraws a capability claim, if subsequent evidence contradicts a prior amendment, or if a methodology refinement clarifies an ambiguous case in either direction.
Score reversals follow the standard update workflow with one additional requirement: the change rationale must explicitly reference the prior amendment being reversed. The Score Change Register preserves the full history of every revision so that any score’s lineage can be reconstructed.
This document and the Score Dispute Process describe one coherent system. The handshake works as follows:
| Situation | Path |
|---|---|
| Imperium initiates a score change | Update process: 7-day vendor notification → publication or dispute |
| Vendor responds during 7-day window with new evidence | Fast-track dispute: 14-day final determination, publication paused |
| Vendor responds during 7-day window with formal objection | Standard dispute: 30-day final determination, publication paused |
| Third party disputes a published score | Standard dispute: 30-day final determination |
| Vendor disputes their own published score | Standard dispute: 30-day final determination |
| Dispute outcome is “score amended” | Update process: amended score follows update workflow with 7-day notification |
| Dispute outcome is “methodology refined” | Update process: methodology version increment, all platforms re-assessed in next cycle |
Vendors interacting with both processes can expect the same notification cadence, the same evidence standards, and the same Score Change Register entry pattern regardless of which path their score change followed.
The score update process has the following acknowledged limitations as of AGS v2.1:
| Version | Date | Change |
|---|---|---|
| 1.0 | 2026-04 | Initial publication — codifies the score update process applied to AGS v2.1 leaderboard scoring as published on agentgoverning.com |