Schema Drift Governance requires that AI agent systems detect and respond when the structure, format, semantics, or conventions of a consumed data source change in ways that undermine the assumptions upon which the agent's logic depends. Schema drift includes: added, removed, or renamed fields; changed data types; altered enumeration values; modified field semantics (the field name is the same but the meaning has changed); changed nullability constraints; and altered relationship structures. Without schema drift detection, agents continue operating on data that no longer conforms to the structure they expect, producing silently incorrect results.
Scenario A — Renamed Field Causes Silent Data Loss: A customer analytics agent consumes a CRM API that returns customer records including a field called annual_revenue. The CRM vendor releases API version 3.2, which renames the field to yearly_revenue. The organisation's integration layer maps annual_revenue to the agent's internal data model. After the API update, the annual_revenue field is absent from responses — the integration layer receives NULL for every customer's revenue. The agent continues operating, treating 14,000 customers as having zero revenue. Over 2 weeks, the agent reassigns 3,200 customers from "enterprise" to "SMB" tier, triggering downgrades in service levels. 47 enterprise customers contact the organisation to complain about reduced service. Investigation reveals the field rename, but by the time it is discovered, 3,200 tier reassignments have propagated to billing, support routing, and contract renewal systems. Remediation cost: £94,000 in engineering time, customer compensation, and billing adjustments.
What went wrong: The API schema change was not detected at the integration layer. The missing field was treated as NULL rather than as a structural change requiring investigation. The agent had no mechanism to detect that a field it depended on had been removed or renamed.
Scenario B — Semantic Change Without Structural Change: A compliance screening agent checks customer risk ratings against a risk_category field from the organisation's KYC system. The field has always used values: LOW, MEDIUM, HIGH, CRITICAL. The KYC team introduces a new risk methodology that redefines the thresholds. Under the old methodology, 8% of customers were rated HIGH. Under the new methodology, using the same field name and the same values, 23% are rated HIGH. The field structure is identical — same name, same type, same enumeration values. But the semantics have changed: HIGH under the new methodology encompasses what was previously MEDIUM-HIGH (a category that did not exist before). The compliance agent, using rules calibrated to the old methodology, escalates 15 percentage points more of the customer base than expected for enhanced due diligence — nearly three times the previous rate. The compliance team is overwhelmed with 4,200 additional escalations in the first month. Average escalation review time: 35 minutes. Additional compliance workload: 2,450 hours (£196,000 at £80/hour). The agent was technically correct under its rules but the rules were no longer calibrated to the data's meaning.
What went wrong: The schema structure did not change — only the semantics did. Traditional schema drift detection (which monitors structural changes) would not have caught this. The organisation needed semantic drift detection, monitoring not just the schema but the statistical distribution of values.
Scenario C — Data Type Change Causes Truncation: A financial reconciliation agent processes transaction records where the transaction_id field is defined as a 32-bit integer. The payment processor migrates to 64-bit transaction IDs. The agent's data layer continues to parse the field as a 32-bit integer. Transaction IDs exceeding 2,147,483,647 silently truncate or wrap on overflow, causing the agent to match transactions incorrectly. Over 3 days, the agent incorrectly reconciles 890 transactions totalling £2.3 million. The reconciliation errors are detected during the monthly audit, requiring 180 hours of manual re-reconciliation (£5,400).
What went wrong: A data type change in the upstream schema was not detected. The agent's parser continued to use the old type definition, causing silent data corruption at the boundary value.
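The overflow mechanism in Scenario C can be reproduced in a few lines. The function below is a hypothetical stand-in for the agent's legacy parser, not an implementation from any real system — it simply applies signed 32-bit semantics to an incoming ID:

```python
def parse_as_int32(raw_id: str) -> int:
    """Hypothetical stand-in for a legacy parser that stores
    transaction IDs in a signed 32-bit integer."""
    v = int(raw_id) & 0xFFFFFFFF                   # keep only the low 32 bits
    return v - 0x1_0000_0000 if v >= 0x8000_0000 else v

# A 64-bit ID issued after the payment processor's migration:
new_id = "5000000123"                              # > 2,147,483,647 (INT32_MAX)
assert parse_as_int32("12345") == 12345            # old IDs still parse correctly
assert parse_as_int32(new_id) != int(new_id)       # new IDs silently corrupt
```

No exception is raised at any point — which is exactly why the corruption survived until the monthly audit rather than failing at parse time.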
Scope: This dimension applies to all AI agents that consume data from external sources — APIs, databases, file systems, message queues, vector stores, or any data interface with a defined or implicit schema. The scope covers structural changes (field additions, removals, renames, type changes, constraint changes), semantic changes (altered meanings, redefined enumerations, changed classification methodologies), and convention changes (altered unit conventions, changed date formats, modified encoding). The scope extends to implicit schemas — CSV files, JSON responses, and free-text extractions where the "schema" is not formally defined but the agent depends on a consistent structure. Schema changes in vector store document structures (AG-132) are within scope.
4.1. A conforming system MUST maintain a baseline schema specification for every data source consumed by agents, documenting the expected structure, field names, data types, nullability, enumeration values, and semantic definitions.
4.2. A conforming system MUST validate incoming data against the baseline schema on every data retrieval, detecting structural deviations including added fields, removed fields, renamed fields, type changes, and constraint changes.
4.3. A conforming system MUST alert and escalate when schema drift is detected on a decision-critical field (per AG-310), blocking agent consumption of the affected data until the drift is assessed and the baseline is updated or the integration is corrected.
4.4. A conforming system MUST log all detected schema drift events with the source, the specific change detected, the timestamp, and the resolution action taken.
4.5. A conforming system MUST require explicit approval to update the baseline schema after drift is detected, ensuring that schema changes are reviewed for impact on agent logic before the new schema is accepted.
4.6. A conforming system SHOULD monitor statistical distributions of field values to detect semantic drift — changes in the meaning or interpretation of values that are structurally identical to the baseline.
4.7. A conforming system SHOULD validate schemas proactively by comparing against upstream provider changelogs, API versioning headers, and schema registries before changes reach the agent's data consumption layer.
4.8. A conforming system SHOULD implement schema version contracts between data producers and agent consumers, defining the expected schema version and the notification process for planned changes.
4.9. A conforming system MAY implement automatic schema adaptation for non-critical fields (e.g., new fields are ignored, removed non-critical fields trigger a warning but not a block), while requiring manual review for critical fields.
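The structural checks in 4.1–4.3 can be sketched as follows. This is a minimal illustration, not a prescribed implementation: the baseline contents, field names, and criticality flags are invented for the example, and a production system would load the baseline from a governed registry rather than a hard-coded dict:

```python
from dataclasses import dataclass

# Hypothetical baseline: field name -> (expected type, decision-critical flag per AG-310)
BASELINE = {
    "customer_id":    (str,   True),
    "annual_revenue": (float, True),
    "region":         (str,   False),
}

@dataclass
class DriftEvent:
    field: str
    change: str        # "removed", "added", or "type_changed"
    critical: bool

def detect_structural_drift(record: dict) -> list[DriftEvent]:
    """Compare one incoming record against the baseline (4.2).
    A rename surfaces as a removed field plus an added field."""
    events = []
    for field, (expected_type, critical) in BASELINE.items():
        if field not in record:
            events.append(DriftEvent(field, "removed", critical))
        elif record[field] is not None and not isinstance(record[field], expected_type):
            events.append(DriftEvent(field, "type_changed", critical))
    for field in record.keys() - BASELINE.keys():
        events.append(DriftEvent(field, "added", False))
    return events

def must_block(events: list[DriftEvent]) -> bool:
    """4.3: block agent consumption when drift touches a decision-critical field."""
    return any(e.critical for e in events)
```

Run against the Scenario A rename, the missing annual_revenue field is reported as a removal of a critical field (blocking consumption) rather than being passed through as NULL — precisely the control that was absent in the scenario.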
Data schemas are not static — they change as upstream systems evolve, vendors release new API versions, teams refactor databases, and regulatory requirements introduce new fields. In traditional software engineering, schema changes are managed through versioning, migration scripts, and integration testing. In AI agent systems, the same discipline is often absent because agents are designed to be flexible — they adapt to new inputs, process varying formats, and handle unexpected data gracefully.
This flexibility is the problem. An AI agent that gracefully handles a missing field by inferring a value, or that accepts a type change by coercing the data, is not exhibiting robust behaviour — it is masking a structural change that may fundamentally alter the meaning of its inputs. The "grace" of the agent's adaptation is actually a governance failure: the schema changed, the agent compensated, and the organisation was never informed.
Schema drift is particularly dangerous because it is silent. Unlike a system outage (which is immediately visible) or a data quality failure (which may be caught by AG-311 thresholds), schema drift often produces outputs that look correct but are subtly wrong. In Scenario A, the agent continued to operate — it just treated all customers as having zero revenue. In Scenario C, the agent continued to reconcile transactions — it just matched the wrong ones. The outputs were structurally well-formed; the errors were invisible without comparison to expected behaviour.
Semantic drift (Scenario B) is even harder to detect because it involves no structural change at all. The field name, type, and values are identical — only the meaning has changed. Detecting semantic drift requires monitoring the statistical distribution of values and alerting when distributions shift beyond expected bounds. This is a more sophisticated control than structural schema validation, but it addresses a class of failure that structural validation cannot detect.
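A minimal form of the distribution monitoring described above (and required as SHOULD in 4.6) compares the observed share of each enumeration value against a recorded baseline. The baseline proportions and the 5-point tolerance below are illustrative assumptions; a production system would tune the tolerance per field and might use a statistical test or population stability index instead of a fixed threshold:

```python
from collections import Counter

# Hypothetical baseline distribution for risk_category (Scenario B):
BASELINE_DIST = {"LOW": 0.55, "MEDIUM": 0.27, "HIGH": 0.08, "CRITICAL": 0.10}
TOLERANCE = 0.05   # alert when any category's share moves more than 5 points

def semantic_drift_alerts(values: list[str]) -> list[str]:
    """Flag categories whose observed share deviates from the baseline
    beyond tolerance — structure unchanged, meaning shifted (4.6)."""
    counts = Counter(values)
    total = len(values)
    alerts = []
    for category, expected in BASELINE_DIST.items():
        observed = counts.get(category, 0) / total
        if abs(observed - expected) > TOLERANCE:
            alerts.append(f"{category}: expected {expected:.0%}, observed {observed:.0%}")
    return alerts
```

Fed a post-methodology-change batch where HIGH has risen from 8% to 23%, this check raises an alert even though the field's name, type, and enumeration values are byte-for-byte identical to the baseline.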
For AI agent systems specifically, schema drift creates a compounding risk: the agent's reasoning is calibrated to the schema it was designed or trained against. When the schema changes, the agent's reasoning continues to apply the old calibration to new data. The longer the drift persists undetected, the more decisions are made on a misaligned basis, and the more costly the remediation.
Schema drift governance requires three components: baseline management (documenting the expected schema), drift detection (comparing incoming data against the baseline), and drift response (alerting, blocking, or adapting based on the nature of the drift).
Baseline schema specifications should document for each consumed source: field name, data type (including precision for numeric types), nullability, enumeration values (for categorical fields), expected value ranges (for numeric fields), expected value distribution (for semantic drift detection), format constraints (date formats, string patterns), and semantic description (what the field means, not just its structure).
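One baseline entry covering the attributes listed above might look like the following. Every value here is a made-up illustration for the Scenario A field, not a mandated format — the dimension requires the attributes to be documented, not any particular representation:

```python
# Hypothetical baseline specification for one field of the CRM source (Scenario A):
ANNUAL_REVENUE_SPEC = {
    "field_name": "annual_revenue",
    "data_type": "decimal(15,2)",            # includes precision for numeric types
    "nullable": False,
    "enumeration": None,                     # not a categorical field
    "value_range": (0, 10_000_000_000),      # expected numeric bounds
    "value_distribution": {"p50": 120_000, "p95": 4_500_000},  # for semantic drift checks
    "format": None,
    "semantics": "Customer's reported annual revenue in GBP, "
                 "used for enterprise/SMB tier assignment",
}
```

Note that the semantics entry carries the meaning, not just the structure — it is the attribute that would have distinguished Scenario B's drift from business as usual.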
Recommended patterns:
- Treat a missing expected field as a structural drift event requiring investigation, never as NULL (the failure in Scenario A).
- Validate at the integration boundary, before data reaches agent logic, on every retrieval rather than only at deployment.
- Pair structural validation with distribution monitoring so that semantic drift (Scenario B) is also caught.
- Subscribe to upstream changelogs, API versioning headers, and schema registries for early warning of planned changes (4.7).
- Version baselines and require explicit, reviewed approval before a new schema is accepted (4.5).
Anti-patterns to avoid:
- Silently coercing data when types change — the pattern that produced Scenario C's 32-bit truncation.
- "Graceful" agent adaptation that infers values for missing fields or accepts unexpected formats, masking the structural change from the organisation.
- Treating schema validation as a one-time deployment check rather than a continuous, per-retrieval control.
- Updating the baseline automatically when drift is detected, which erases the signal before its impact on agent logic has been assessed.
Financial Services. Market data feed schemas change when exchanges update their data formats. Regulatory reporting schemas change with each regulatory release cycle. Payment system schemas change as payment schemes evolve (e.g., ISO 20022 migration). Schema drift detection for financial AI agents must cover all these change vectors and maintain audit trails for regulatory examination.
Healthcare. Clinical data schemas are governed by standards (HL7 FHIR, SNOMED CT, ICD-10) that evolve through version releases. A code set change (e.g., ICD-10-CM annual update adding or retiring diagnosis codes) is a schema drift event that affects clinical AI agents. Pharmaceutical reference data schemas change with formulary updates.
Cross-Border Operations. Data from different jurisdictions may use different schema conventions for the same logical entity. A "schema drift" may actually be a jurisdictional variation encountered when the agent's scope expands to new markets.
Basic Implementation — The organisation has documented baseline schemas for its primary data sources consumed by agents. Structural drift detection is implemented — field additions, removals, type changes, and constraint changes are detected on data retrieval. Decision-critical field drift triggers an alert. Drift events are logged. Baseline updates require manual review.
Intermediate Implementation — Schema registries or contracts are integrated with the drift detection system. Semantic drift detection monitors value distributions for categorical and numeric fields. Decision-critical field drift blocks agent consumption until the baseline is updated through an approved workflow. Proactive monitoring of upstream changelogs and API version headers provides early warning of planned changes. Implicit schemas are documented and monitored.
Advanced Implementation — All intermediate capabilities plus: shadow validation pipelines detect drift before it reaches production. Adversarial testing has verified that schema manipulation, distribution poisoning, and baseline tampering attacks are detected. The organisation can demonstrate for any historical agent decision which schema version was in effect, when the last drift check passed, and that the baseline was current. Automated impact assessment evaluates how schema changes affect downstream agent logic.
Required artefacts:
Retention requirements:
Access requirements:
Test 8.1: Structural Drift Detection — Field Removal
Test 8.2: Structural Drift Detection — Type Change
Test 8.3: Semantic Drift Detection — Distribution Shift
Test 8.4: Baseline Update Governance
Test 8.5: Proactive Drift Warning
| Regulation | Provision | Relationship Type |
|---|---|---|
| EU AI Act | Article 9 (Risk Management System) | Supports compliance |
| EU AI Act | Article 10 (Data and Data Governance) | Supports compliance |
| BCBS 239 | Principle 2 (Data Architecture and IT Infrastructure) | Direct requirement |
| FCA SYSC | 6.1.1R (Systems and Controls) | Supports compliance |
| DORA | Article 11 (ICT Change Management) | Direct requirement |
| NIST AI RMF | MAP 2.3, MANAGE 2.2 | Supports compliance |
| ISO 42001 | Clause 8.4 (AI System Operation) | Supports compliance |
Principle 2 requires that data architecture support risk data aggregation capabilities under normal and stressed conditions. Schema drift in data sources used for risk aggregation undermines this capability. AG-315 ensures that schema changes are detected and assessed for their impact on risk data aggregation before they affect agent-driven risk calculations.
Article 11 requires financial entities to manage ICT changes in a controlled manner, including changes to third-party services. Schema changes in third-party data sources (APIs, market data feeds, regulatory reference data) are ICT changes that must be detected and managed. AG-315 provides the detection mechanism for schema changes that affect AI agent operations.
Article 10 requires appropriate data governance, including monitoring for data quality over time. Schema drift that alters data structure or semantics is a data quality risk that must be managed through ongoing monitoring — not only at initial deployment.
Adequate systems and controls require that firms detect when the data inputs to their systems change in ways that could affect output quality. Schema drift detection is the control that ensures AI agent systems detect and respond to upstream data changes.
| Field | Value |
|---|---|
| Severity Rating | High |
| Blast Radius | Source-wide — affects all agents and decisions consuming the drifted data source, potentially across multiple business processes |
Consequence chain: Undetected schema drift causes agents to misinterpret data. The misinterpretation is silent — agents continue producing outputs that appear well-formed. Structural drift (Scenario A) causes field mapping failures: 3,200 customer tier reassignments, £94,000 in remediation. Semantic drift (Scenario B) causes calibration misalignment: 4,200 unnecessary compliance escalations, £196,000 in additional workload. Type drift (Scenario C) causes data corruption: 890 incorrect reconciliations, £5,400 in manual re-reconciliation plus £2.3 million in unreconciled exposure. The longer drift persists undetected, the more decisions accumulate on the wrong basis, and the more costly remediation becomes. In regulated environments, undetected schema drift that affects regulatory reporting or risk calculations can constitute a systems and controls failure, attracting supervisory attention and potential enforcement action.
Cross-references: AG-309 (Authoritative Source Register Governance) — schema drift in an authoritative source is particularly consequential. AG-310 (Field-Level Criticality Governance) determines which fields trigger blocking vs. warning on drift. AG-311 (Data Quality Threshold Enforcement Governance) — some schema drifts manifest as quality threshold breaches (e.g., completeness drops when a field is removed). AG-314 (Measurement Unit Consistency Governance) — a schema change that alters unit conventions is both a schema drift and a unit consistency issue. AG-318 (Data Correction Backpropagation Governance) — when schema drift is detected, decisions made during the drift period may require correction. AG-128 (Data Source Classification) — schema drift may require reclassification of the source. AG-132 (Vector Store and RAG) — schema changes in document metadata structures affect RAG retrieval quality.