SOC 2 CC7.2 for AI Agents: Anomaly Detection and Decision Monitoring
SOC 2 CC7.2 requires monitoring system components for anomalies that are indicative of malicious acts, natural disasters, and errors affecting the entity's ability to meet its objectives. AI agents are system components — and their decision patterns are the baselines from which CC7.2 anomalies are measured. An approval rate shift of 5%, a confidence score drop of 10%, or a model version change that alters behavioral output without a change management record are all CC7.2-relevant anomalies. Infrastructure APM tools cannot detect any of them.
What CC7.2 Actually Requires for AI Agents
CC7.2 (Monitoring and Evaluation of Environmental and Technology Changes) requires entities to monitor system components for anomalies indicative of malicious acts, natural disasters, and errors. AI agents are system components; their decision patterns are the relevant baseline. A fraud agent whose approval rate shifts from 68% to 76% has exhibited a CC7.2 anomaly — even if infrastructure metrics are healthy. CC7.2 also requires anomalies to be analyzed to determine whether they represent security events. For AI agents, an undocumented model version change that alters decision behavior is exactly the type of unanticipated system change CC7.2 is designed to catch.
The Four Relevant Trust Services Criteria
CC7.2 is primary, but three others apply: CC4.1 (monitoring of internal controls — model updates that shift behavior without documented evaluation violate CC4.1), CC6.1 (logical access controls — decision records must be append-only with no DELETE path; signing keys stored separately from record store), CC3.2 (risk assessment for technology changes — pre-deployment deterministic replay generates quantitative behavioral delta evidence for model update risk assessments). Most AI agent SOC 2 gap analyses focus on CC7.2 but miss the CC4.1 model change evaluation requirement.
Why Infrastructure APM Misses the AI Compliance Gap
Datadog, New Relic, and CloudWatch detect system-level anomalies: CPU spikes, error rate increases, latency changes. They cannot detect decision-level anomalies: approval rate shifts, confidence score distribution changes, semantic reasoning divergence, or override rate increases. A model update that changes fraud detection behavior from 2.1% false positive rate to 5.8% produces no infrastructure signal — no errors, no latency change, Datadog shows green. CC7.2 requires detecting this. Infrastructure APM cannot.
Six AI Decision Anomaly Types for CC7.2
The six decision anomaly types that matter for SOC 2 CC7.2: (1) Decision rate shift — approval rate changes by >5% absolute in 7 days. (2) Confidence score distribution change — mean confidence drops by >10% relative. (3) Model version change without change record — any new model_version in provenance not in change management log. (4) Override rate increase — human reviewer override rate rises by >3% absolute in 14 days. (5) Decision category frequency shift — specific category decisions change by >15% relative in 14 days. (6) Semantic reasoning divergence — deterministic replay shows >2% of past decisions would differ today.
What SOC 2 Auditors Request for CC7.2 Evidence
A Type II auditor evaluating CC7.2 for an AI agent system requests six categories of evidence: (1) Baseline documentation — decision rate baselines established at each monitoring period start. (2) Alert configuration — threshold settings showing monitoring is calibrated to fire. (3) Alert history (12-month) — evidence continuous monitoring was active, not just configured. (4) Investigation records — documentation that anomaly alerts were reviewed and resolved. (5) Model version change log — behavioral delta measurements for each model update. (6) Decision record samples — spot verification that individual records are complete and tamper-evident. Tenet generates a structured CC7.2 compliance PDF package covering all six categories on demand.