ISO 42001 AI Management System: What the Standard Actually Requires for AI Audit Trails
ISO/IEC 42001:2023 is the first international standard for AI management systems. Organizations deploying AI in consequential workflows need to understand three core requirements: Clause 8.4 requires documented evidence that AI processes were carried out as planned; Clause 9.1 requires behavioral performance baselines and monitoring evidence; Clause 10.2 requires nonconformity records and corrective action documentation. System logs satisfy none of these. Decision audit records satisfy all three.
What ISO 42001 Is and Who Needs It
ISO/IEC 42001:2023 is the international standard for AI management systems. It applies to any organization developing, providing, or using AI systems. ISO 42001 certification is increasingly required in EU public sector procurement, enterprise AI vendor assessments, AI-related insurance underwriting, and due diligence in AI company transactions. Organizations that cannot demonstrate a structured AI management system face growing commercial and reputational barriers even outside regulated sectors.
Clause 8.4: AI System Operation Records
Clause 8.4 requires organizations to retain documented information to have confidence that AI processes were carried out as planned. For AI agents, this requires structured records showing: what inputs the AI processed, what decision it reached, what reasoning it applied, which policy version governed the decision, and evidence the record is unaltered. Application logs showing API requests and response codes do not satisfy Clause 8.4 — they record that the system ran, not that it ran correctly per documented specifications.
Clause 9.1: Performance Monitoring and Measurement
Clause 9.1 requires organizations to determine AI system performance indicators, establish monitoring methods, and retain documented evidence of results. For AI agents, performance indicators include: decision rate by category, confidence score distributions, override rate by reviewer, decision category frequency, and model version provenance. Infrastructure metrics confirm the system is running — they do not confirm it is behaving as intended. Behavioral monitoring from decision records satisfies Clause 9.1.
Clause 10.2: Nonconformity and Corrective Action
Clause 10.2 requires that when a nonconformity occurs, it must be documented with root cause analysis and corrective action evidence. AI agent nonconformities include policy violations detected in decision records, behavioral drift beyond monitoring thresholds, systematic override patterns, and context integrity failures. ISO 42001 auditors sample nonconformity records and request the corresponding AI decision records that triggered each finding, analysis documentation, and evidence of effective corrective action.
Annex A Controls: Transparency and Human Oversight
ISO 42001 Annex A.7.4 requires information sufficient to explain AI decisions. Annex A.9.3 requires records of when humans reviewed or overrode AI outputs. Both require per-decision evidence. System-level model documentation satisfies the system description requirement but not individual decision explanation. Per-decision reasoning records with factor-level explanation satisfy A.7.4. Override and confirmation records satisfy A.9.3.
Implementation: Decision Records for ISO 42001
A single Tenet decision record satisfies evidence requirements for Clause 8.4, 9.1, 10.2, and Annex A.6.2, A.7.4, A.9.3 simultaneously. Configure TenetClient with policy_version and system_id to attach documented control evidence to every record. Use ctx.snapshot_context() for Clause 8.4 operation evidence. Attach monitoring_signals for Clause 9.1 baseline tracking. Override records satisfy Annex A.9.3. Cryptographic signing satisfies Clause 7.5 integrity requirements.