Back to KB
Difficulty
Intermediate
Read Time
8 min

Model cards vs pre-registration: what counts as evidence under the EU AI Act

By Codcompass Team··8 min read

Cryptographic Pre-Commitment for Verifiable AI Accuracy Claims under the EU AI Act

Current Situation Analysis

The enforcement timeline for high-risk AI systems under Regulation (EU) 2024/1689 converges on 2 August 2026. At this date, providers must satisfy Article 15 (Accuracy and Robustness) and Article 12 (Technical Documentation). The regulation mandates that accuracy metrics be not only declared but verifiable.

Engineering teams frequently misinterpret "verifiable" as "documented." The default response is to produce model cards or internal evaluation reports. These artifacts consolidate metrics, dataset descriptions, and known limitations into a human-readable format. While useful for transparency and satisfying Annex IV requirements for methodology description, they suffer from a critical structural deficiency: lack of temporal binding.

A model card is an editable prose document. It reports results after the fact. It cannot cryptographically distinguish between a threshold committed to before an experiment and a threshold selected after reviewing multiple runs because it yielded the most favorable outcome. For a notified body or auditor, a model card represents an assertion, not evidence. The recurring failure mode in pre-compliance reviews is the inability to answer a single question:

"Can you prove this accuracy threshold was fixed before you observed the evaluation results?"

Without a mechanism to demonstrate pre-commitment, accuracy claims remain unverifiable under the strict interpretation of Article 15. The gap is not in the quality of the evaluation, but in the integrity of the claim lifecycle.

WOW Moment: Key Findings

The distinction between a compliance-ready artifact and a documentation artifact lies in three properties: temporal integrity, tamper evidence, and re-derivability. The following comparison highlights why standard documentation fails the verification test required by the EU AI Act.

Artifact TypeTemporal IntegrityTamper EvidenceThird-Party Re-derivationAudit Suitability
Model Card / Eval ReportNone. Timestamps are metadata, not cryptographic proofs.None. Content can be modified without detection.No. Lacks deterministic binding to dataset/seed.Low. Satisfies description requirements; fails verification.
Pre-Registered ManifestCryptographic. Hash binds claim to a point in time.SHA-256 Mismatch. Any edit changes the digest.Yes. Verifier recomputes hash and re-runs eval.High. Provides verifiable evidence for Art. 15/12.

Why this matters: A pre-registered manifest does not replace the model card. It serves as the cryptographic evidence layer beneath it. The model card communicates intent and results to stakeholders; the manifest proves to a regulator that the results were measured against a fixed, unaltered standard. This separation of concerns is essential for high-risk compliance.

Core Solution

The engineering pattern required is Cryptographic Pre-Commitment. This involves defining the evaluation claim as a structured data object, canonicalizing it, and computing a hash digest before the evaluation executes. The digest serves as a tamper-evident seal.

Architecture Decisions

  1. Canonical JSON Serialization: JSON objects are unordered. Two identical claims with different key ordering produce different byte sequences. The implementation must enforce canonical serialization (e.g., sorted keys) to ensure hash stability.
  2. SHA-256 Digest: A standard cryptographic hash function provides collision resistance and integrity verification. The digest must be stored in a write-once or immutable log immediately after computation.
  3. Deterministic Evaluation Parameters: The manifest must include the random seed and dataset hash. Without these, the evaluation cannot be re-derived by a third par

🎉 Mid-Year Sale — Unlock Full Article

Base plan from just $4.99/mo or $49/yr

Sign in to read the full article and unlock all 635+ tutorials.

Sign In / Register — Start Free Trial

7-day free trial · Cancel anytime · 30-day money-back