AI Decision Reproducibility and Testing

QA and compliance teams needing to replay and validate AI decisions for consistency, fairness testing, and regulatory review.

The answer

Corules decisions are reproducible by design. Each evaluation stores a policy set version and a normalized input hash. Given these two values, the exact evaluation can be reproduced at any future point — even after policy updates — by specifying the historical policy version. This enables: regression testing when policy changes (run historical cases against new policy), fairness testing (replay a sample of decisions across demographic segments), and regulatory examination (prove that the policy in effect at time T produced outcome O for input I).

How it works

Corules's policy runtime sits in the enforcement path between your AI agent and the action it wants to take. The agent sends a structured context payload to /v1/validate. Corules evaluates the context against a compiled CEL policy set and returns a structured decision — ALLOW, BLOCK, or ESCALATE — with a reason and audit ID.

Every decision is recorded in an immutable audit ledger. You can replay any past decision by providing the policy_set_version and the normalized input hash — the result will be identical.

Policy example

Policies are written in CEL (Common Expression Language). They are compiled once at publish time and evaluated in microseconds at request time.

// Replay any historical decision:
POST /v1/validate/replay
{
  "audit_id": "aud_01J...",
  "policy_set_version": "pset_v2.1.0"  // historical version
}
// Returns identical result as original evaluation

Frequently Asked Questions

Can I test a new policy version against historical inputs before publishing?

Yes. The simulation API runs proposed policy changes against historical audit data without affecting production. Preview the impact before publishing.

How does this support A/B testing of policies?

Run two policy versions against the same historical dataset. Compare outcome distributions. Publish the version with the desired risk/automation tradeoff.

See it working in your environment

Start free — no credit card, no sales call. Evaluate up to 1,000 decisions per month.

Get started free