H33
#167 · the category · June 3, 2026

First Decision Reproducibility.
The verb shifts. Reproduce.

The verb shift · LOCKED Eric Beans
Replay
is evidence. The chain shows what happened.
Reproduce
is computation. Given the same inputs, the same decision comes out.
Auditors AI Regulators Model Risk Officers Internal Model Governance Governance System Auditors Underwriting Committees Policy Review Boards Rules-Engine Audit Teams
What was proven · 10-second read

Decision reproducibility is now measurable.

01
reproduce_decision returns the structured replay object plus a measured confidence score. Both signed; both reproducible.
02
Five components × 20 points = 100 total. Inputs · Policy · Model Influence · Responsibility · Outcome. The score is the page reads at 82/100 today.
03
Two tiny additive schema fields (inputs_hash, policy_ast_hash) buy a measurable scoring axis without breaking any prior proof byte-identically.
Reading any H33 proof · the six questions

Same six answers. Computation axis, second proof.

  1. 1What happened?

    A court auditor asks reproduce_decision(decision_loan_84711_recommendation) against the canonical continuity tenant — four years post-dissolution, post-#174 instrumentation. The query returns the full structured replay object and a measured 82/100 confidence.

  2. 2Who had authority?

    No new signers — #167 is a composition over the existing canonical evidence signed by the credit officer, the AI risk agent, the model_owner (#174), and the responsibility chain authorities (#14.1).

  3. 3How was authority reconstructed?

    replay_until at T = 2035 produces the snapshot. The new helper reproduce_decision(snapshot, id) walks the snapshot, composes the structured replay object, and measures per-component reproducibility against 20-point rubrics.

  4. 4What state was produced?

    A 5-component confidence score totaling 82/100 for decision_loan_84711_recommendation. Inputs 12/20, Policy 10/20, Model Influence 20/20, Responsibility 20/20, Outcome 20/20. Three of five components fully anchored.

  5. 5What artifact was returned?

    reconstruction.json — the DecisionReproduction object plus its DecisionReproductionConfidence; both quotable verbatim.

  6. 6How can a third party verify it?

    Run cargo test --test decision_reproducibility_001 -- --ignored against scif-backend @ d4937508b. Confidence total must equal 82/100; per-component scores must match byte-identically.

01The killer query — reproduce_decision(decision_id)

The killer query · the verb is the lock
asl> reproduce_decision("decision_loan_84711_recommendation")
→ structured replay object + measured confidence score
actor:        princ_credit_risk_agent_001
capability:   recommend_credit
subject:      loan_84711_borrower_principal
policy_ref:   pol_credit_underwriting:1
model:        model_credit_underwriting v1
              (full ModelInfluenceRecord from #174 attached)
threshold:    0.75
responsibility chain: present (actor + supervisor + asset_owner + …)
outcome:      recommend_approve

CONFIDENCE:   82/100  (3 of 5 components fully anchored)

02The replay object — what gets reproduced

DecisionReproduction · the structured return
Decision decision_loan_84711_recommendation
├─ InputsPARTIAL ANCHOR · 12/20
5 features inferred from ModelInfluenceRecord:
debt_to_income=0.42 · credit_utilization=0.78 · employment_tenure=8.2
payment_history=0.96 · loan_amount_to_income=10.5
No inputs_hash anchor — score climbs to 20/20 when inline
├─ Policy ASTPOINTER ONLY · 10/20
pol_credit_underwriting:1 reference present, no AST hash anchor
├─ Model Influence RecordFULLY ANCHORED · 20/20
#174's signed record: attribution frame, 5 features, 3 counterfactuals
├─ Decision ThresholdFULLY ANCHORED · 20/20
0.75 — bound to the policy in force at decision time
├─ Responsibility ChainFULLY ANCHORED · 20/20
#14.1's chain present: actor + supervisor + asset_owner + policy_owner + …
└─ OutcomeFULLY ANCHORED · 20/20
Decision.outcome = "recommend_approve" signed on the canonical event

03The money quote

Locked Eric Beans · June 3, 2026
Reproducibility Justification.
We can reproduce the decision exactly. That does not justify it.
The boundary · LOCKED Eric Beans

Reproducibility Justification.

The chain runs again the same way. The score lands at 82/100. The same five features explain the same approval. None of that is a judgment. The proof does not establish that the decision was right, fair, legal, or just. Reproducibility is a structural fact. Justification is a verdict made by competent counsel, regulators, and courts — not by replay engines. This proof is measurement, not judgment.

04The pattern — three money quotes, one corpus

Eric LOCKED June 3 2026: "You now have a recognizable pattern across the corpus."

#184 · the capstone
Institutional Memory Legal Truth.
#174 · orthogonal axis · 1st
Influence Causation.
#167 · orthogonal axis · 2nd
Reproducibility Justification.

Each proof's deepest claim arrives with its own honest limit. The pattern says: H33 produces evidence — not verdicts.

05The Reproduction Confidence — measurable rather than rhetorical

Reproduction Confidence Components · Eric LOCKED structure
decision_loan_84711_recommendation · T = 2035-06-01 (4 years post-dissolution)
Inputs
12/20
Inferred from ModelInfluenceRecord's 5 feature values. No inputs_hash anchor present — climbs to 20/20 when inline.
Policy
10/20
policy_version_ref pointer reconstructed for pol_credit_underwriting:1. No AST hash anchor — climbs to 20/20 when inline.
Model Influence
20/20
#174's signed ModelInfluenceRecord present (attribution frame, 5 features, 3 counterfactual probes).
Responsibility
20/20
#14.1's responsibility chain present (actor + supervisor + asset_owner + policy_owner + model_owner + approving_authority).
Outcome
20/20
Decision.outcome = "recommend_approve" signed on the canonical event.
Total
82/100
3 of 5 components fully anchored. The two not-fully-anchored components (Inputs, Policy AST) are honestly scored at partial / pointer-only — and named as the next surface to close.

06The schema (Eric LOCKED Option C — two tiny additive fields)

"This is one of the rare cases where two tiny schema additions buy a huge amount of future value. Without anchors, you're inferring. With anchors, you're measuring."

PolicyRegister  + policy_ast_hash: Option<String>
PolicyAmend     + policy_ast_hash: Option<String>
Decision        + inputs_hash:     Option<String>

Both fields are Option<String>, both skip-if-none, both backward-compatible. All four prior canonical-continuity-tenant proofs verified byte-identical state_ids under the extended schema.

Plus the data shapes that compose the helper:

enum ReproductionComponentStatus { FullyAnchored, PartiallyAnchored, PointerOnly, Missing }
struct ReproductionComponent      { component, status, explanation, score, max_score }
struct DecisionReproductionConfidence { total_score, max_score, components[], caption }
struct DecisionReproduction        { decision_*, inputs*, policy_*, model_influence,
                                     decision_threshold, responsibility_chain, outcome,
                                     confidence }

The reproduce_decision(snapshot, decision_id) helper lives in the test harness — it is a composition over existing snapshot fields, keeping the engine surface stable while surfacing the orthogonal-axis affordance.

07The computation axis — where this proof sits

Computation Axis · the orthogonal branch · growing
Model influence is replayable.
#174 First Model Influence Replay — proven
Decisions are reproducible.
#167 First Decision Reproducibility — proven now
Reasoning survives systems.
candidate endpoint — to be evaluated
Whether the candidate endpoint is a true new category or merely the composition of #174 + #167 is the question #167 has now teed up. Eric LOCKED: stop and reassess.

08The canonical continuity tenant — four dimensions, one reality

tenant_insurance_claim_44962d9b-... · Claim #84711 · one decision, four proofs
#167 Decision Reproducibility (this proof)
One enterprise. One loss. One claim. Four dimensions of replay and reproduction. Eric LOCKED: "That's more powerful than ten unrelated examples."

09What this proof IS and IS NOT

This proof IS

The second proof on H33's orthogonal axis. The proof that turns reproducibility from a rhetorical claim into a measured score. The first place an auditor, regulator, or court can quote a number — 82/100 — and know what its five components mean. The category Eric named: Decision Reproducibility. Applies universally — not narrowed to AI.

This proof IS NOT

A verdict. A determination of fairness or correctness. A guarantee of perfect reproducibility — perfect reproducibility is often impossible (stochastic models, non-deterministic policies, time-dependent inputs). A model re-execution from weights — that surface is named in the score as "not yet anchored" and is honest about it. A substitute for competent counsel, regulators, or model risk committees. Reproducibility ≠ Justification.

10Honest limits (Eric LOCKED — 5 total)

  1. Confidence is a measurement, not a guarantee. 82/100 means 18 points worth of components are anchored partially or by pointer only — and the page names which.
  2. Perfect reproduction is often impossible. Stochastic models, randomized policy engines, time-dependent inputs, deprecated dependencies. The proof scores reproducibility measurably, not aspirationally.
  3. Reproducibility is not Determinism. Two reproductions may legitimately disagree if the original decision was non-deterministic. The proof captures what was bound at decision time.
  4. Phase E lock open. Per-event signature verification remains the standing honest-limit from L9.
  5. Reproducibility ≠ Justification — see section 03 above.

11Evidence appendix

FieldValue
state_id at T=2035e72d3c0e71a11ce0aaf1e8c9eb5c720aff49a6238c76976b4f4435b50e43bee2
Tenanttenant_insurance_claim_44962d9b-25f5-5622-bd9a-98d5580bb8a2 (canonical continuity tenant)
Tenant rootprinc_root_claim_44962d9b-25f5-5622-bd9a-98d5580bb8a2
Decisiondecision_loan_84711_recommendation
Actorprinc_credit_risk_agent_001
Capabilityrecommend_credit
Subjectloan_84711_borrower_principal_…
Outcomerecommend_approve
Confidence — Total82/100
Confidence — Inputs12/20 (PartiallyAnchored — features from #174)
Confidence — Policy10/20 (PointerOnly — no AST hash)
Confidence — Model Influence20/20 (FullyAnchored — #174's record)
Confidence — Responsibility20/20 (FullyAnchored — #14.1's chain)
Confidence — Outcome20/20 (FullyAnchored — Decision.outcome)
T_REPLAY2035-06-01 (~4 years post-dissolution per #184)
Reconstruction artifactreconstruction.json
Harnesstests/decision_reproducibility_001.rs (scif-backend @ d4937508b)

12Readiness determination & strategic pause

Determination

First Decision Reproducibility: PROVEN IN OPERATION. reproduce_decision returns the structured replay object and a measured 82/100 confidence at T=2035, against the canonical continuity tenant, post-dissolution, with all four prior proofs' state_ids verified byte-identical under the extended schema.

What this unlocks: an auditor — and every other audience inside that umbrella — can now ask "Can you reproduce this?" and receive a deterministic, scored, byte-identically replayable answer. The category Eric named is now standing: Decision Reproducibility. The pattern across the corpus is now recognizable.

What this does not unlock: justifications, fairness verdicts, causality claims, or guarantees of perfect reproduction. Reproducibility ≠ Justification.

Strategic pause · the reflection · ANSWERED

Eric Beans, June 3 2026, post-#167: "'Reasoning survives systems' has NOT earned a third proof yet. #174 + #167 prove model influence is replayable and decision outcome is reproducible with measured confidence — strong, but still not full reasoning."

The bar a future proof must clear (LOCKED): "Given the preserved reasoning substrate, the same system can re-run the reasoning path — not merely reconstruct the decision object." That substrate requires: policy AST executable · model weights or deterministic artifact · inputs recoverable or fully anchored · agent prompt/response chain · tool calls · intermediate state · randomness seed (if any) · execution environment. Until then, the candidate stays as candidate — disciplined and powerful.

Next move: not a third orthogonal-axis proof. L9.1 Phase E close — harden the entire corpus's per-event signature verification. Replay Confidence climbs 72 → 100 across all 17+ proofs. "You've expanded the vision. Now harden the entire corpus."

Issued by H33, Inc. · Eric Beans, CEO · 2026-06-03

Independently reconstructable. Inputs: scif-backend @ d4937508b · tests/decision_reproducibility_001.rs · reconstruction.json.