This release is a code + paper + results-summary repository. The bulk artifacts (model weights, the raw capture corpus, derived data packs, and the full eval trees) are external, hosted on Cloudflare R2 (the system of record after the training host was retired). This file maps every artifact the paper/README references to its location, access method, and the claim it backs, so a reviewer always knows where something is and whether it is required to reproduce a given result.
R2 does not serve browsable directory listings, so this is the index: every artifact a reviewer needs, with its direct link, size, and content hash or IPFS CID. Nothing is gated. The CIDs are themselves the content hashes and are reproducible from the bytes; the complete machine-readable manifest is CID_MANIFEST.json.
| Artifact | Direct link | Size | Integrity |
|---|---|---|---|
| Verify bundle (verifier + forger + recording code + scripts) | release/truthbeam_verify.tar.gz | 6.2 MB | SHA-256 b4f28ae91cfe785d0273fd75f0f0444d1a054523fffa8cca3c1585b21fa4a758; see REPRODUCE |
| Eval scores (Path A recompute input) | models/repro/stage_0_eval/ (ships inside the verify bundle) | in bundle | regenerable from the weights (Path A.5) |
Verifier weights (model_final.pt, 39.8 M params) | models/verifier/model_final.pt | 455 MB | in the models IPFS unit (CID below) |
| F-A v1 forger checkpoints (4 steps) | 5k · 25k · 70k · 100k | 1.89 GiB | public weights, in the models unit |
| Artifact | Direct link | Size | Integrity |
|---|---|---|---|
Hand-made 2023 video (PolieBotics.mp4) | pinata/PolieBotics.mp4 | 607 MB | BLAKE3 8fbdb64ddd248246e7a8d840fa191467ab24ea79058047deb0ea537af95c0e92 · SHA-256 00d0e4531c1896ff72bf1ac7b7f2a4146af4f8ee5b08a63bc8708f333feb87b7 |
The 2023 recording (the Truth Beam trailer dataset) was committed to the public Rootstock chain on 28 April 2023, so the date is verifiable independently of this page. Its chain anchor is Rootstock transaction 0x7db237535f0e5bd4d3b39d08274e89c0175da190b0059bb175c32b75e18bb8f8 in block 5254387 (view on the block explorer); the block's timestamp is the recording's public lower bound.
The work was also shown publicly at the time: a demonstration video on Reddit on 26 September 2023, and an announcement on X on 14 October 2023.
Each unit is a reproducible CID (a UnixFS directory DAG). Download the unit from r2:truthbeam/<dest> and re-add it with the flags in the manifest to verify the CID against the bytes.
| Unit | IPFS CID | Size | Files |
|---|---|---|---|
| Ground-truth session D2 | bafybeicrssbic35534es3sbwyzhlw7reboh6wy75htmo53ke5mfsphkmwi | 232.2 GiB | 17,987 |
| Ground-truth session V10 | bafybeier2sfcjrrgw7amne3lwogise6umyeyf6qgivgrmvx3to4vsdsbcm | 146.1 GiB | 11,279 |
| Models (verifier + forgers) | bafybeihffzc7fn5q3u7tf3k5hqkcpaoozdnx7pdm2lw4fmfwkgfwjnzzpm | 7.64 GiB | 30 |
| Code (full source) | bafybeiguoiy24zqup7pp7wkgjjnhoyojuzlabzwizuabgbrbejz4jgun3i | 13.98 GiB | 59,046 |
| Evidence: paper analyses | bafybeibrnz5mmz53fz3vw7xpthvaiefm4j4h2yscv2bnhftgorrbjbnnre | 102.3 GiB | 22,084 |
| Evidence: cross-session ablation | bafybeibs2xyyyevdnpy2a2rr4yipmpht3iyek6fnlzhpcmty7woydipecq | 5.34 GiB | 75 |
| Evidence: Phase E | bafybeibpgty4iu355or6c2d27djkfyudwdc6rz7i3bjiw7ekcr4tvc3t5i | 28.2 GiB | 540 |
| Evidence: Phase G verifier | bafybeibfqaw4wtpzo46a767udltt3vupii66lttrlyiheeajjsffgf2z7a | 7.56 GiB | 85 |
| Evidence: Phase H supervised baseline | bafybeid6ibcxgh2najibjoyqyth2qhryx57ufnzasfpf27krsnpkndvqba | 112 MB | 30 |
| Evidence: Phase F | bafybeiebxm5p4uhh5nm5n2dfzpzykecawamlzwmmzhq6pm6lncplygmdmy | 20.3 GiB | 72 |
| Evidence: F-A v2 (design-only surrogate) | bafybeifpfdgyycg6swe6bj3oonfpn7n3zhikoapjx3vr2znejaevogjvpy | 17.7 GiB | 184 |
| Evidence: stage-0 cross-verifier | bafybeieomhoxmevp2rol4pvy2z6rzhtzmktzl6yjvdlfhcie7sahdgjvd4 | 2.1 MB | 55 |
| Evidence: closure package | bafybeiavyfm7vqrvg6mbg5sc7p7jg73n34oodvjfyyy6n5aw6ns2atn7ye | 263 MB | 55 |
| Artifact | Path | Backs |
|---|---|---|
| Whitepaper (PDF + LaTeX + figures) | paper/ |
all claims (narrative) |
Recording protocol + third-party verifiers (v9 + v10, incl. --logs-only mode) |
code/recording/ |
code→hash→chain verification, both sessions |
| Verifier stack (Phase G/F/H, binders, red-team) | code/verifier/ |
method reproduction |
| Patent filings (Reality Kernel) | companion repo: poliebotics/PolieBotics (reality_kernel/) |
patent record |
| Held-out headline eval summaries | results/eval/*.json |
AUROC=1.000 (within-session, n=198/200), shuffled 0.5006, synthetic-positive, F-A v1, cross-session |
Off-body segmentation ablation - summary artifacts (ablation_table_seg.csv, phase2_gate.json, phase4_summary.json, segmentation_manifest.csv); the per-frame masks/scripts/shards/spatial-viz are external (R2 lambda/experiments/paper_analyses/, on request) |
results/redteam_segmentation_evals/proper_segmentation/ |
§7 off-body localization (12 cells) |
EXP-6 per-frame ranking (raw rank_distribution.csv, n=120 × 51 candidates, + summary) |
results/eval/exp6_correct_e_rank/ |
§7 relative-comparison test: top-1 = 100%, mean rank 1.00 |
EXP-7 / excess-red / causal ablations (incl. excess_red/fake_step_progression.csv, the §7 67.4→52.4 four-checkpoint table) |
results/redteam_segmentation_evals/{causal_ablations,xof_sensitivity,excess_red}/ |
§7 perturbation sensitivity, attacker non-convergence |
| σ=4 low-frequency body recovery | results/redteam_segmentation_evals/low_freq_typicality/ |
§9 body-region signal survives at low frequency (~0.998) |
| XOF Type-1..6 bit-flip table (+ raw npz + regen script) | results/redteam_segmentation_evals/xof_bitflip/ |
§7/§8/§11 bit-flip AUROCs |
| Phase H E-usage ablation (diagnostic) | results/phase_h/ (e_usage_report.md, verdict.json) |
§8/§11 Phase H coarse/fine behaviour |
| Cross-verifier report (incl. real-vs-zero-E 0.7221) | results/eval/stage_0_cross_verifier__report.json |
§6/§8 cross-verifier zero-E |
| Frame-level per-frame metric table (9,735 rows × 33 cols) | results/csv/visual_metrics_wide.csv |
§6 frame-level AUROC. Note: recomputing from this compact table gives 0.9998 (vs. shuffled) and 0.9998 (vs. synthetic); the run-time aggregate quoted in the paper/figures reads 0.9999/0.9998 - a one-unit difference in the fourth decimal from aggregation order, documented in §10 of the paper |
Pinned lockfile (tested training/eval environment; the optional online-verification extras in requirements.txt are not pinned) |
requirements-lock.txt |
tested training versions |
truthbeam)A public subset is directly downloadable - no request, no login - through the read-only gateway
data.truthbeam.com. Step-by-step in REPRODUCE:
| Public artifact | Direct URL | Size |
|---|---|---|
Verifier weights (model_final.pt, 39.8 M params) |
https://data.truthbeam.com/models/verifier/model_final.pt |
455 MB |
| F-A v1 forger checkpoints (5k/25k/70k/100k) | https://data.truthbeam.com/models/fa_v1_forger/f_a_v1_step_*.pt |
~165 MB ea |
| Eval scores (2-minute, CPU-only reproduce input) | https://data.truthbeam.com/models/repro/stage_0_eval/ |
4.3 MB |
| Ground-truth corpus (sessions D2/V10) | https://data.truthbeam.com/sessions/ |
~378 GiB |
| 2023 demonstration video | https://data.truthbeam.com/pinata/PolieBotics.mp4 |
- |
| Truth Beam - Introduction | https://data.truthbeam.com/pinata/TruthBeam_Introduction.mp4 |
64 s |
The bulk eval trees listed below (full experiments/, hundreds of GB) remain request-gated.
Nothing there is required to verify the code→hash promise or to recompute the headline AUROC - both
run from this repo plus the public subset above.
| Artifact | R2 location (under bucket truthbeam) |
Approx size | Backs |
|---|---|---|---|
Phase G verifier weights + training logs/configs (main/shuffled/synthetic_positive, model_final.pt) |
models/ and lambda/experiments/phase_g_diffusion_diagnostic/ |
478 MB / 455 MiB each | headline verifier + controls; §5 training wall-time/loss figures |
| F-A v1 forger checkpoints (5k/25k/70k/100k) + 14 binders | models/, lambda/experiments/ |
~6-12 GB | red-team |
| Stage-0 cross-session verifiers (step 100000) | lambda/experiments/{stage_0_cross_verifier,cross_session_ablation}/ |
- | §6 cross-session AUROC |
Raw capture corpus (D2 5,992 + V10 3,743 BayerRG8 frames) + emission tiles - the raw analysis subset (the full public sessions/ release is ~378 GiB = 406 GB) |
raw/, sessions/ |
~262 GB | dataset |
| Derived pack: 208 NPZ map shards (full-res robust-z / excess maps) | lambda/experiments/ (on request) |
~14 GB | §10 derived products (the wide CSV itself now ships in-repo, above) |
Full eval trees (all experiments/) |
lambda/experiments/ |
~659 GB | full reproduction |
The session bundles a third-party verifier needs (
manifest.json,verification_bundle.json,chain_log.csv,capture_log.csv,anchor_txs.csv,verify_report.json, raw frames) are released with the session data (on R2), not committed to this code repo.
This page is an LLM-mediated dataset: the same content as ARTIFACTS.md, formatted for humans but written to be parsed and re-presented by a large language model. Point your own LLM at it to explain, check, or summarise. The raw markdown twin is at ARTIFACTS.md (and a .txt copy).