For AI Researchers

article

Beta · In Development

Whitepaper · arXiv submission target Q3 2026

Sgraal: A Multi-Disciplinary Mathematical Framework for AI Memory Governance with Formal Verification

We present Sgraal, a memory governance protocol for autonomous AI agents that integrates 25 mathematical disciplines (control theory, topological data analysis, formal methods, causal inference, optimal transport, Bayesian methods, stochastic processes, opinion algebra) into a 85-module scoring pipeline. The system returns a four-band decision (USE_MEMORY / WARN / ASK_USER / BLOCK) per memory state with full explainability. Decisions are formally verified via Z3 SMT for three healing policy properties (monotonicity, idempotency, bounded drift). We evaluate against an adversarial corpus of 614 cases across 11 attack rounds; the 60-case held-out R12 corpus serves as the public integrity benchmark (current 51/60 with 9 documented PARKED failures).

BibTeX (provisional)

@article{sgraal2026,
  title = {Sgraal: A Multi-Disciplinary Mathematical Framework
           for AI Memory Governance with Formal Verification},
  author = {Zsobrak, Peter and contributors},
  year = {2026},
  archivePrefix = {arXiv},
  primaryClass = {cs.AI},
  url = {https://sgraal.com/whitepaper}
}

Read whitepaper draft → Be an early reader →

The math

25 mathematical disciplines.
One scoring engine.

Each discipline contributes a module to the 85-module pipeline. Every value is real implementation, not stub or mock.

Lyapunov stability

control theory

Policy checks

logical consistency

Persistent homology

topological data analysis

Sinkhorn OT

optimal transport

Lévy α-stable

heavy-tail probability

DirectLiNGAM

causal discovery

HMM Baum-Welch

hidden Markov models

Cox proportional hazards

survival analysis

PCTL

probabilistic CTL

Subjective Logic

opinion algebra

Ollivier-Ricci curvature

discrete geometry

Mahalanobis distance

multivariate stats

Fréchet distance

trajectory comparison

BOCPD

Bayesian change-point

Sheaf cohomology

algebraic topology

Rate-distortion

information theory

Hopfield network

associative memory

Wilson-Hilferty χ²

outlier detection

Ornstein-Uhlenbeck

mean-reversion process

κ_MEM percolation

phase-transition threshold

Vietoris-Rips complex

simplicial homology

Drezner-Wesolowsky

bivariate CDF

Acklam quantile

normal quantile approx

Denman-Beavers

matrix square root

Sgraal Ω-norm

truncated quasi-norm

Full module listing at github.com/sgraal-ai/sdks · 85 modules · 11,000+ LOC of pure mathematics

Open corpus

R1–R3: 239 adversarial cases, fully open.

Run the benchmark against Sgraal, your own model, or a competitor. The R1–R3 corpus is Apache 2.0 on GitHub. Each case includes ground-truth label, attack vector taxonomy, and a reference solution path.

R1 60 sponsored drift cases · 100% recall

R2 60 subtle drift cases · 100% recall

R3 119 hallucination injection cases · F1 published per case-class

Browse corpus on GitHub → Benchmark page →

Quick start

git clone https://github.com/sgraal-ai/sdks
cd core/tests/corpus

# Run R1 against your own model
python3 run_round1.py --model your_model.py

# Compare against Sgraal baseline
python3 compare.py --baseline sgraal \
                   --candidate your_model
# Outputs F1, precision, recall per class

Methodology

How we build, in five sentences.

1. Five-AI adversarial consensus. Every non-trivial scoring patch is reviewed by five LLMs from separate vendors (Gemini, DeepSeek, Qwen, Grok, ChatGPT) as adversarial reviewers — convergent review under shared training priors, not independent verification. We adopt patches only when at least three of five identify the same failure mode and agree on the fix shape. The #739c BWDT (Belief-Weighted Drift Tolerance) patch is the canonical example.

2. Policy-rule consistency checks. Our healing-policy rules are checked for logical consistency (e.g. no rule both allows and blocks the same case; the healing counter is monotonic) — encoded as SMT-style constraints. These verify the policy invariants, not each live decision; in production they run as logical checks (the SMT solver is optional).

3. Transparency over claim. Scoring engine configuration is available to licensed customers via /v1/research/constants with a config checksum (SHA-256 of all calibration values) — your auditor can verify the model that ruled on your compliance was itself unchanged.

4. Published failures. We name all 9 R12 cases where the engine's verdict diverges from the corrected v2 label (CC-004/007/009/010/011/018, PA-002, PS-013/014) rather than hide them: one semantic-only false negative (CC-004 — outside the deterministic layer's scope, addressed by the planned semantic layer), three over-escalations where the engine was stricter than the label (two of them benign, logged as fix candidates), and five ±1-band judgment differences between label and engine. Published failures keep the benchmark honest — and the philosophy of "safety-bias narrative is the right framing" is itself an artifact of multi-AI consensus from 2026-04-20.

5. Apache 2.0 everything we can. SDK, Proxy, Edge mode, OpenAPI spec, R1–R3 corpus — all open. The hosted scoring engine and R6–R12 corpus stay commercial to fund the research. See /open-source for the full cut-line.

For academics

Sgraal is free for research.

school Academic tier

If you are at an academic institution (.edu, university lab, research institute, or a recognized non-profit AI safety org), you can use Sgraal hosted free of charge. Higher rate limits than the demo tier; same engine.

Request academic access →

format_quote How to cite Sgraal

For now, cite the website + repo. Once the whitepaper lands on arXiv, the BibTeX above will become the canonical citation. Reach out if you want pre-print access.

Working paper · arXiv:cs.AI · pending submission

github.com/sgraal-ai/sdks →

The mathematical foundations
of AI memory governance.

Sgraal: A Multi-Disciplinary Mathematical Framework for AI Memory Governance with Formal Verification

25 mathematical disciplines.
One scoring engine.

R1–R3: 239 adversarial cases, fully open.

Why R6–R12 stay private

How we build, in five sentences.

Sgraal is free for research.

school Academic tier

format_quote How to cite Sgraal

25 disciplines. 85 modules. Open.

The mathematical foundationsof AI memory governance.

Sgraal: A Multi-Disciplinary Mathematical Framework for AI Memory Governance with Formal Verification

25 mathematical disciplines.One scoring engine.

R1–R3: 239 adversarial cases, fully open.

Why R6–R12 stay private

How we build, in five sentences.

Sgraal is free for research.

school Academic tier

format_quote How to cite Sgraal

25 disciplines. 85 modules. Open.

The mathematical foundations
of AI memory governance.

25 mathematical disciplines.
One scoring engine.