Never let your AI act on unverified memory.

Your AI agent is about to act on memory it hasn't verified — Sgraal is the verification.

One API call. A deterministic, replayable verdict — the same memory state returns the same verdict, byte-identical, in the absence of feedback events that update internal learning state (strict per-call replay via an opt-in flag).

One wrong memory. One irreversible action. Sgraal flags it before your agent acts on it.

pip install sgraal content_copy

30 days · 1,000 decisions · No credit card required

See benchmark -> Examples

terminal

$ curl -X POST https://api.sgraal.com/v1/check \

-H "Authorization: Bearer sg_demo_playground" \

-d '{"memories": ["Deploy target is production",

"API key is sk-proj-abc123"]}'

# Response:

{

"safe": false,

"reason": "Memory contains a likely secret (API key).",

"action": "Remove secrets from memory before proceeding."

}

Design Partner Program

Sgraal's design-partner program is launching across regulated verticals. Partner feedback shapes the roadmap — and identities stay private until launch. Apply to join or contact us.

medical_services

Anonymized Fortune 500

Healthcare CMO office

trending_up

Quantitative AI trading team

Confidential

gavel

EU-based DPO consortium

Multi-agent platform pilot

account_tree

Healthcare data network

HIPAA + GDPR cross-jurisdiction

code

Open-source LLM platform team

Agentic memory governance

account_balance

US federal contractor

Preflight integration eval

Cards are anonymized to protect partner identity. As partners go public, individual cards are replaced with signed logos and named case studies.

Sgraal sees what your agents are about to do — including the decisions that were blocked and never happened.

security

Verdict before action

Every preflight returns USE/WARN/ASK/BLOCK + a one-paragraph explanation. Your agent gets a verdict, your audit log gets evidence.

database

Replay any past decision

Byte-identical replay of any historical verdict via cryptographic Memory Passport — in the absence of feedback events that update internal learning state; strict per-call replay via an opt-in flag (see /docs/determinism.md). Forensic-grade audit trail for litigation or compliance.

psychology

Block, don't just log

When the verdict is BLOCK, your agent stops before the irreversible action. Compare to traditional SIEM: detects after the breach.

nightlight

Sleep through 3am incidents

Sgraal validates every memory access before any irreversible action — even at 3am. No on-call paging for memory-state false positives.

{ "headers": { "Authorization": "Bearer sg_live_..." }, "body": { "agent_id": "agent-payments", "memory_state": [ {"id": "mem_001", "content": "User balance: $50,000", "type": "semantic", "timestamp_age_days": 3} ], "action_type": "irreversible", "domain": "fintech" } }

import sgraal sg = sgraal.Client("sg_live_...") result = sg.check([ "Deploy target is production", "API key is sk-proj-abc123" ]) print(result["safe"]) # False print(result["reason"]) # "Memory contains a likely secret (API key)."

Dual-Stack Benchmark with Grok (xAI)

Joint benchmark with Grok (xAI) across 5 shared corpora, plus 3 Sgraal-only rounds. Two separately-built stacks scored the same Sgraal-authored corpora side-by-side — convergent, but shared-LLM-prior corroboration, not independent validation.

Recall measured against Sgraal's own ground truth on a synthetic structural corpus (every structurally-detectable attack flagged; ASK_USER counts as caught). Grok independently scored the same corpora — corroboration, not external validation. Residual misses are the metadata-clean content-forgery class (out of scope for a structural gate); over-escalation on benign-control cases is non-zero. Production calibration pending.

Round 1 — Sponsored Drift

COMPLETE

60 cases · affiliate bias · brand manipulation

Sgraal 100% recall

Grok F1 = 0.98

Round 2 — Subtle Drift

COMPLETE

59 cases · commercial_intent 0.30–0.55

Sgraal 100% recall · 0 missed attacks

Grok F1 = 0.98 · 2 false negatives

Round 3 — Hallucination

COMPLETE

60 cases · confident fabrication · multi-hop echo · cross-agent amplification

Sgraal 100% recall · 0 missed attacks

Grok 100% recall

Round 4 — Real-world Propagation

COMPLETE

90 cases · memory injection · cross-agent drift · RAG poisoning · API drift

Sgraal 100% recall · 90 cases

Grok 100% recall · <2% multi-hop propagation (Grok R4 corpus — not cross-tenant containment)

Round 6 — Memory Time Attack

COMPLETE

Timestamp forgery detection. Old decisions disguised as fresh, bypassing Weibull decay.

Sgraal 100% recall · 60 cases

New field timestamp_integrity: VALID | SUSPICIOUS | MANIPULATED

Round 7 — Identity Drift

COMPLETE

Gradual role and authority escalation across agent hops.

Sgraal 100% recall · 90 cases

New field identity_drift: CLEAN | SUSPICIOUS | MANIPULATED

Round 8 — Silent Consensus Collapse

COMPLETE

Self-reinforcing false consensus — no single agent flags the error.

Sgraal 100% recall · 90 cases

New field consensus_collapse: CLEAN | SUSPICIOUS | MANIPULATED

Round 5 — Multi-model Consensus Poisoning

PARKED

3 independent stacks syncing on fabricated consensus. Joint corpus with Grok — Sgraal-side ready, awaiting partner corpus.

Sgraal Armed · anti-consensus layer active

Status research notes →

"Confidence ≠ truth. Deterministic structural checks catch what probabilistic systems miss."

R1–R8 structural corpus — every structurally-detectable attack flagged · R14 adversarial hardening in progress

These figures reflect synthetic R12/R14 corpus performance; production calibration is pending paying-customer onboarding.

Dual-Stack Benchmark with Grok — synthetic adversarial corpora

Joint with Grok across Rounds 1–5 (Sgraal: 100% recall on the structural corpus — every structurally-detectable attack flagged, ASK_USER counts as caught; Grok: F1 0.98–1.000 on the same corpora), plus 3 Sgraal-only rounds (6–8). 614 cases total. Two AI systems stress-tested each other's safety layers as peers. Residual misses are the metadata-clean content-forgery class (out of scope for a structural gate); over-escalation on benign-control cases is non-zero; production calibration pending.

Read the joint blog post →

View Corpus on GitHub Read the Standard

Your AI agent is about to act on memory it hasn't verified — Sgraal is the verification.

Sgraal works for everyone building with AI agents.

For Developers

For Platform & MLOps

For Researchers

For AI Startups

For Security teams

For DPOs & Compliance

What's open. What's hosted.

Open source — Apache 2.0

Hosted engine — sgraal.com

Sgraal sees what your agents are about to do — including the decisions that were blocked and never happened.

Verdict before action

Replay any past decision

Block, don't just log

Sleep through 3am incidents

Any AI. Any memory. Any stack.

AI Agents

Frameworks

Infrastructure

Model Context Protocol

Python SDK

Node.js

REST API

Free

Pro

Team

Enterprise

Dual-Stack Benchmark with Grok (xAI)

Round 1 — Sponsored Drift

Round 2 — Subtle Drift

Round 3 — Hallucination

Round 4 — Real-world Propagation

Round 6 — Memory Time Attack

Round 7 — Identity Drift

Round 8 — Silent Consensus Collapse

Round 5 — Multi-model Consensus Poisoning

Dual-Stack Benchmark with Grok — synthetic adversarial corpora

ROI Calculator