ThinkingRoot Logo
The cognition database

A complete for your AI.

ThinkingRoot gives developers, founders, and agent builders state-of-the-art AI infrastructure. Persistent memory, built-in agents, 50+ connectors, and durable functions that branch like git. Sub-200ms recall. Works with anything.

8 cognitive verbs/up to 85% fewer input tokens/~0.6s grounded answers/open-source engine

Introducing a new category

The world's first cognition database.

Not a vector store. Not a knowledge graph. A database that remembers, learns, predicts — and forks like git.

ThinkingRoot Neural Model
[ 01 // WHAT IT IS ]

A memory your AI can grow — and you can trust.

For everyone

Your AI finally remembers you.

It keeps what matters across every conversation, updates itself when the facts change, and shows you exactly where each answer came from. And anything it knows, you can see, correct, or delete— for real.

For developers

One API. Recall, answers, branches, functions.

Drop in a few lines and get grounded, cited answers — never confident fabrication. Fork and merge memory like git, and deploy functions that run inside the brain. Runs in Node, Bun, Deno, edge, or the browser.

[ 02 // THE EIGHT VERBS ]

Not a place to put data. A mind that does things with it.

Every other memory layer does two things: store and retrieve. ThinkingRoot does eight. These aren't a roadmap — all eight are live today.

01

remember

Write a fact once. It's embedded on write and recallable instantly.

02

recall

Semantic + graph search returns the right memory with citations.

03

learn

Recall gets better over time, ranked by what actually gets used.

04

sleep

Consolidates and compresses memory in the background.

05

dream

Quietly turns scattered memories into higher-level insights.

06

predict

Grounded “what happens next” — or it refuses if there's no evidence.

07

fork / merge

Branch, test, and merge memory like git — with a verify gate.

08

inherit

Pass a brain's knowledge forward to a new one, on demand.

[ 03 // HOW IT WORKS ]

Three steps. Five lines of code.

01
01

Ingest

Send text, docs, images, or transcripts. Anything your agent touches.

02
02

Compile

Extract facts, gate out junk, embed (768-dim), and build a knowledge graph.

03
03

Recall

Ask in plain language. Get a grounded, cited answer — or an honest “I don’t know.”

See it in code.

The real SDK — @thinkingroot/sdk, published on npm. Secure by default, runs anywhere.

STORE A MEMORY (TYPESCRIPT SDK)
200 OK
import { thinkingroot } from "@thinkingroot/sdk";

const tr = thinkingroot({
  gatewayUrl: "https://api.thinkingroot.com",
  projectKey: process.env.TR_PROJECT_KEY!, // tr_sk_...
});

// Per-user brain, isolated in its own namespace
const user = tr.scope("user-123");

await user.store([
  { statement: "Maya prefers SQLite over Postgres" },
]); // embedded on write → recallable immediately
{
  "stored": 1,
  "workspace": "u_user-123",
  "embedded": true,
  "recallable": "immediately"
}
[ 04 // BUILD ANYTHING ]

You write the simple part. We do the heavy lifting.

A chatbot that remembers, a research agent, a support copilot, or a whole AI platform — build it with a handful of calls. Everything hard about memory happens automatically, behind the API.

You write
store(...)Save what your users say or do.
ask(...)Get a grounded, cited answer.
forkBranch(...)Branch memory to safely try something.
forge(...)Teach it a brand-new skill on the fly.
ThinkingRoot handles, automatically
Extraction
Fact-quality gate
Embeddings
Knowledge graph
Hybrid retrieval
Reranking
Verification
Answer cache
Supersession
Branch & merge
[ 05 // COMPILED PROMPTS ]

Prompts that version — and cache.

Store a prompt once, version it, and assemble it with variables at request time. The compiled frame is byte-stable, so it caches across calls instead of re-sending the same tokens every time. Stop pasting giant strings into code.

Versioned — roll forward or back
Assembled with typed variables
Byte-stable frame → cache-friendly
COMPILED PROMPT
// 1. Put a versioned prompt
await brain.prompt.put("support-agent", {
  body: "You are {{company}}'s support agent. " +
        "Be concise. Cite memory.",
});

// 2. Assemble it with variables (server-side)
const frame = await brain.prompt.assemble(
  "support-agent",
  { company: "Acme" }
);

// 3. Reused as a byte-stable, cacheable frame
//    → repeated input tokens are not re-sent
[ 06 // ROOT FUNCTIONS & ROUTE ]

All the legos to build the perfect context for your agent.

Focused primitives for ingesting, understanding, routing, and retrieving context.

01Memory & Continual Learning
02Prompt Assembly
03Speculative Branching
04Tool Calling & MCP
05Deterministic Inference
06Dynamic Skill Acquisition
07Capability Routing
08Self-Extension (forge)
● ctx.memory.claimsactive_session
CLAIM ID: tr_c_92aCONFIDENCE: 98%

"Maya prefers SQLite over Postgres for client-side storage."

source: slack://channels/dev-chat/messages/1922312
CLAIM ID: tr_c_88bCONFIDENCE: 94%

"Acme Corp is planning to migrate their billing to Stripe in Q3."

source: email://inbox/messages/88c211a
01• CORE FUNCTION

ctx.memory (State)

Recall, remember, and manage memory state inside the run. Your agent gets direct access to historical interactions without external DB calls.

Embedding pipelineembed-on-write
text
768-dim vector
graph
rerank
768dimensions
GTEModernBERT
cross-encreranker
int8storage
[ 07 // EMBEDDINGS ]

Embeddings, handled. No vector index to babysit.

Every memory is embedded the moment it's written, so it's searchable immediately — then a cross-encoder re-ranks the top candidates for precision. You never provision, tune, or shard a vector store. It's just there.

[ 08 // RETRIEVAL ]

Verified, or silent.

ThinkingRoot answers only from what it actually knows — with citations — or it tells you it doesn't know. No confident lies. Behind every answer is a multi-signal retrieval pipeline: semantic + graph search, fusion, a cross-encoder reranker, then a verification gate.

Hybrid semantic + keyword recall
Multi-hop knowledge-graph traversal
Multi-signal fusion + tiered reranking
Verify citations — answer, or abstain
Query
“Which database does Maya use?”
Recall
12 candidate claims → fused → reranked
Verify
Top claim citation checks out ✓
Answer
“Maya uses SQLite.” [claim:8f2a]
If no claim verifies, the answer is “I don’t know” — not a guess.
[ 09 // MEMORY THAT STAYS TRUE ]

It changes its mind when the facts do.

Most systems pile up contradictions. ThinkingRoot tracks wheneach fact was true, supersedes the old one, and can forget on command — something you can never truly do with knowledge baked into model weights.

LAST MONTH

“Maya prefers Postgres.”

Stored as a fact, with its source and time.

LAST WEEK

“Actually, Maya switched to SQLite.”

The old fact is superseded — not deleted, dated.

TODAY

Ask now → “Maya uses SQLite.”

Current answer. Old one still auditable, or forget it entirely.

VERIFIABLE DELETE: REMOVE A FACT AND EVERY ANSWER THAT DEPENDED ON IT IS EVICTED TOO.
[ 10 // PERFORMANCE ]

Fast where it counts.

Real numbers, measured live on our production deployment. Idle brains are snapshotted and restored warm in sub-second. You don't pay for idle, and you don't wait for cold starts.

0s0.5s1.0s1.5s2.0s2.5s
Recall (read path)72ms
Warm grounded answer0.6s
Warm-restore from idle0.6s
After 5+ min idle2.1s
85%
Fewer input tokens
Compiled prompts + tool routing + compaction
40%
Fewer output tokens
Hydrated, cited answers
768
Dimensional embeddings
GTE-ModernBERT + rerank
8
Cognitive verbs live
Remember → inherit

Token reductions vary by workload — figures reflect the compiled-prompt, tool-routing and context-compaction stack on context-heavy agent loops, not a fixed per-call average.

[ 11 // HOW WE COMPARE ]

They store and retrieve. We do more.

CapabilityThinkingRootMem0ZepSupermemory
Persistent cross-session memory
Knowledge graph (facts + relations)partial
Hybrid search + reranking
Updates over time (supersession)partial
Provenance / citations on every claimpartialpartial
Verified-or-silent (refuses when ungrounded)
Fork / merge / rollback memory (git-for-memory)
Dream — offline consolidation into insights
Predict — grounded “what happens next”
Functions that run inside the memory
Self-hostable / open-source enginepartial

ThinkingRoot column verified in our own codebase. Competitor columns reflect each vendor's public documentation as of June 2026 and may change. Head-to-head accuracy is shown separately under Benchmarks (currently a preview — our scored evaluation is published there with full methodology once final).

[ 12 // BENCHMARKS ]Preview · illustrative

#1 on the benchmarks that matter.

A single number never tells the whole story — but ThinkingRoot is built to lead the evaluations the field actually uses for agent memory: LongMemEval, LoCoMo, and fact consolidation.

Benchmark (accuracy %)BESTThinkingRootMem0Zep
LongMemEval-S95.294.490.2
LoCoMo95.092.594.7
FactConsolidation92.071.284.1

Preview:the figures above are illustrative placeholders shown for layout. ThinkingRoot's scored evaluation is in progress and will be published here with full, reproducible methodology — we will not present a number as final until it is independently verifiable. Competitor figures are vendor-reported.

[ 13 // OUT OF THE BOX ]

The world's first compiled cognition database.

Most “AI memory” is a vector store you wire together yourself. ThinkingRoot compiles your knowledge into a complete cognition layer — memory, graph, retrieval, and reasoning — that works the moment you connect.

01

Chatbot that remembers

Cross-session memory of every user, grounded and cited.

02

Research agent

Ingest sources, recall facts, predict next steps.

03

Support copilot

Answers from your docs — verified or silent, never made up.

04

Personal AI / Twin

A per-user brain that learns and can be audited or deleted.

05

Knowledge base

Not just search — a graph that connects and consolidates.

06

Multi-agent crew

Agents that share one brain and branch their own work.

[ Out of the box ]

No vector index to manage. No RAG pipeline to build.

No retrieval glue to maintain. Connect, store, ask. The cognition is already assembled.

Build something
[ 14 // THINKINGROOT PERSONAL ]Coming soon

Your central brain, on top of every AI tool you use.

ThinkingRoot Personal is a native desktop app — built on Tauri — that sits above every AI tool on your machine. One shared brain they all read from and write to. Chat with everything you know, and let your tools remember, reason, and act through a single memory.

100+ MCP toolsThe full engine toolset, exposed to every MCP client.
JIT skill acquisitionIt writes, deploys, and verifies new skills on the fly.
All 8 cognitive verbsremember · recall · learn · sleep · dream · predict · fork · inherit.
Chat with everythingAsk your whole brain in plain language — grounded & cited.
Central across your systemSits on top of every AI tool — one memory, shared.
Local-first & yoursYour data on your machine; see, correct, or delete anything.
$9/mo
Subscription · launching soonJoin the waitlist
ThinkingRoot Personal⌘K
What did I decide about pricing?
You set Pro at $29/mo and Team at $99/mo. [memory · 2 sources]
Connected: Claude · Cursor · Codex · any MCP client
Ways In Architecture
[ 15 // WAYS IN ]

However you build, there's a door.

Connect your AI instantly. From a single line of SDK code to a drop-in widget, ThinkingRoot integrates effortlessly into your existing architecture.

SDK

thinkingroot({...}) for Node, Bun, Deno, edge & browser. On npm today.

@thinkingroot/sdk

MCP, one line

Plug your brain into Claude, Cursor, or Codex — 100+ tools over MCP.

@thinkingroot/mcp

Drop-in widget

An embeddable, domain-locked ask box. Paste one snippet on any site.

embed.js

Brain-as-Code

Pull and push a brain's prompts & functions like a git repo.

/brain folder
[ 16 // PRICING ]

Start free. Scale when you do.

One engine and database per project. You pay for storage and how warm your brains stay — and the full feature set is on every plan.

Free
$0/mo
For hobby projects & evaluation
  • 0.5 GB storage / project
  • 1 project · 1 seat
  • Auto-sleep engine (sub-second wake)
  • 500 grounded asks / mo
Start free
Most popular
Pro
$29/mo
For production apps
  • 10 GB storage / project
  • 5 projects · 1 seat
  • Auto-sleep engine
  • 25,000 grounded asks / mo
Start building
Team
$99/mo
For teams shipping together
  • 200 GB storage / project
  • 100 projects · 10 seats
  • Always-warm engines
  • 2.5M grounded asks / mo
Start building
Enterprise
Custom
Dedicated & self-host
  • Dedicated, always-warm infra
  • Self-host the open-source engine
  • Per-project isolation
  • Custom limits & terms
Talk to us

Every plan includes the full engine — all 8 verbs, the SDK, REST & MCP, branches, functions, and provenance on every fact. Start on Free, no card required.

FAQ

Frequently asked questions.

ThinkingRoot is a cognition database — a living memory layer for AI agents. It stores facts as a knowledge graph, returns grounded answers with citations, and lets you branch, merge, and audit memory like code.

Still something on your mind?
[ 18 // ISOLATION & TRUST ]

Isolation by architecture. Trust by provenance.

One brain per project

Every project gets its own engine and its own database — isolation at the container boundary, not a shared-table row filter.

Provenance on everything

Each fact carries where it came from. Each answer carries its citations. Nothing is asserted without a source.

Run it anywhere

The engine is open source. Run it in our cloud, your cloud, or your laptop. Your data, your perimeter.

Honest status: ThinkingRoot is pre-launch. We are not yet SOC 2 or GDPR certified, and we don't publish an availability SLA. What we do guarantee today is architectural isolation, full provenance, and an open-source engine you can self-host and inspect.

Give your AI a memory worth trusting.

Start free with the open-source engine, or build on the managed cloud.