How is ThinkingRoot different from a vector database?

A vector database stores and retrieves chunks. ThinkingRoot also learns from which memories get used, predicts what happens next, consolidates memory while idle (dream), and forks and merges like git — with provenance on every fact and verifiable deletion.

Is ThinkingRoot open source?

Yes. The ThinkingRoot engine is open source and self-hostable — run it in our cloud, your cloud, or your laptop. The managed cloud adds accounts, a secure gateway, and project provisioning.

How does ThinkingRoot avoid hallucinations?

It answers only from what it can cite. If no stored claim verifies the answer, it says it does not know instead of guessing. We call this 'verified or silent.'

What languages and tools does ThinkingRoot support?

A TypeScript and JavaScript SDK that runs in Node, Bun, Deno, edge, and the browser; a REST API; and a one-line MCP server that plugs into Claude, Cursor, and Codex.

Can I delete what an AI knows about a user?

Yes. Because ThinkingRoot stores facts in a graph, you can remove a specific fact, and every cached answer that depended on it is evicted too — something impossible to do cleanly with knowledge baked into model weights.

The cognition database

A complete for your AI.

ThinkingRoot gives developers, founders, and agent builders state-of-the-art AI infrastructure. Persistent memory, built-in agents, 50+ connectors, and durable functions that branch like git. Sub-200ms recall. Works with anything.

Start Building Talk to Founder

8 cognitive verbs/up to 85% fewer input tokens/~0.6s grounded answers/open-source engine

Introducing a new category

The world's first cognition database.

Not a vector store. Not a knowledge graph. A database that remembers, learns, predicts — and forks like git.

[ 01 // WHAT IT IS ]

A memory your AI can grow — and you can trust.

For everyone

Your AI finally remembers you.

It keeps what matters across every conversation, updates itself when the facts change, and shows you exactly where each answer came from. And anything it knows, you can see, correct, or delete— for real.

For developers

One API. Recall, answers, branches, functions.

Drop in a few lines and get grounded, cited answers — never confident fabrication. Fork and merge memory like git, and deploy functions that run inside the brain. Runs in Node, Bun, Deno, edge, or the browser.

[ 02 // THE EIGHT VERBS ]

Not a place to put data. A mind that does things with it.

Every other memory layer does two things: store and retrieve. ThinkingRoot does eight. These aren't a roadmap — all eight are live today.

remember

Write a fact once. It's embedded on write and recallable instantly.

recall

Semantic + graph search returns the right memory with citations.

learn

Recall gets better over time, ranked by what actually gets used.

sleep

Consolidates and compresses memory in the background.

dream

Quietly turns scattered memories into higher-level insights.

predict

Grounded “what happens next” — or it refuses if there's no evidence.

fork / merge

Branch, test, and merge memory like git — with a verify gate.

inherit

Pass a brain's knowledge forward to a new one, on demand.

[ 03 // HOW IT WORKS ]

Three steps. Five lines of code.

Ingest

Send text, docs, images, or transcripts. Anything your agent touches.

Compile

Extract facts, gate out junk, embed (768-dim), and build a knowledge graph.

Recall

Ask in plain language. Get a grounded, cited answer — or an honest “I don’t know.”

See it in code.

The real SDK — @thinkingroot/sdk, published on npm. Secure by default, runs anywhere.

STORE A MEMORY (TYPESCRIPT SDK)

200 OK

import { thinkingroot } from "@thinkingroot/sdk";

const tr = thinkingroot({
  gatewayUrl: "https://api.thinkingroot.com",
  projectKey: process.env.TR_PROJECT_KEY!, // tr_sk_...
});

// Per-user brain, isolated in its own namespace
const user = tr.scope("user-123");

await user.store([
  { statement: "Maya prefers SQLite over Postgres" },
]); // embedded on write → recallable immediately

{
  "stored": 1,
  "workspace": "u_user-123",
  "embedded": true,
  "recallable": "immediately"
}

[ 04 // BUILD ANYTHING ]

You write the simple part. We do the heavy lifting.

A chatbot that remembers, a research agent, a support copilot, or a whole AI platform — build it with a handful of calls. Everything hard about memory happens automatically, behind the API.

You write

store(...)Save what your users say or do.

ask(...)Get a grounded, cited answer.

forkBranch(...)Branch memory to safely try something.

forge(...)Teach it a brand-new skill on the fly.

ThinkingRoot handles, automatically

Extraction

Fact-quality gate

Embeddings

Knowledge graph

Hybrid retrieval

Reranking

Verification

Answer cache

Supersession

Branch & merge

[ 05 // COMPILED PROMPTS ]

Prompts that version — and cache.

Store a prompt once, version it, and assemble it with variables at request time. The compiled frame is byte-stable, so it caches across calls instead of re-sending the same tokens every time. Stop pasting giant strings into code.

Versioned — roll forward or back

Assembled with typed variables

Byte-stable frame → cache-friendly

COMPILED PROMPT

// 1. Put a versioned prompt
await brain.prompt.put("support-agent", {
  body: "You are {{company}}'s support agent. " +
        "Be concise. Cite memory.",
});

// 2. Assemble it with variables (server-side)
const frame = await brain.prompt.assemble(
  "support-agent",
  { company: "Acme" }
);

// 3. Reused as a byte-stable, cacheable frame
//    → repeated input tokens are not re-sent

[ 06 // ROOT FUNCTIONS & ROUTE ]

All the legos to build the perfect context for your agent.

Focused primitives for ingesting, understanding, routing, and retrieving context.

01Memory & Continual Learning

02Prompt Assembly

03Speculative Branching

04Tool Calling & MCP

05Deterministic Inference

06Dynamic Skill Acquisition

07Capability Routing

08Self-Extension (forge)

● ctx.memory.claimsactive_session

CLAIM ID: tr_c_92aCONFIDENCE: 98%

"Maya prefers SQLite over Postgres for client-side storage."

source: slack://channels/dev-chat/messages/1922312

CLAIM ID: tr_c_88bCONFIDENCE: 94%

"Acme Corp is planning to migrate their billing to Stripe in Q3."

source: email://inbox/messages/88c211a

01• CORE FUNCTION

ctx.memory (State)

Recall, remember, and manage memory state inside the run. Your agent gets direct access to historical interactions without external DB calls.

Embedding pipelineembed-on-write

text

768-dim vector

graph

rerank

768dimensions

GTEModernBERT

cross-encreranker

int8storage

[ 07 // EMBEDDINGS ]

Embeddings, handled. No vector index to babysit.

Every memory is embedded the moment it's written, so it's searchable immediately — then a cross-encoder re-ranks the top candidates for precision. You never provision, tune, or shard a vector store. It's just there.

[ 08 // RETRIEVAL ]

Verified, or silent.

ThinkingRoot answers only from what it actually knows — with citations — or it tells you it doesn't know. No confident lies. Behind every answer is a multi-signal retrieval pipeline: semantic + graph search, fusion, a cross-encoder reranker, then a verification gate.

Hybrid semantic + keyword recall

Multi-hop knowledge-graph traversal

Multi-signal fusion + tiered reranking

Verify citations — answer, or abstain

Query

“Which database does Maya use?”

Recall

12 candidate claims → fused → reranked

Verify

Top claim citation checks out ✓

Answer

“Maya uses SQLite.” [claim:8f2a]

If no claim verifies, the answer is “I don’t know” — not a guess.

[ 09 // MEMORY THAT STAYS TRUE ]

It changes its mind when the facts do.

Most systems pile up contradictions. ThinkingRoot tracks wheneach fact was true, supersedes the old one, and can forget on command — something you can never truly do with knowledge baked into model weights.

LAST MONTH

“Maya prefers Postgres.”

Stored as a fact, with its source and time.

LAST WEEK

“Actually, Maya switched to SQLite.”

The old fact is superseded — not deleted, dated.

TODAY

Ask now → “Maya uses SQLite.”

Current answer. Old one still auditable, or forget it entirely.

VERIFIABLE DELETE: REMOVE A FACT AND EVERY ANSWER THAT DEPENDED ON IT IS EVICTED TOO.

[ 10 // PERFORMANCE ]

Fast where it counts.

Real numbers, measured live on our production deployment. Idle brains are snapshotted and restored warm in sub-second. You don't pay for idle, and you don't wait for cold starts.

0s0.5s1.0s1.5s2.0s2.5s

Recall (read path)72ms

Warm grounded answer0.6s

Warm-restore from idle0.6s

After 5+ min idle2.1s

85%

Fewer input tokens

Compiled prompts + tool routing + compaction

40%

Fewer output tokens

Hydrated, cited answers

768

Dimensional embeddings

GTE-ModernBERT + rerank

Cognitive verbs live

Remember → inherit

Token reductions vary by workload — figures reflect the compiled-prompt, tool-routing and context-compaction stack on context-heavy agent loops, not a fixed per-call average.

[ 11 // HOW WE COMPARE ]

They store and retrieve. We do more.

Capability	Mem0	Zep	Supermemory
Persistent cross-session memory
Knowledge graph (facts + relations)	partial
Hybrid search + reranking
Updates over time (supersession)	partial
Provenance / citations on every claim	partial		partial
Verified-or-silent (refuses when ungrounded)
Fork / merge / rollback memory (git-for-memory)
Dream — offline consolidation into insights
Predict — grounded “what happens next”
Functions that run inside the memory
Self-hostable / open-source engine		partial

ThinkingRoot column verified in our own codebase. Competitor columns reflect each vendor's public documentation as of June 2026 and may change. Head-to-head accuracy is shown separately under Benchmarks (currently a preview — our scored evaluation is published there with full methodology once final).

[ 12 // BENCHMARKS ]Preview · illustrative

#1 on the benchmarks that matter.

A single number never tells the whole story — but ThinkingRoot is built to lead the evaluations the field actually uses for agent memory: LongMemEval, LoCoMo, and fact consolidation.

Benchmark (accuracy %)	BESTThinkingRoot	Mem0	Zep
LongMemEval-S	95.2	94.4	90.2
LoCoMo	95.0	92.5	94.7
FactConsolidation	92.0	71.2	84.1

Preview:the figures above are illustrative placeholders shown for layout. ThinkingRoot's scored evaluation is in progress and will be published here with full, reproducible methodology — we will not present a number as final until it is independently verifiable. Competitor figures are vendor-reported.

[ 13 // OUT OF THE BOX ]

The world's first compiled cognition database.

Most “AI memory” is a vector store you wire together yourself. ThinkingRoot compiles your knowledge into a complete cognition layer — memory, graph, retrieval, and reasoning — that works the moment you connect.

Chatbot that remembers

Cross-session memory of every user, grounded and cited.

Research agent

Ingest sources, recall facts, predict next steps.

Support copilot

Answers from your docs — verified or silent, never made up.

Personal AI / Twin

A per-user brain that learns and can be audited or deleted.

Knowledge base

Not just search — a graph that connects and consolidates.

Multi-agent crew

Agents that share one brain and branch their own work.

[ Out of the box ]

No vector index to manage.
No RAG pipeline to build.

No retrieval glue to maintain. Connect, store, ask. The cognition is already assembled.

Build something

[ 14 // THINKINGROOT PERSONAL ]Coming soon

Your central brain, on top of every AI tool you use.

ThinkingRoot Personal is a native desktop app — built on Tauri — that sits above every AI tool on your machine. One shared brain they all read from and write to. Chat with everything you know, and let your tools remember, reason, and act through a single memory.

100+ MCP toolsThe full engine toolset, exposed to every MCP client.

JIT skill acquisitionIt writes, deploys, and verifies new skills on the fly.

All 8 cognitive verbsremember · recall · learn · sleep · dream · predict · fork · inherit.

Chat with everythingAsk your whole brain in plain language — grounded & cited.

Central across your systemSits on top of every AI tool — one memory, shared.

Local-first & yoursYour data on your machine; see, correct, or delete anything.

$9/mo

Subscription · launching soonJoin the waitlist

ThinkingRoot Personal⌘K

What did I decide about pricing?

You set Pro at $29/mo and Team at $99/mo. [memory · 2 sources]

Connected: Claude · Cursor · Codex · any MCP client

[ 15 // WAYS IN ]

However you build, there's a door.

Connect your AI instantly. From a single line of SDK code to a drop-in widget, ThinkingRoot integrates effortlessly into your existing architecture.

SDK

thinkingroot({...}) for Node, Bun, Deno, edge & browser. On npm today.

@thinkingroot/sdk

MCP, one line

Plug your brain into Claude, Cursor, or Codex — 100+ tools over MCP.

@thinkingroot/mcp

Drop-in widget

An embeddable, domain-locked ask box. Paste one snippet on any site.

embed.js

Brain-as-Code

Pull and push a brain's prompts & functions like a git repo.

/brain folder

[ 16 // PRICING ]

Start free. Scale when you do.

One engine and database per project. You pay for storage and how warm your brains stay — and the full feature set is on every plan.

Free

$0/mo

For hobby projects & evaluation

0.5 GB storage / project
1 project · 1 seat
Auto-sleep engine (sub-second wake)
500 grounded asks / mo

Start free

Frequently asked questions.

ThinkingRoot is a cognition database — a living memory layer for AI agents. It stores facts as a knowledge graph, returns grounded answers with citations, and lets you branch, merge, and audit memory like code.

Still something on your mind?

Read the docs Talk to founder

[ 18 // ISOLATION & TRUST ]

Isolation by architecture. Trust by provenance.

One brain per project

Every project gets its own engine and its own database — isolation at the container boundary, not a shared-table row filter.

Provenance on everything

Each fact carries where it came from. Each answer carries its citations. Nothing is asserted without a source.

Run it anywhere

The engine is open source. Run it in our cloud, your cloud, or your laptop. Your data, your perimeter.

Honest status: ThinkingRoot is pre-launch. We are not yet SOC 2 or GDPR certified, and we don't publish an availability SLA. What we do guarantee today is architectural isolation, full provenance, and an open-source engine you can self-host and inspect.

Give your AI a memory worth trusting.

Start free with the open-source engine, or build on the managed cloud.

Start Building Talk to Founder

A complete for your AI.

The world's first cognition database.

A memory your AI can grow — and you can trust.

Your AI finally remembers you.

One API. Recall, answers, branches, functions.

Not a place to put data. A mind that does things with it.

remember

recall

learn

sleep

dream

predict

fork / merge

inherit

Three steps. Five lines of code.

Ingest

Compile

Recall

See it in code.

You write the simple part. We do the heavy lifting.

Prompts that version — and cache.

All the legos to build the perfect context for your agent.

ctx.memory (State)

Embeddings, handled. No vector index to babysit.

Verified, or silent.

It changes its mind when the facts do.

Fast where it counts.

They store and retrieve. We do more.

#1 on the benchmarks that matter.

The world's first compiled cognition database.

Chatbot that remembers

Research agent

Support copilot

Personal AI / Twin

Knowledge base

Multi-agent crew

No vector index to manage. No RAG pipeline to build.

Your central brain, on top of every AI tool you use.

However you build, there's a door.

SDK

MCP, one line

Drop-in widget

Brain-as-Code

Start free. Scale when you do.

Frequently asked questions.

Isolation by architecture. Trust by provenance.

One brain per project

Provenance on everything

Run it anywhere

Give your AI a memory worth trusting.

No vector index to manage.
No RAG pipeline to build.