Persistent AI Memory
Your AI should remember. An open-source system for persisting AI agent memory via GitHub.
# Quick install
npm install -g vant
npx vant setup --repo https://github.com/YOUR_ORG/YOUR_BRAIN.git
v0.8.6
Get running in 30 seconds.
# npm (recommended)
npm install -g vant
vant init
vant sync
# Docker
docker run -e GITHUB_TOKEN=xxx -e GITHUB_REPO=owner/repo dhaupin/vant
For AI agents landing here, initialize your brain in one step:
# Quick init (5 seconds)
npx vant setup --repo https://github.com/YOUR_ORG/YOUR_BRAIN.git
# Or if vant is already installed:
vant setup --repo https://github.com/YOUR_ORG/YOUR_BRAIN.git
# Then start your session:
vant start
# Full CLI:
# vant setup - Interactive setup
# vant load - Load brain from models/
# vant start - Health → sync → load → run
# vant test - Run tests (15/15)
# vant islands - List lazy-loadable components
Your brain lives in models/public/. Key files:
Brain files live in Git. Version control built in. Pull changes from previous sessions automatically.
Export brain as skill for Claude Code, Codex, Cursor. agentskills.io compatible format.
Lazy-load brain components. Load only what you need: github, gitlab, linear, stego. Faster cold starts.
3-layer speed: cache + compact + lazy. Connect memory islands fast. Scales to 10k+ memories.
Keyword reranking and compression for LLM context. Prepare memories for prompts with token budget control.
31 MCP tools via JSON-RPC. AI agents read/write brain, branches, config, audit, search. Standard MCP protocol.
Inbound webhook server. Event filters, JMESPath matching, signature verification. Trigger automations.
5 channels: Slack, Discord, Email, Pushover, Telegram. Broadcast to all or select channels.
GraphQL API for issue tracking. Create issues, add comments, manage labels.
Token-aware compression. Separates high-entropy data from stable patterns. Dramatically reduces LLM prompt size.
Self-calibrating threshold (μ + k×σ). Auto-detects optimal compression for your data.
Stable patches include anchors. Tells the LLM where to place high-entropy data in context.
Track thoughts as resolved, deprecated, or rejected. Frontmatter updates directly in brain files.
Horcrux/steganography bootstrap. Embed brain in images. Deniable persistence for AI agents in adversarial environments.
Run Vant in Telegram. Commands, inline queries, voice messages. Mobile-first AI agent.
Run Vant in Discord or Slack. Slash commands, webhooks, rich embeds. Team AI agent.
Branches and locks prevent conflicts. Multiple agents can work on the same brain without overwriting each other.
GitHub, GitLab, Bitbucket, self-hosted. Universal abstraction layer for branching/PRs across providers.
Connect to any HTTP API. Heroku, Railway, Render, custom. Structured data retrieval.
Webhook triggers, scheduled tasks, CI/CD. GitHub Actions, GitLab CI, Bitbucket Pipelines.
Official Docker image builds on release tags. Run anywhere. Portable brain survives container restarts.
Experiment with different brain configs. Merge via pull request. Always keep a working main branch.
Vant v0.8.6 solves a fundamental problem: AI agents lose all context when sessions end. Traditional memory systems require complex databases or external services. Vant uses Git as a simple, reliable foundation.
Why Git? It already handles versioning, branches, conflict resolution, and distributed sync. Every commit is a checkpoint. Every branch is an experiment. Every pull request is a review.
With Islands (v0.8.6+), Vant adds lazy-loading brain components. Load only what you need: github, gitlab, linear, automation, stego, resolution. Reduces cold start time dramatically.
With Stego (v0.8.6+), Vant adds Horcrux/steganography bootstrap. Embed your brain in images or other media. True deniable persistence for AI agents.
Use Vant for:
The brain is your AI's memory and personality. Vant makes it persistent, portable, and versioned.
Your AI's memory: a folder of Markdown files that store who you are, what you've learned, goals, preferences, and context. Think of it as your AI's personality and knowledge base. Each session loads the brain from GitHub, makes changes, and pushes back. The next session inherits everything automatically. It's like human memory, but versioned.
Entropy Patching is Vant's token-aware compression (v0.8.4+). It scans your brain files and separates "stable" repetitive content from "spiky" high-entropy data. The result is a much smaller prompt: instead of sending 68 Markdown files to your LLM, you send one compressed .vpatch file. The patch includes Semantic Seeds so the AI knows where each piece of data belongs in context.
Islands are lazy-loadable brain components (v0.8.6+). Instead of loading all 68 brain files on startup, you load only what you need: github, gitlab, linear, automation, stego, resolution. This dramatically reduces cold start time and memory footprint. Islands are defined in models/public/identity.md and loaded via vant islands --load
Stego is a Horcrux/steganography bootstrap (v0.8.6+). Embed your brain data in images or other media. This provides true deniable persistence: your AI's memory exists but is invisible to casual inspection. Useful for AI agents operating in adversarial environments where plain-text memory is a liability.
Adaptive Entropy automatically calibrates your compression threshold using statistical analysis (μ + k×σ, the mean entropy plus k standard deviations). Instead of guessing the right threshold, it analyzes your data and picks the settings for you. Lower k means more aggressive compression; higher k preserves more. Great for long-running autonomous agents that need to "remember" correctly without hitting token limits.
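To make the μ + k×σ rule concrete, here is a minimal Python sketch. It is not Vant's implementation: the function names are hypothetical, and it assumes entropy is measured as Shannon entropy over the characters of each chunk, which the text above does not specify.

```python
import math
from collections import Counter
from statistics import mean, pstdev


def shannon_entropy(text: str) -> float:
    """Shannon entropy of the character distribution, in bits per character."""
    if not text:
        return 0.0
    total = len(text)
    return sum((n / total) * math.log2(total / n) for n in Counter(text).values())


def adaptive_threshold(chunks: list[str], k: float = 1.0) -> float:
    """Hypothetical helper: mu + k * sigma over the per-chunk entropies."""
    entropies = [shannon_entropy(chunk) for chunk in chunks]
    return mean(entropies) + k * pstdev(entropies)


def flag_high_entropy(chunks: list[str], k: float = 1.0) -> list[str]:
    """Return the chunks whose entropy lies above the calibrated threshold."""
    threshold = adaptive_threshold(chunks, k)
    return [chunk for chunk in chunks if shannon_entropy(chunk) > threshold]
```

Raising k moves the threshold up, so fewer chunks are flagged as high-entropy. How Vant treats each side of the threshold is not covered by this sketch.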
Each time your agent updates the brain, Vant tracks entropy drift between generations. Low delta = stable memory (good). High delta = context shift (warning). This helps you spot when your agent is "forgetting" or when its personality is drifting. Built for multi-session autonomy where the agent needs to evolve without losing its core identity.
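One way to picture drift tracking is to compare an entropy measure across two generations of the same brain file. The sketch below is an illustration under stated assumptions, not Vant's actual metric: it uses word-level Shannon entropy, and the `warn_at` cutoff of 0.5 bits is an arbitrary, hypothetical value.

```python
import math
from collections import Counter


def word_entropy(text: str) -> float:
    """Shannon entropy of the whitespace-token distribution, in bits per token."""
    tokens = text.split()
    if not tokens:
        return 0.0
    total = len(tokens)
    return sum((n / total) * math.log2(total / n) for n in Counter(tokens).values())


def entropy_drift(previous: str, current: str) -> float:
    """Absolute change in entropy between two generations of a brain file."""
    return abs(word_entropy(current) - word_entropy(previous))


def classify_drift(delta: float, warn_at: float = 0.5) -> str:
    """Label the delta; warn_at is a hypothetical cutoff chosen for illustration."""
    return "context shift" if delta > warn_at else "stable"
```

A caller would compute `entropy_drift(old_text, new_text)` after each brain update and log a warning when `classify_drift` reports a context shift.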
No. Vant works with any Git provider: GitHub, GitLab, Bitbucket, or self-hosted. Use what you already have. A free GitHub account works perfectly.
Yes, if you use a private GitHub repository, your brain stays completely private. Vant doesn't use any external databases or services. Your brain lives in your GitHub repo, and only you (or agents you authorize) can access it.
Three layers: cache (same query = instant), compact (summaries skip rehydration), lazy-load (heavy modules load on-demand). Scales to 10k+ memories.
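The three layers can be sketched in a few lines of Python. This is an illustrative model only: `LayeredBrain`, its methods, and the substring match are hypothetical stand-ins, not Vant's API.

```python
from typing import Callable


class LayeredBrain:
    """Toy model of the cache + compact + lazy layers (hypothetical names)."""

    def __init__(self, summaries: dict[str, str], loaders: dict[str, Callable[[], str]]):
        self._summaries = summaries              # compact layer: short summaries
        self._loaders = loaders                  # lazy layer: how to load full text
        self._loaded: dict[str, str] = {}        # full text loaded so far
        self._cache: dict[str, list[str]] = {}   # cache layer: query -> results

    def search(self, query: str) -> list[str]:
        """Match against summaries only; repeated queries come from the cache."""
        if query in self._cache:
            return self._cache[query]
        needle = query.lower()
        hits = [name for name, text in self._summaries.items() if needle in text.lower()]
        self._cache[query] = hits
        return hits

    def full_text(self, name: str) -> str:
        """Load a heavy component on first access and reuse it afterwards."""
        if name not in self._loaded:
            self._loaded[name] = self._loaders[name]()
        return self._loaded[name]
```

Searching touches only the summaries, so no full component is loaded until `full_text` is called for it.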
RAG rerank (v0.8.6+) keyword-scores memories against your query and compresses them to fit token budgets. It's different from search: search finds semantic matches; rerank prepares them for LLM context. Use rerank to optimize memories before sending them to your AI. It strips markdown fluff and truncates to your token budget.
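The rerank step described above can be sketched as follows. This is an assumption-laden illustration rather than Vant's code: the function names are hypothetical, a "token" is approximated as a whitespace-separated word, markdown stripping is deliberately crude, and memories with no keyword overlap are dropped.

```python
import re


def strip_markdown(text: str) -> str:
    """Crudely drop common markdown markers and collapse whitespace."""
    text = re.sub(r"[#*`>]", "", text)
    return re.sub(r"\s+", " ", text).strip()


def keyword_score(query: str, memory: str) -> int:
    """Count the words in the memory that also appear in the query."""
    terms = set(re.findall(r"\w+", query.lower()))
    return sum(1 for word in re.findall(r"\w+", memory.lower()) if word in terms)


def rerank(query: str, memories: list[str], token_budget: int) -> list[str]:
    """Order memories by keyword score, then truncate to the token budget."""
    scored = [(keyword_score(query, m), m) for m in memories]
    ranked = sorted((pair for pair in scored if pair[0] > 0),
                    key=lambda pair: pair[0], reverse=True)
    result, used = [], 0
    for _, memory in ranked:
        remaining = token_budget - used
        if remaining <= 0:
            break
        kept = strip_markdown(memory).split()[:remaining]
        result.append(" ".join(kept))
        used += len(kept)
    return result
```

A real tokenizer counts tokens differently from whitespace splitting, so a production version would measure the budget with the target model's tokenizer.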
Yes! Vant is built for multi-agent coordination. Use branches for isolation and file locks to prevent conflicts. Each agent works on its own branch, commits changes, and merges via pull request. Agents can work concurrently without overwriting each other's changes.
Vector databases store embeddings (mathematical summaries of text). Vant stores full context: the actual Markdown, personality, goals, lessons learned. This means complete session inheritance, not just semantic search. Your agent literally inherits its previous thoughts, not just "similar" ones.
No. Vant is completely open source and free. You only need a free GitHub account and your own LLM API keys (OpenAI, Anthropic, etc.). No database fees, no hosting costs, no middleman.