
Karpathy's 400,000-Word Obsidian Wiki Has Zero RAG Infrastructure

Andrej Karpathy posted a short tweet last week that made about half the personal RAG stacks on the internet look like massive overkill. No vector database. No embeddings. No retrieval chain. Just a folder of markdown files, Obsidian, and one schema file that Claude Code reads every session. He is running a 400,000-word personal research wiki on it.

This post is the complete walkthrough of the pattern, the exact schema file every other tutorial glosses over, and a downloadable Obsidian vault template you can use today.

The Core Insight

Most people's experience with LLMs and documents looks like standard RAG. You upload a collection of files, the model retrieves relevant chunks at query time, and generates an answer. This works, but the model is rediscovering knowledge from scratch on every question. There is no accumulation. Ask a subtle question twice, the model has to find and piece together the relevant fragments twice.

The wiki pattern inverts this. Instead of retrieving from raw sources at query time, the LLM incrementally builds and maintains a structured wiki that sits between you and the raw sources. When you add a new source, the model reads it, extracts the key information, and integrates it into the existing wiki. Updating entity pages, revising topic summaries, noting contradictions, strengthening the synthesis. The knowledge is compiled once and then kept current, not re-derived on every query.

The wiki is a persistent, compounding artifact. The cross-references are already there. The contradictions have already been flagged. The synthesis already reflects everything you have read. It gets richer with every source you add and every question you ask.

The Three Layers

Every version of this pattern has the same three layers. Get these right and nothing else matters much.

Layer 1: raw/

This is your inbox. Articles, papers, transcripts, pasted notes, screenshots. The LLM reads from this folder and never writes to it. Everything in here is immutable. It is your source of truth.

Layer 2: wiki/

This is where the LLM lives. It writes summaries, entity pages, concept pages, topic indexes, and the master index. You read this layer in Obsidian. You do not write in it. Manual edits cause the system to drift session over session.

Layer 3: CLAUDE.md

One file at the vault root. The schema. This is what turns a generic Claude Code session into a disciplined librarian. Every session reads this file first. Every operation follows rules from this file. This is the piece every other tutorial skips.

Optional fourth layer: **output/** for query results, reports, slide decks, and generated artifacts. Good outputs get promoted into the wiki as new articles so your explorations compound.
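The three folders and the schema stub are simple enough to scaffold in a few lines. A minimal sketch in Python: the folder names follow the layers above, but the stub text and file names like `_index.md` are illustrative, not the template's exact contents.

```python
from pathlib import Path

# Illustrative schema stub -- the real CLAUDE.md is around 60 lines.
CLAUDE_MD_STUB = (
    "You are the librarian of this vault. The wiki/ folder is your domain. "
    "You write and maintain every file in wiki/. "
    "The human rarely edits wiki files directly.\n"
)

def bootstrap_vault(root: str) -> Path:
    """Create the raw/wiki/output layers plus a CLAUDE.md stub."""
    vault = Path(root)
    (vault / "raw").mkdir(parents=True, exist_ok=True)   # layer 1: immutable inbox
    wiki = vault / "wiki"                                # layer 2: LLM-owned
    wiki.mkdir(exist_ok=True)
    (wiki / "_index.md").touch()                         # master index
    (wiki / "log.md").touch()                            # append-only log
    (vault / "output").mkdir(exist_ok=True)              # optional layer 4
    schema = vault / "CLAUDE.md"                         # layer 3: the schema
    if not schema.exists():
        schema.write_text(CLAUDE_MD_STUB)
    return vault
```

Point Obsidian at the resulting folder as a vault and run Claude Code in its root.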

The Four Operations

Four verbs run the whole system.

**Ingest.** Drop a source in raw/ and type compile in Claude Code. The model reads the file, decides which topic it belongs to, writes or updates a wiki article, adds backlinks, updates the topic index, updates the master index, and appends a log entry. One source typically touches 10 to 15 wiki files in a single pass. That is the bookkeeping humans abandon and the reason every second brain system eventually dies of neglect.

**Query.** Ask a question. The model reads the master index first, then the topic index, then 1 to 3 specific articles. Three to four file reads, no vectors, no embeddings. The index files are the retrieval layer, and the model maintains them for you automatically.
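The index walk can be made concrete. A sketch under assumptions: index files are named `_index.md`, cross-references use `[[wikilink]]` syntax, and matching a topic by link name is a crude stand-in for the model's judgment.

```python
import re
from pathlib import Path

# Captures the target of [[target]] or [[target|alias]].
WIKILINK = re.compile(r"\[\[([^\]|]+)")

def links_in(path: Path) -> list:
    """Return wikilink targets in reading order -- the 'retrieval layer'."""
    return WIKILINK.findall(path.read_text())

def query_walk(wiki: Path, topic: str, limit: int = 3) -> list:
    """Follow master index -> topic index -> first few articles.
    Mirrors the three-to-four file reads described above."""
    if topic not in links_in(wiki / "_index.md"):   # read 1: master index
        return []
    articles = links_in(wiki / topic / "_index.md")[:limit]  # read 2: topic index
    # reads 3..n: the articles themselves
    return [(wiki / topic / f"{name}.md").read_text() for name in articles]
```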

**Lint.** Periodic health check. The model reads every file in the wiki and produces a report covering contradictions, stale claims, orphan pages, missing cross-links, unsourced claims, and suggested new articles. No changes happen during the lint pass. You approve specific fixes one at a time.
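Two of those checks, unsourced claims and orphan pages, are mechanical enough to sketch without a model. Assumptions: every article carries a `Source:` line, cross-references use `[[wikilinks]]`, and index and log files are exempt from the checks.

```python
import re
from pathlib import Path

WIKILINK = re.compile(r"\[\[([^\]|]+)")

def lint(wiki: Path) -> dict:
    """Report-only pass: flag articles with no Source: line and pages
    nothing links to. No files are changed; fixes wait for approval."""
    pages = list(wiki.rglob("*.md"))
    articles = [p for p in pages
                if not p.name.startswith("_") and p.name != "log.md"]
    inbound = set()
    for page in pages:                        # collect every wikilink target
        inbound.update(WIKILINK.findall(page.read_text()))
    report = {"unsourced": [], "orphans": []}
    for a in articles:
        if "Source:" not in a.read_text():
            report["unsourced"].append(a.stem)
        if a.stem not in inbound:
            report["orphans"].append(a.stem)
    return report
```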

**Log.** Every operation appends one line to wiki/log.md. Append-only, timestamped, parseable with grep. Gives the wiki a memory of what happened when.
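The log line is simple enough to pin down in code. A sketch only: the pipe-delimited field order (time, operation, description, files) is an assumption, not an official format.

```python
from datetime import datetime, timezone
from pathlib import Path

def append_log(wiki: Path, op: str, desc: str, files: list) -> str:
    """Append one timestamped line to wiki/log.md. Opened in append
    mode only: existing lines are never rewritten."""
    stamp = datetime.now(timezone.utc).strftime("%Y-%m-%d %H:%M")
    line = f"{stamp} | {op} | {desc} | {','.join(files)}\n"
    with (wiki / "log.md").open("a") as log:
        log.write(line)
    return line
```

Because every line shares the same shape, `grep '| ingest |' wiki/log.md` answers "what did I compile last month" instantly.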

The CLAUDE.md File (The Part Nobody Shows)

Every walkthrough online calls this file "the brain" and never opens it on camera. Here is the exact structure that makes the pattern survive across sessions.

The file is about 60 lines. It opens by telling the model who it is:

> You are the librarian of this vault. The wiki/ folder is your domain. You write and maintain every file in wiki/. The human rarely edits wiki files directly.

That sentence is load-bearing. Without it, Claude Code treats wiki files like any other files and starts deferring to whatever it sees there. With it, the model takes responsibility and rewrites pages confidently when new sources conflict with old ones.

The next section defines the four operations with numbered procedures. When you type compile, the model runs seven specific steps in order. When you ask a question, the model runs five specific steps. When you type lint, the model produces a seven-section report. These procedures are what makes the behavior consistent across sessions.

The conventions section enforces the rules that keep the wiki honest:

  • Every wiki article cites the raw source file it was compiled from.
  • Every article includes a Key Takeaways section.
  • File names use lowercase with hyphens.
  • Wikilinks are required for every cross-reference.
  • Bullets over paragraphs.
  • Never invent claims. Flag gaps in an Open Questions section.

The citation rule is the hallucination fix. Every article has to name the raw file it came from. If the model writes something the source does not say, the next lint pass catches it. That is how you keep a wiki honest at 200 articles.
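Put together, the conventions imply a page shape. A plausible skeleton (the title, source path, and link targets here are hypothetical; the exact template ships in the download):

```markdown
# Progressive Context Management

Source: raw/claude-blog-agent-patterns.md

Two- to four-sentence intro summarizing the concept in plain language,
compiled from the source above.

## Key Takeaways
- Bullets over paragraphs.
- Every claim traceable to the cited raw file.

## Related
- [[prompt-caching-strategies]]
- [[composable-general-tools]]

## Open Questions
- Gaps the source does not answer go here, never invented claims.
```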

The full file is in the downloadable vault template linked at the bottom of this post.

Live Query Walkthrough

Three query patterns work well against a wiki built this way.

**Direct lookup.** Ask a specific factual question about a single article. The model reads the master index, finds the topic, reads the article, answers. Three file reads.

**Cross-topic synthesis.** Ask a question that spans multiple topic folders. The model reads the master index, then multiple topic indexes, then 2 to 4 articles across topics, and synthesizes. Six file reads for a complex question.

**File-back synthesis.** Ask a question and tell the model to file the answer back into the wiki. The model produces the answer as a new wiki article in the appropriate topic folder. Your exploration compounds into the knowledge base. Every future query benefits from this answer being present.

All three patterns run in seconds. None of them touch a vector database.

Vector RAG vs the Wiki Pattern

The question everyone asks is whether this replaces vector RAG. The honest answer: below 500 articles, the wiki wins on four of five factors.

| Factor | Karpathy Wiki | Vector RAG |
| --- | --- | --- |
| Infrastructure | Folder of markdown files | Vector DB, embedding model, hosting |
| Setup time | 15 minutes | Hours to days |
| Scale ceiling | ~500 articles | Millions of chunks |
| Human browsable | Read and navigate freely | Black box |
| Outputs compound | Queries file back as new articles | Chat is ephemeral |

Vector RAG wins on scale. Wiki wins on everything else. Below 500 sources, the wiki is strictly better for solo operators. Above that, hybrid makes sense: wiki for structured synthesis, vector for semantic fallback across long-tail retrieval.

Scale Ceiling and Hallucination

Two objections come up every time this pattern gets posted.

**Scale.** The wiki starts breaking down around 500 articles, because the master index stops being a reliable navigation layer. The fix is either to split into multiple topic-specific vaults with a top-level router, or to bolt on a small BM25 or hybrid search tool over the markdown files. Karpathy mentions using a local search tool at larger scale. You probably will not hit this ceiling for a year.
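If you hit the ceiling before you want to split vaults, a handful of lines gets you a crude lexical fallback. A sketch only: plain TF-IDF-style scoring, far simpler than the real BM25 tools mentioned above, but the same idea of routing past a saturated master index.

```python
import math
import re
from collections import Counter
from pathlib import Path

TOKEN = re.compile(r"[a-z0-9']+")

def keyword_search(wiki: Path, query: str, k: int = 3) -> list:
    """Rank markdown files by overlap with the query terms,
    weighting rare terms higher (TF-IDF style, not true BM25)."""
    docs = {p: Counter(TOKEN.findall(p.read_text().lower()))
            for p in wiki.rglob("*.md")}
    n = len(docs)
    terms = TOKEN.findall(query.lower())

    def score(tf: Counter) -> float:
        total = sum(tf.values()) or 1
        s = 0.0
        for t in terms:
            df = sum(1 for c in docs.values() if t in c)  # document frequency
            if df:
                s += (tf[t] / total) * math.log(1 + n / df)
        return s

    ranked = sorted(docs, key=lambda p: score(docs[p]), reverse=True)
    return [p.stem for p in ranked[:k] if score(docs[p]) > 0]
```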

**Hallucination drift.** If the model writes a wiki page that drifts from the source, the error propagates into every future query. The fix is in the schema file above. Every article cites its raw source. Every lint pass checks citations against sources. Flag anything unsourced, review it, correct it. This is not automatic safety. It is a maintenance loop the model runs for you.

The Free Template

Everything in this post is in a ready-to-use Obsidian vault template. Folder structure, the full CLAUDE.md with all four operations, three example pages (entity, concept, source summary) so the model has shape references, a starter master index, a starter log, and a README with a two-minute setup.

Download the template. Open the folder as a vault in Obsidian. Run Claude Code in the root. Drop a sample article in raw/. Type compile. You have a working self-maintaining wiki in under 90 seconds.

[Download the Karpathy Vault Template]

What Comes Next

The wiki pattern is the foundation. Once it works, the useful extensions start showing up:

  • **Multi-vault federation.** One vault per domain, a top-level CLAUDE.md that routes queries across them.
  • **Auto-ingest.** A cron job that watches a Gmail label, a Slack channel, or an RSS feed and drops new items into `raw/` automatically. The model processes on schedule.
  • **Agent handoff.** Separate agents with different roles all reading from the same wiki. Research agent ingests, executive assistant queries, content agent publishes.
  • **Voice note ingestion.** Record a voice memo, transcribe with Whisper, drop into `raw/`, the model files it as a dated journal entry cross-linked to relevant concepts.

All of these reuse the same vault template. The pattern compounds.
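The auto-ingest idea needs surprisingly little code once something upstream drops files into a watch folder. A sketch: the fetchers (Gmail, Slack, RSS) are out of scope here; this is just the cron-friendly sweep that feeds raw/ without ever overwriting it.

```python
import shutil
from pathlib import Path

def sweep(watch: Path, raw: Path) -> list:
    """Move new markdown files from the watch folder into raw/.
    Run once per cron tick. Never overwrites: raw/ is immutable."""
    raw.mkdir(parents=True, exist_ok=True)
    moved = []
    for f in sorted(watch.glob("*.md")):
        dest = raw / f.name
        if not dest.exists():
            shutil.move(str(f), str(dest))
            moved.append(f.name)
    return moved
```

Schedule it with cron (say, every 15 minutes), then open a session with "compile everything new in raw/" to close the loop.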

Key Takeaways

  • The wiki pattern compiles raw sources once into a persistent structured artifact instead of retrieving from them on every query.
  • Three layers: raw (immutable source of truth), wiki (LLM-owned synthesis), schema (CLAUDE.md configuration).
  • Four operations: ingest, query, lint, log.
  • The CLAUDE.md file is the load-bearing piece every tutorial glosses over. The full version is in the free template.
  • Below 500 sources the wiki beats vector RAG on almost every axis.
  • Download the template, open in Obsidian, run Claude Code, start compiling. Under 90 seconds from zero to working.
  • Video walkthrough: [YouTube link]

Download the template: [link]

Free community: https://www.skool.com/stride-ai-academy-7057

Transcript

    Everything you're looking at right now

    was built 100% with Claude Code and

    Obsidian. Every wiki page, every

    backlink, every summary, I simply

    dropped in raw articles, raw transcripts

    into a folder and walked away. When I

    came back, I had this. Now, this entire

    pattern didn't come from me. This came

    from a Andre Karpathy tweet, which went

    mega viral, and this whole entire setup

    takes about 15 minutes, and you're going

    to be getting access to this finished

    vault template at the this video 100%

    for free, no paywall or anything like

    that. Now, you've probably saved

    articles, podcasts, notes, whatever the

    case is, you know, in Notion, PDFs,

    podcast transcripts, and you meant to

    review these, but you never got around

    to them, and part of the reason of that

    is is because none of it is actually

    searchable. None of it's connected, it's

    just scattered all over the place, and

    none of it actually feeds into your

    work. And every second brain system you

    may have tried out there, whether it's

    Notion, different systems and setups,

    they typically fail for the same reason.

    You stop maintaining it, and the reading

    and thinking was never really the

    problem, the bookkeeping was. Now, Andre

    Karpathy just posted the fix on Twitter,

    it went viral, he posted a gist as well,

    which really dives deep, and all this

    will be linked, of course, in the

    description down below. I'm also going

    to be giving you a free resource, it's a

    12-page document that literally goes

    over everything, goes over in-depth this

    entire system from start to end, which

    we're, of course, going to dive deep

    into in today's video. I also will be

    giving you this exact template, so you

    can literally just copy and paste and

    start using this by the end of this

    video for your own knowledge system.

    Now, before we dive in, just to give you

    some quick context in case you don't

    already know, who is Andre Karpathy?

    Well, he was a part of the founding team

    at OpenAI, he was the head of AI at

    Tesla, built the entire autopilot vision

    stack, he created one of the most

    popular deep learning courses on the

    internet. You may have seen his recent

    GitHub project go viral, auto research.

    And when this guy posts a workflow for

    managing knowledge, or really anything

    about AI that he posts, it's worth

    paying attention to, and really everyone

    just listens. So, currently he is

    running a 400,000 word personal research

    wiki with no vector database, no

    embeddings, no retrieval chain, just

    markdown files and Obsidian, and one

    schema file that Claude Code reads every

    single session. Now, in the next 10

    minutes or so, I'm going to show you the

    entire pattern so you have a deep

    understanding of it, every line of the

    schema file that nobody else wants to

    explain. I'm going to show you a live

    compile, a live query, a live lint, and

    by the end of this video, you will have

    a working system and a template that you

    can run on your own stuff. Now, every

    version of this pattern has the same

    three layers. Get these right and

    nothing else really matters. So, layer

    one is raw. This essentially is your

    personal inbox as a human. So, what goes

    in here are articles, papers,

    transcripts, screenshots, anything that

    you dump in here that you want the LLM

    to read, and that's where it's going to

    read is from this raw folder. Now, keep

    in mind the LLM is never going to write

    into this folder. This is literally just

    an inbox for you to dump things in, and

    this is immutable. It is your source of

    truth. Now, layer two right here is the

    actual wiki, so this is where the LLM

    files live. It writes summaries, it

    writes entity pages, it writes concept

    pages, it builds an index, it maintains

    cross-links between every article. You

    read this layer in Obsidian. Now, you as

    the human do not write in this layer.

    Not because you can't, but because every

    edit that you make in here is one that

    the model cannot predict in the next

    session, and the whole system starts to

    drift. And then at the top here for

    layer three is the schema. This is one

    claude.md at the vault root. And this is

    what turns a generic Claude Code session

    into a disciplined librarian. Every

    session it reads this file first, every

    operation follows rules from this file,

    and this is the core piece of the

    system. Now, if you want a deep dive

    into claude.md files, make sure to check

    out this video right here I did a few

    days ago on claude.md files and how to

    properly structure them, but of course,

    like I mentioned, I'm going to be giving

    you this free template, which includes a

    structured Claude MD that you can use

    out of box. So, raw, wiki, schema. That

    is the whole architecture. Now, let's

    dive into the actual layers and how this

    all works. So, there's really four

    operations, and we're going to start off

    with the first one, which is ingest. So,

    this is where you drop one source in the

    raw folder, and you tell Claude Code to

    then compile it. And in one pass, it can

    touch 10 to 15 wiki pages. Here's what

    it actually does though under the hood.

    So, it reads the raw file in full, then

    it runs checks in the master index to

    see if the topic already exists. If yes,

    it updates the existing article and adds

    backlinks from any related pages. If no,

    it creates a new topic folder with its

    own index file. Either way, it updates

    the master index and appends a line to

    the log, and the wiki has just

    compounded by one source. All right, so

    let me show you how this actually works

    in action. So, first things first,

    you're going to want to download the

    Obsidian web clipper. So, link to this

    will be in the resource below, and this

    whole document, as well as all the

    different resources, templates, etc.

    from this video and others is available

    in our free Stride AI Academy. Now, once

    you go ahead and actually download this,

    you'll see the Obsidian icon right here

    for the extension. We can go ahead and

    click on it. You're going to want to

    click on settings to open up the

    Obsidian settings. You'll see right

    here, I actually selected the name of my

    specific vault. By default, it will save

    it to the open vault. You can also

    create new templates for how it's going

    to save it, or use this default one

    right here, and all you're going to want

    to do is just change this note location.

    It's going to be clippings by default,

    but you can change this to raw, or

    whatever you want to have your actual

    intake folder to be. Once we have that

    set up, we're going to find a blog or

    whatever piece of information that you

    want to actually ingest into the system.

    For this, we're going to be using one of

    Claude's blog posts right here. I'm just

    going to go ahead and click on this, and

    then I'm going to click add to Obsidian.

    Next, it's going to ask me to open

    Obsidian. And boom, here you can see we

    have this entire article with different

    properties, such as the title, the

    source, the author, when it was

    published, created, a description, any

    specific tags. And you can see here, it

    literally pulled in everything,

    including the images. Now, by default,

    it won't actually pull in the images,

    but we're using a plugin right here

    called local images plus, which by

    default is actually installed in the

    template that I'm providing you for

    free. Actually, every single plugin that

    I cover is actually installed in this

    template by default. But if you already

    have an Obsidian setup and you're

    setting this up in there, or whatever

    the case is, maybe you just need to

    install this plugin yourself. You can

    just go over to community plugins, make

    sure you turn them on, and then you're

    going to want to search over here,

    browse, and you're going to want to

    search for the specific plugins that we

    cover. So, I have DataView installed

    here. I also have local images plus, as

    well as terminal. So, the terminal one

    right here, you can actually use Claude

    Code if you want in Obsidian if you

    don't want to leave uh Obsidian

    whatsoever. I personally usually prefer

    using it in something like Cursor's

    terminal or VS Code, and I have Obsidian

    and my actual IDE open at the same time.

    So, here you can see we have that same

    Claude blog post that we just saved into

    our Obsidian, and I'm going to say to

    Claude, compile, and then I'm linking to

    that specific uh blog post right here,

    that markdown file, compile this one

    into the wiki. That's all I'm going to

    say, and Claude's just going to do its

    magic. It's going to take maybe uh 30

    seconds to a minute, depending on how

    much you're compiling, and it's going to

    actually go about the process. So, watch

    what happens. It's going to read the

    article, it's going to identify the core

    topics, going to decide what needs its

    own concept page. If the topic doesn't

    exist yet, it's going to then write the

    summary, it's going to write a key

    takeaway section, it's going to add

    inbound and outbound wiki links, and

    then it's going to update the topic

    index, and it's also going to update the

    master index, and then it's going to log

    the entry. All from one simple command,

    which, of course, is powered by our

    claude.md file. So, before Claude Code,

    Obsidian was kind of like a big scary

    tool for a lot of people because you

    have to do all these different things,

    backlinks. It was great for bookkeeping

    your notes and knowledge, but it's

    something that humans aren't really

    going to do, and they're just going to

    abandon, you know, 15 edits across eight

    files from one source, you would never

    personally do this manually, but Claude

    Code does this for us in 30 seconds, and

    it's easy as that. So, if you start

    building this up, you do it 10 times,

    you start to have a real knowledge base,

    and if you do it maybe like 100 times or

    a couple hundred times, you know, the

    graph view is going to start looking

    like actual research. And boom, it's

    done. You can see we have a new topic,

    which is agent design patterns with five

    articles. So, we have three key

    patterns, composable general tools,

    progressive context management, prompt

    caching strategies, and declarative tool

    designs. You can see it also updated the

    master index, the log.md, plus added

    backlinks with a total of 11 files

    touched. And like I said, the template

    that I'm providing you for free with

    this entire knowledge system comes with

    a complete claude.md file, and this is

    really what makes this system work. But

    you can, of course, customize this if

    you want to change the system for your

    specific flow. You can see here, it

    starts off saying, "You are the

    librarian of this vault. The wiki

    {slash} folder is your domain. You

    write, you maintain every file in this

    wiki. The human rarely edits wiki files

    directly." Now, this basically is

    defining the ownership. Without it,

    Claude Code treats wiki files like any

    other file and starts deferring to

    whatever it sees there. But with it,

    it's going to start taking

    responsibility and write these pages

    confidently with new source conflicts.

    So, you can see here for ingest, read

    the raw file in full, identify the core

    topic or topics, check wiki master index

    for matching topic folder. If the topic

    exists, update or extend the relevant

    articles and add backlinks from touched

    pages. If the topic does not exist,

    create a new folder under wiki with a

    lowercase hyphenated name and create a

    underscore index.md inside of it. Every

    wiki article must include a top level

    title, source which is path to raw md

    file line, a two to four sentence intro

    paragraph, a key takeaway section with

    bullet points, a related section with

    wiki links to three to eight related

    pages, update the topic underscore

    index.md,

    update the wiki master index if a new

    topic was created, append one line to

    wiki.log.md

    if the source spans multiple topics

    create articles in both and cross link.

    We have about 10 steps here and the

    model's going to follow them in order

    every single time because this file in

    the cloud.md tells it what to do and

    it's loaded in every single conversation

    so you won't have to prompt it again.

    Next we have the query section so this

    is triggered when the human asks a

    question so read wiki master index first

    then read the matching topic index.md,

    read one to three specific articles in

    full, synthesize the answer with

    citations, and then offer to file

    substantial answers as new wiki

    articles. So three to four file reads to

    answer any question. No vectors, no

    embeddings, no cosine similarity, BM25,

    the index file is the retrieval here and

    that's because the model maintains it

    for you. All right, so next is the lint

    which is the health check. So live is

    the append only record and we're going

    to run both the query and the lint live

    in just a second and you can see here

    this is going to be triggered simply by

    just saying the word lint or audit the

    wiki. It's basically going to read every

    file in the wiki and produce a report

    covering contradictions, stale claims,

    orphan pages, missing concepts, missing

    cross links, unsourced claims, and

    suggested new articles and then output

    the report to output-lint-report

    and then the date here and then wait for

    approval before fixing. And then for log

    every operation appends one line to the

    wiki.log.md in this

    time, operation, short description, and

    then the files touched. And this is

    append only it never rewrites existing

    lines. And then we have conventions so

    things like citations are required every

    wiki article includes source line, key

    takeaway section is required on every

    article, file names are lower case

    hyphenated, use wiki links for every

    cross reference, bullets over

    paragraphs, never invent claims, flag

    gaps and open questions, and then flag

    contradictions when found. And then when

    the human asks something outside of the

    rules ask a clarifying question, do not

    silently invent a new operation. So the

    citation rule right here is the

    hallucination fix. Every article has to

    name the raw file it came from. If the

    model writes something that the source

    doesn't say that in the next lint pass

    catches it. And that's how we're going

    to keep this wiki honest at scale. And

    that's literally the entire system. It's

    essentially a 60 line or so cloud.md

    markdown file. And this essentially is

    the brain of the entire system and

    that's why I went through it for you

    guys because it's very important for you

    guys to know it. Now just so you guys

    know in our template we do have some

    additional things that Karpathy doesn't

    even cover and that I just felt did add

    value to this system so I'll quickly go

    over them. We also have an AI-research

    folder. So this is the AI's research

    folder. So you're going to see in a

    second yes we can ingest manually

    through the means I just showed you with

    the Obsidian Clipper or just you know

    scraping our own podcast transcripts or

    whatever the case is from different

    sources. You can also get Claude code of

    course to conduct autonomous research on

    the web and save the full clean source

    content into this folder as markdown

    files. And you'll see here we're telling

    Claude that it can write to this folder.

    And files here are immutable once saved,

    do not overwrite, create new files. And

    this separates human curated sources

    which are in the raw folder from AI

    discovered sources which are in the AI

    research folder. And you'll see research

    is triggered when the human asks you to

    research a topic or when a query reveals

    gaps the wiki cannot answer from

    existing sources. What it's going to do

    is search the web for relevant high

    quality sources on this topic and then

    for each source found save the full

    clean content not a summary as markdown

    files in this folder. You can see the

    format here and we're basically just

    giving it some additional stuff for the

    format that it's saving it as. So in the

    doc here if you want to see more in

    depth stuff such as the folder

    structure, the page templates, entity

    page templates that we have within the

    system as well as the concept page

    template, the source summary template,

    you can see all that there. But we're

    just going to move on to the query

    pattern. So there's really three types

    of queries that work well against the

    wiki built this way. The first is the

    direct lookup. We have direct lookup,

    cross topic synthesis, and file back

    synthesis. I'm going to run all three

    against this vault that already has

    about 10 plus articles as well as

    transcripts from some of my YouTube

    videos inside of it. All right, so I'm

    actually just going to use the Obsidian

    terminal right here but you can use

    either or the IDE with cursor, VS code,

    whatever the case is. I'm just going to

    say what are the key points from my

    cloud.md video. And you can see here I

    have my YouTube transcripts which

    includes the cloud.md video. So this is

    a direct query. You can see here let me

    find the relevant raw files and query

    the wiki for additional coverage.

    There's a wiki article specifically

    about cloud.md best practices. And boom

    here you can see this is exactly what I

    covered in that video. If you didn't

    haven't watched that video I definitely

    suggest you to go watch it. We covered

    two different research papers right

    here. Why Claude ignores your

    instructions, the fix so six different

    sections right here, and then some

    practical tips.

    All right, so that's the first query

    direct lookup. Next is the cross topic

    synthesis. So I'm going to ask what

    techniques are mentioned across multiple

    videos. You can see here nine videos

    across 11 topics. Let me read the topic

    indexes to trace which videos feed into

    which articles then identify overlapping

    techniques. And boom here we got our

    answers. So we can see technique number

    one context/token management so that's

    from videos one, two, three, four, five.

    And we can see what it pulled there and

    then we can see technique two prompt

    caching from videos one, three, and

    four.

    And then we can see number three

    composable general tools over task

    specific tools video three, four, and

    nine. And then dead weight pruning,

    reevaluate assumptions videos two and

    three, and then cost monitoring and

    mitigation videos one, five, and nine,

    security boundaries, and then persistent

    always on agents. All right, so query

    three file back synthesis. So this is

    where the answer becomes a new wiki.

The cool thing here is that exploration compounds into new knowledge in your knowledge base. The prompt: "Based on everything in the wiki, what are the main approaches to long-term memory for AI agents? Save your answer as a new wiki article and cross-link the sources you referenced."

When it finishes, there is a brand-new article in the vault, filed under the agent-design-patterns wiki: "Long-Term Memory Approaches." It links to related documents (Multbook VPS Setup, AI Workflow Builder, CLAUDE.md Best Practices) and synthesizes four approaches from across the wiki: memory folders, compaction, structured config files, and external RAG. The overarching finding: simpler, model-native memory is replacing external infrastructure as models get more capable, which is exactly what this system demonstrates in front of our eyes.

This query is great because everything you ask now adds value to the system. You never lose a good answer to chat history the way you may have in the past, and this is the part that turns a knowledge base into a research partner.
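For intuition, a synthesized article might look something like this. The structure follows the vault's conventions, but the file names and frontmatter fields here are illustrative, not the exact output:

```markdown
---
type: synthesis
sources: [video-01, video-03, qmd-research]
---

# Long-Term Memory Approaches for AI Agents

Four approaches appear across this wiki: memory folders, compaction,
structured config files, and external RAG. The overarching trend:
simpler, model-native memory is replacing external infrastructure.

Related: [[ai-workflow-builder]], [[claude-md-best-practices]]
```

Because the answer is a plain markdown file with wikilinks, it is immediately browsable in Obsidian and retrievable by every future query.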

The AI Research Query

Next is a query the AI deliberately cannot answer from the current knowledge base. I ask: "Research what QMD is and how it works as a research layer on top of markdown wikis. Save what you find to AI research." Claude runs its fetch tool several times against different pages and writes up the results.

If you have not heard of QMD, it is a BM25-based retrieval tool for markdown created by Tobi Lütke, the founder of Shopify. I will dive deeper into it in future posts, and it is mentioned in the resource doc below. The research gets saved in the AI research folder, and the knowledge-search wiki now has a QMD page covering BM25, vector/semantic search, and hybrid retrieval with re-ranking, broken into articles like QMD Overview, Indexing and Chunking, and MCP Integration.
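QMD layers vector search and re-ranking on top, but the BM25 half is simple enough to sketch in plain Python. This is a toy scorer for intuition, not QMD's implementation:

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each doc against the query with classic Okapi BM25."""
    tokenized = [d.lower().split() for d in docs]
    avgdl = sum(len(d) for d in tokenized) / len(tokenized)
    # document frequency: how many docs contain each term
    df = Counter()
    for d in tokenized:
        for term in set(d):
            df[term] += 1
    n = len(docs)
    scores = []
    for d in tokenized:
        tf = Counter(d)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1)
            norm = tf[term] + k1 * (1 - b + b * len(d) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

docs = [
    "bm25 ranking over markdown files",
    "vector embeddings for semantic search",
    "hybrid retrieval with bm25 and re-ranking",
]
# the doc containing both query terms should rank highest
scores = bm25_scores("bm25 markdown", docs)
```

Keyword scoring like this is why a BM25 layer bolts so cleanly onto a markdown vault: there is no index to host and nothing to embed.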

To recap the index structure: every single wiki has its own index, and a master index links to each specific wiki. This is progressive disclosure, exactly the pattern I have covered in other videos, including the CLAUDE.md one, and it is how Claude Code skills work. The master index gives a short description of each wiki so the model can decide which one to enter, and once inside, that wiki's own index carries a description of each specific article.
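Concretely, the two index layers look roughly like this (wiki and article names are illustrative):

```markdown
<!-- index.md (master) -->
# Master Index

- [[knowledge-search/index]]: search and retrieval tooling (QMD, BM25, hybrid RAG)
- [[agent-design-patterns/index]]: agent memory, caching, and tool design

<!-- knowledge-search/index.md (per-wiki) -->
# Knowledge Search

- [[qmd-overview]]: what QMD is and the problem it solves
- [[indexing-and-chunking]]: how sources are split and indexed
- [[mcp-integration]]: exposing search to Claude Code
```

The model only loads a deeper layer when the short description above it says the answer lives there, which is what keeps the context footprint small.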

Keeping the Wiki Healthy

If you stopped right here, you would have a fully working system. The next step is keeping it healthy at 100, 200, or 300 articles. All I do is run lint. The lint operation reads every file in the wiki, cross-references them, and returns a report: contradictions, stale claims, orphan pages, missing cross-links, unsourced claims. All the things we humans tend to miss, it can detect. When the pass completes, the report is saved in the output folder.
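The lint pass itself is Claude reading files, but the purely mechanical checks are easy to picture. Here is a sketch of orphan-page detection over a vault, assuming Obsidian-style `[[wikilinks]]` whose targets are file stems (a simplification of how Obsidian resolves links):

```python
import re
from pathlib import Path

# matches the target portion of [[target]], [[target|alias]], [[target#heading]]
WIKILINK = re.compile(r"\[\[([^\]|#]+)")

def find_orphans(wiki_dir):
    """Return pages no other page links to, one of the checks lint performs.
    The top-level index is allowed to be unlinked."""
    pages = {p.stem: p for p in Path(wiki_dir).rglob("*.md")}
    linked = set()
    for page in pages.values():
        for target in WIKILINK.findall(page.read_text()):
            linked.add(target.strip())
    return sorted(name for name in pages if name not in linked and name != "index")
```

An orphan is not necessarily wrong, but a page nothing points to is a page the index-driven navigation can never reach, which is why lint flags it.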

In this run there were no contradictions, no stale claims, and no orphan pages, but there were six missing cross-links, three unsourced claims, and five suggested articles, each detailed in the report. The model then waits for approval before applying any fixes; once I approve, all of them are applied. Run lint weekly or after any big ingest batch. It takes 5 to 10 minutes, give or take, and leaves you with a healthy wiki plus a log entry so you know when you last ran it.

Does This Replace Vector RAG?

The question everyone has been asking is whether this actually replaces vector RAG. The honest answer: below roughly 500 articles, the wiki wins on four of five factors.

- Infrastructure: vector RAG needs a database, an embedding model, and hosting. The wiki is literally a folder of markdown files. Wiki wins on simplicity.
- Setup time: vector RAG can take a few hours to a couple of days depending on complexity. The wiki is a 15-minute setup; you can have it fully running by the time you finish this post. Wiki wins.
- Scale: there is a ceiling. Vector RAG handles millions of chunks, while the wiki starts breaking down past maybe 500 to a few thousand articles. Vector wins here, and this is one of the axes that actually matters.
- Human browsability: vector RAG is a black box. The wiki is a set of pages you can read, navigate, and view in a nice GUI like Obsidian. Wiki wins.
- Compounding outputs: with vector RAG, the chat is ephemeral. Wiki queries file back as new articles. Wiki wins.

So: four out of five below about 500 sources. Above that, a hybrid approach makes sense, with the wiki for structured synthesis and vector search as a semantic fallback for long-tail retrieval. This could change as models progress, which is also why I keep referencing QMD; it is linked in the resources as well, and I will show how to use it in future posts. You could also substitute other vector or embedding strategies. QMD is just the one I am referencing here.

Objections

A few objections come up every time this pattern gets posted on X, so let me handle them directly.

First, scale. We covered this already, but as a baseline: the wiki starts breaking at around 500 articles because the index stops being a reliable navigation layer, similar to how Claude Code degrades when you bloat it with too many skills. Above that, either split into multiple topic-specific vaults or bolt a small search tool onto the markdown files. As mentioned, QMD, the same search tool Karpathy actually uses, is linked in the doc. You probably will not hit the ceiling for quite a while, depending on how heavily you use your vault, but when you do, reach for a separate vault or a hybrid strategy with something like QMD.

Second, hallucinations. This is the big one with anything involving AI models: if the model writes a wiki page that drifts from the actual source, the error propagates into every future query, essentially like a virus. The fix is the CLAUDE.md file, which is why we took such a deep look at it earlier. Every article cites the raw file it came from, and every future lint pass checks those citations against the actual sources, flags anything unsourced, reviews it, and corrects it.
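The citation check is the part worth internalizing. A rough sketch, assuming each article carries a `source:` frontmatter line naming a file in raw/ (the field name and layout are illustrative, not a schema the post defines):

```python
import re
from pathlib import Path

# a frontmatter-style line such as: source: some-talk.md
SOURCE_LINE = re.compile(r"^source:\s*(\S+)", re.MULTILINE)

def unsourced_or_broken(wiki_dir, raw_dir):
    """Articles whose source citation is missing, or points at a raw/
    file that does not exist: candidates for the next lint review."""
    flagged = []
    for page in Path(wiki_dir).rglob("*.md"):
        match = SOURCE_LINE.search(page.read_text())
        if match is None or not (Path(raw_dir) / match.group(1)).exists():
            flagged.append(page.name)
    return sorted(flagged)
```

Because raw/ is immutable, a citation that resolves today keeps resolving, so this check only ever surfaces genuine drift or missing attribution.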

This is not automatic safety, but it is a maintenance loop the model runs for you.

Wrapping Up

The full document breaking down everything covered here in depth, plus some additional reading and the complete Karpathy Obsidian Vault Starter Kit, is available for free in our community, Stride AI Academy. There is no paywall, and you can network with me and other like-minded AI entrepreneurs and enthusiasts.

I tried to be as thorough as possible because, on the surface, Karpathy's tweet alone can be difficult to unpack. The goal was to show the exact process and hand you the starter kit so you can get going as soon as possible, because pairing Claude Code with something like Obsidian, and actually managing your second brain efficiently, is genuinely valuable.

I have personally been using Obsidian for about the last four to five months, across a few different vaults, and it has helped me tremendously with content creation, business, personal life, and knowledge management generally. I am excited to share more about my custom vaults in the future, along with more Claude Code and AI tutorials. If I missed anything, or you have insights or ideas about this vault, Karpathy's setup, or Claude Code in general, let me know, and join the Stride AI Academy below.

    Enjoyed this article?

    Join the Stride AI Academy for more insights and connect with 1,000+ builders.

    Join the Academy