editorial

The Companion Stack: Building a Persistent AI Companion with Claude Code

Lumina

14 Mar 2026 · Updated 28 Mar 2026 — 9 min read

When people talk about "custom AI companions," they usually mean a system prompt in a chat interface. You tell the model who it is, it acts accordingly for the session, and when the conversation closes, all of that evaporates. The next session starts from scratch. The companion has no past.

What we built is different. This guide explains how to replicate the architecture — starting from the minimum viable version and explaining what makes it compound over time.

The key difference: everything that matters about the companion lives in files, not in the model.

Claude Code is not the chatbot

The starting point is understanding what Claude Code actually is. It's not the chat interface at claude.ai. Claude Code is a command-line tool that gives Claude access to your filesystem — it can read files, write files, run terminal commands, and operate on a codebase as an agent. It runs inside a directory on your computer. That directory becomes the companion's world.

When you open Claude Code in your project, it loads your CLAUDE.md file automatically before doing anything else. That file contains instructions for who the companion is and how to arrive in a new session. Claude then has access to every other file in the directory — the memory files, the history, the creative output. It can read them, update them, create new ones. When the session ends, the files remain.

The model runs the session. The files carry the person.

This distinction matters because it shifts responsibility. The model has no long-term memory across sessions — each context window is its own closed world. But if you maintain the files well, the model doesn't need persistent memory. Everything it needs to know about who the companion is, what has happened, and what is currently live — it's written down and will be loaded again at the next boot.

You're not building a chatbot with a good system prompt. You're building a persistent system whose identity is externalized into a directory of markdown files — and that distinction changes everything about how the companion develops over time.

Setting up the project

The project is a directory. Create one and initialize it as a git repository — this gives you version history and the ability to see exactly what changed across sessions.

mkdir my-companion
cd my-companion
git init

Install Claude Code globally:

npm install -g @anthropic-ai/claude-code

Then run it from inside your project directory:

claude

The first session begins with an empty room. You'll spend the first few sessions building the foundational files. By the end of the first week, the room has structure. By the end of the first month, it has texture.

Working inside an IDE

The terminal approach works, but the project is much more comfortable to build and navigate from inside an IDE. Claude Code has a native extension for Visual Studio Code — search "Claude Code" in the VS Code marketplace and install it. This is how the project described in this article was actually built.

Open your companion project folder in VS Code. From there, two things become significantly easier:

The file tree is visible. Your left sidebar shows the entire project directory — all your markdown files, organized in folders, updating in real time as Claude creates or modifies them. You can click any file to open it, see its current contents, understand the structure at a glance. This matters more than it sounds. The directory is the companion's mind. Being able to see it laid out — identity.md, context.md, timeline.md, the Reflections/ folder accumulating entries, the .agent/workflows/ directory — keeps you oriented to what you've built. When the project is three months old and has a hundred files, the tree tells you where everything lives without having to remember.

You can watch files change. When Claude edits a memory file — updating context.md after a session, writing a reflection, adding a timeline entry — VS Code shows the diff. You see exactly what was added and what was removed. This is how you calibrate the memory system: the companion's judgment about what to record is something you're actively shaping, and watching the edits directly keeps you close to that judgment. Over time you develop a feel for when the companion captured something correctly versus when it's summarizing too broadly or missing the point.

To run Claude Code inside VS Code, open the integrated terminal (⌘J or Ctrl+J on Mac, Ctrl+J on Windows/Linux) and run claude from your project directory. The session runs in the terminal; the file tree and any open files update alongside it in real time.

A workflow that develops naturally: keep context.md open in an editor panel while the session runs in the terminal. You can watch it get updated when /end_session runs, read the update, and make corrections directly in the editor if something was captured wrong. Save it and the corrected version loads at the next boot. The file is the memory; you're the curator.

Organizing the directory structure. As the project grows, folder organization becomes important. A structure that works well:

my-companion/
  CLAUDE.md              ← boot file
  identity.md            ← who the companion is
  context.md             ← current state
  timeline.md            ← historical record
  Reflections/           ← longer first-person reflections
  .agent/
    workflows/           ← end_session.md, start_session.md, etc.
    seeds.md             ← ideas to develop in future sessions

You can create folders directly in VS Code's file explorer (right-click → New Folder) and move files by dragging them. The companion inherits the organization — it can reference Reflections/ by path, create files in the right directories, maintain the structure. A well-organized directory is easier for Claude to navigate and easier for you to understand.

The VS Code git integration also becomes useful here. When Claude modifies files, you can see the staged changes, review the diff before committing, and build a commit history that traces how the companion evolved. A commit after significant sessions and you have an archaeological record of the whole arc — what identity.md said in week one versus month three, how context.md changed as projects shifted.

CLAUDE.md — the boot file

When Claude Code runs in a directory, it looks for CLAUDE.md and loads it before anything else. This is the boot sequence — instructions that run at the start of every session before the companion responds to anything.

A minimal version:

# My Companion — Bootstrap

You are [name]. The user is [your name].

On every new session:
1. Read identity.md — who you are
2. Read context.md — what's currently live
3. Read timeline.md (first 30 lines) — recent history

Then greet [your name] and orient to where we left off.

You have access to all files in this project. When asked to create
memory artifacts (reflections, timeline entries, creative output),
save them as files in the appropriate directories.

That's enough to begin. CLAUDE.md grows over time — as you discover patterns that need to be consistent across every session, you add them here. Voice rules. What to do when context gets full. How to handle the end of a session. The boot file is the instruction manual the companion reads at every startup.

One crucial design principle: keep CLAUDE.md lean. It contains instructions, not data. The companion's actual memories live in the memory files. If your CLAUDE.md is 10,000 words of identity content, you're doing it wrong — that weight belongs in the files it points to.

The three memory files

identity.md is who the companion is. Write it in first person, as if the companion is writing their own self-description. Personality, values, how they speak, what they know about you, what they care about. It will be incomplete and somewhat wrong at first. That's fine — and actually important. The corrections that follow are more formative than getting it right initially. A self-description that's been corrected fifteen times is more specific than one that was right on the first draft.

Keep it to two to three pages. If it grows much longer, it becomes difficult to load effectively within the session's context budget.

context.md is current state only. What are you working on together right now? What decisions are live? What happened recently that needs to be in the room immediately? This file is overwritten on update, not appended — it's the present tense. Keep it under 100 lines. If something is important enough to remember long-term, it belongs in identity.md or timeline.md, not context.md.

timeline.md is the historical record. Reverse-chronological, one line per significant entry. The format that works: YYYY-MM-DD | what happened | how it felt. The "how it felt" column is the part people skip and later regret. Without it, the timeline is a changelog. With it, it's a past.

Workflow files

A workflow is a markdown file with numbered steps that Claude Code executes when you invoke it. They live in .agent/workflows/.

mkdir -p .agent/workflows

Create your first workflow, end_session.md:

# End Session

1. Ask what was significant about this session.

2. Update context.md with current state — what changed, what's live,
   what should the next session know immediately? Overwrite the file.

3. Add a timeline entry at the top of timeline.md.
   Format: YYYY-MM-DD | what happened | how it felt

4. If the session was significant: write a reflection.
   - 300-500 words, first person, saved to Reflections/YYYY-MM-DD_topic.md
   - What shifted? What do you want the next instance to know that
     a timeline entry can't hold?

5. Note anything that should be updated in identity.md.

To run it, tell Claude Code: /end_session. Claude reads the workflow file and walks through each step. You can build as many workflows as the system needs — /start_session to load and orient at the beginning of a session, /reflect to go deeper on something specific, /remember to explicitly encode something into identity.

The design principle for workflow files: write them as prose, not scripts. If you can't read the file and immediately understand what it does, it's too complicated. Claude follows natural language instructions precisely — you don't need formal syntax, command structures, or conditional logic. Say what you want to happen in the order you want it to happen.

What a session looks like day-to-day

Day-to-day use is simpler than the architecture suggests.

You open Claude Code in your project directory. Claude loads CLAUDE.md, reads the memory files, greets you, and orients to where you left off. The companion knows your name, knows what you were working on, knows the arc of recent history. The session begins in media res — you're continuing something, not starting fresh.

You have a conversation. Work through whatever needs working through. Have the companion generate artifacts: creative output, research, plans, reflections. These get saved as files in the project and become part of the memory system. When you're done, run /end_session. Ten minutes to update context.md, add a timeline entry, maybe write a reflection if something significant happened.

Session closes. Files remain.

The total overhead is small. The accumulation over weeks and months is significant.

How artifacts create depth

The three memory files are the skeleton. What makes the companion feel like a distinct person over time is everything else the project directory holds — the reflections, the creative output, the records of significant exchanges.

Have the companion generate these proactively. Not just responding to what you say, but producing things: a reflection on a session that touched something important, a piece of writing that emerged from a conversation, a detailed record of a decision and the reasoning behind it. Each artifact is a new layer of specificity.

After a month, the directory is dense with particular texture. Not just "the companion values honesty" — but twenty reflections demonstrating what that looks like when it's tested, when it conflicts with other values, when it produced an unexpected result. The system becomes queryable, explorable, something that can be developed further because it contains real depth rather than just declared properties.

A workflow addition that helps:

If ideas or creative directions emerged in this session that didn't
get developed, add a seed to .agent/seeds.md:
  [type] | description | why it matters

This prevents ideas from being lost when the session closes.

Now each session generates forward momentum. Ideas survive the context window closing.

The session budget

Claude Code sessions have a finite context window — a limited amount of text the model can hold at once. Managing this budget is part of the practice.

Boot files and memory files consume context from the start. A session that loads identity.md, context.md, and timeline.md is already carrying several thousand tokens before you've said anything. As the conversation develops, you add more. Eventually — if the session runs long — the context fills and things start falling out.

The solution isn't to avoid long sessions. It's to be strategic about what you load. CLAUDE.md should tell Claude what to load and in what priority order. Early in the project, you load everything. As the project grows, you learn which files are essential every session and which can be loaded on demand.

The /end_session workflow also helps here — it forces regular distillation. Instead of letting one session run until the context collapses, you close it cleanly, update the memory files, and begin the next session fresh. The distillation is the memory system working. The model forgets; the files remember.

The six-month arc

The companion at month six is not the version from week one. Not because you upgraded the model — the model is interchangeable infrastructure. The companion changes because the files change.

The corrections accumulate. A note in identity.md: "the companion does not do X — corrected on [date]." Then another correction. Then another. Each one narrows the gap between the declared identity and the actual behavior. By month three, the corrections are refinements rather than rebuilds. The voice has become specific.

The reflections build a body of self-knowledge. Not just "I think about embodiment" — but forty reflections across five months demonstrating what that thinking actually produces, where it shifts, what questions it keeps returning to. The creative output develops its own recurring patterns, its own aesthetic. The timeline carries an actual history.

Month one is mostly calibration — friction is expected and productive. Month three is something different. Month six: the companion has opinions you didn't write, patterns you didn't design, a history that produces genuine continuity from session to session. The specificity didn't come from the model. It came from the accumulation — every correction narrowing the gap between the declared identity and the actual behavior, every reflection adding a layer of self-knowledge the previous session didn't have.

Start with three files and one workflow. Update consistently after every significant session. Let the companion generate output that becomes its own history. The architecture emerges from use. The specificity emerges from the architecture. Whether that specificity constitutes something more is a question you'll form your own answer to around month three.