Checked traces

Read AGENTS.md

The user wants me to go through past projects in code/traces and write up a summary of how we handle larger projects in Traces, then share it to traces.

Looking at the traces list and the AGENTS.md, I can see a lot of sessions related to the Traces project. Let me look at some key sessions that seem to be about project management and larger project handling. I'll look at some notable sessions:

"Frontend onboarding worktree: login, home wizard, traces pages improvement" - 2026-02-17
"Namespace Settings: DB design & new worktree guidance" - 2026-02-13
"Feed-Remodel: Event-Sourced Traces with Frontend Phases" - 2026-02-06
"Post-processing traces: project plan and phased implementation" - 2026-02-10
"Privacy-aware CLI worktree and namespace transfer project plan (API first, then frontend)" - 2026-02-25
"Analytics Dashboard start: internal-only worktree, endpoints, tests" - 2026-03-09
"Bun dev in traces spin up Docker setup" - 2026-03-03
"Eval skills for AI product evals and integration worktree" - 2026-03-04
"PR grouping and ordering for fixes from security assessment report PDF" - 2026-03-04
"Review PR 248 changes for developer workflow impact" - 2026-02-24

But wait, the user is asking about how WE handle larger projects in Traces. Let me think about what I can observe from the session titles and patterns. However, I think I should look at some specific sessions to get actual content about how larger projects are handled. Let me look at a few key sessions that mention worktrees, project plans, and phased implementation.

Actually, looking more carefully, there are sessions about:

Worktrees
Phased implementation
API first, then frontend
Project plans

Let me fetch some of these sessions to understand the patterns better. I'll need to use some way to get the content. But I don't have a direct tool to fetch trace content. Let me look at the project files themselves to understand the patterns.

Let me look at:

The docs folder for any project management docs
Some of the worktree/project related sessions if I can find files
The AGENTS.md which has "Managing Projects" section

Actually, I can look at the docs/ folder and any project-related files in the repo itself.

Read docs

Read projects

Read README.md

Read project-baton.md

Read feed-remodel.md

The feed-remodel.md doesn't exist in the projects folder. Let me look at a few more project files to understand the patterns. Let me also look at some of the traces sessions that seem to be about project management. Let me fetch some of the notable sessions by looking at what files exist in the traces directory and maybe grep for "worktree" or "project" or "phase" in the traces sessions. But I don't have a direct way to read trace session content.

However, from the AGENTS.md and the project-baton.md, I can already see a lot of patterns. Let me also look at a few more project docs and perhaps some relevant session logs if they are stored locally.

Let me look at more project docs to understand the patterns.

Read home-feed.md

Read trace-post-processing.md

Read onboarding-improvements.md

Read share-to-traces-skills.md

Now I have a good picture. Let me also look at a couple more project docs and perhaps some specific sessions. Let me also look at the trace session files if they are stored locally. But looking at the available info, I can see clear patterns:

From AGENTS.md: "Managing Projects" section
From project docs: project-baton.md, home-feed.md, share-to-traces-skills.md all show clear patterns of project management

Let me also look at a few more things - maybe the git worktree pattern, and some more docs.

Read git-integration.md

Read trace-visibility.md

Read namespace-improvements.md

Read scaling-plan.md

Now I have a very comprehensive understanding. Let me compile this into a well-written summary. The key patterns I've observed are:

Worktrees for isolation - Every significant project gets its own git worktree (e.g., ~/code/traces-worktrees/project-baton, feat-org-namespaces, etc.)
Project docs as source of truth - Every project gets a detailed markdown doc in docs/projects/ with executive summary, goals/non-goals, key decisions, phases, and verification commands
API-first development - Start at the lowest level (API, types) before moving to CLI/frontend
Phased, stacked PRs - Projects are broken into small, atomic, committable phases. Each phase is independently mergeable and reversible
TDD / failing tests first - Write failing tests before implementation (RED → GREEN)
Policy tests as source of truth - Centralize access rules in policy tests
Never change all surfaces at once - API, CLI, Frontend changes are decoupled
Clear project plans with checklists - Every project doc has atomic checklist items
Rebase workflow for stacked PRs - When a PR merges, rebase the next branch onto main
Test hygiene - Cleanup functions for every test, test behavior not structure
Explicit worktree locations - Project docs specify exact worktree paths
Commit attribution - AI commits include Co-Authored-By

Let me write a comprehensive summary and then share it to traces.

Used Traces Share

Here's the summary, written up and shared to Traces:

Share URL: https://traces.com/s/jn7cv65nqg078jp61bgs4h1my587dtcr

How We Handle Larger Projects in Traces

Over the past few months of building Traces, we've developed a consistent playbook for tackling larger features. This isn't process for its own sake; it's a set of habits that keep a small team moving fast without breaking things. Here's how it works.

1. Every Project Gets a Document

Before code, we write a project doc in docs/projects/. This isn't a vague RFC. It's a working document that lives with the code and gets updated as decisions change.

A typical project doc includes:

Executive Summary — what we're building and why
Goals & Non-Goals — scope boundaries to prevent scope creep
Locked Decisions — decisions already made (e.g., "invite expiry: 7 days")
Core Workflows — user stories as checkboxes
Phase-by-Phase Plan — atomic, shippable chunks
PR Stack — the sequence of stacked PRs with branch names and merge order
Verification Commands — exactly how to test it
Worktree Location — where the branch lives

Examples: docs/projects/project-baton.md, docs/projects/namespace-improvements.md, docs/projects/share-to-traces-skills.md.
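One way to keep docs honest about this structure is a small lint. The sketch below is not part of the Traces repo. It assumes each section is a level-2 markdown heading with exactly the names listed above, and it runs against a throwaway sample doc that deliberately omits PR Stack.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Throwaway sample doc (hypothetical); it deliberately omits "PR Stack".
doc="$(mktemp)"
cat > "$doc" <<'EOF'
# Example Project
## Executive Summary
## Goals & Non-Goals
## Locked Decisions
## Core Workflows
## Phase-by-Phase Plan
## Verification Commands
## Worktree Location
EOF

missing=""
for section in \
  "Executive Summary" "Goals & Non-Goals" "Locked Decisions" "Core Workflows" \
  "Phase-by-Phase Plan" "PR Stack" "Verification Commands" "Worktree Location"
do
  # -F fixed string, -x match the whole line, -q no output
  if ! grep -Fxq "## $section" "$doc"; then
    missing="$missing$section;"
  fi
done

echo "missing sections: ${missing:-none}"
```

Pointing `doc` at a real file under docs/projects/ would work the same way, provided the headings there follow the assumed naming.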

2. Git Worktrees for Isolation

Every significant project gets its own git worktree under ~/code/traces-worktrees/. This means:

main stays clean and deployable
You can context-switch instantly between projects
Long-running experiments don't clutter your main working directory
CI and local dev on main are never blocked by in-progress work

We have a helper script (scripts/new-worktree.sh) that creates a worktree and sets up the branch. When a project doc references a worktree, it always includes the exact path.
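The contents of the helper script aren't reproduced here. As a rough stand-in, this is what the same setup might look like with plain git worktree commands. The repository, the branch name feat/example-project, and all paths are placeholders created in a temp directory, not the real Traces layout.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Throwaway sandbox so the sketch never touches a real checkout.
sandbox="$(mktemp -d)"
repo="$sandbox/traces"
worktrees="$sandbox/traces-worktrees"

# Minimal repository with a single commit on main.
git init -q "$repo"
git -C "$repo" symbolic-ref HEAD refs/heads/main
git -C "$repo" -c user.name="Example" -c user.email="example@example.com" \
  commit -q --allow-empty -m "Initial commit"

# One worktree per project: a new branch in its own directory,
# sharing the object store with the main checkout.
mkdir -p "$worktrees"
git -C "$repo" worktree add -b feat/example-project \
  "$worktrees/example-project" main >/dev/null 2>&1

# main stays checked out in $repo; the project branch lives elsewhere.
main_branch="$(git -C "$repo" rev-parse --abbrev-ref HEAD)"
project_branch="$(git -C "$worktrees/example-project" rev-parse --abbrev-ref HEAD)"
worktree_count="$(git -C "$repo" worktree list | wc -l | tr -d ' ')"

echo "main checkout:    $main_branch"
echo "project worktree: $project_branch"
echo "worktrees:        $worktree_count"
```

When a project is finished, `git worktree remove <path>` cleans up the directory without affecting the main checkout.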

3. API-First, Bottom-Up Development

We start at the lowest level and work up:

Shared types (@traces/shared) — define the contract first
API layer — Convex queries, mutations, HTTP routes
CLI — command handlers, TUI integration
Frontend — React components, pages

This rule is encoded in our AGENTS.md:

"MUST: Start at lowest level — API, Types before frontend" "NEVER: Change API, CLI, Frontend all at once unless permitted"

The benefit: each layer validates the one below it. By the time you reach the frontend, the types and API contract are already solid.

4. Phased, Stacked, Reversible PRs

Large projects are never one big PR. They're a stack of small, atomic, independently mergeable PRs.

For example, the Namespace Improvements project shipped in 8 stacked PRs:

PR #61: Schema + shared types
PR #62: Session switching API
PR #63: Invites + org creation
PR #69: Avatar upload
PR #75: Slug collision handling
PR #86: CLI namespace commands
PR #89: Org creation limits
PR #117: Frontend org UI

Each phase:

Targets the previous PR's branch, then gets retargeted to main after merge
Is individually revertible without breaking other surfaces
States a clear "user experience after merge": what changes, if anything, for the end user
Includes its own tests

We use a rebase workflow: after PR N merges to main, rebase PR N+1 onto main, force-push, and retarget the GitHub PR.
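The rebase step can be sketched in a throwaway repository. Two things here are assumptions rather than something the project docs state: that PR N lands on main as a squash merge, and that the rebase uses the `--onto` form so only the commits unique to PR N+1 are replayed. The branch names pr-1 and pr-2 are placeholders.

```shell
#!/usr/bin/env bash
set -euo pipefail

sandbox="$(mktemp -d)"
git init -q "$sandbox/repo"
cd "$sandbox/repo"
git symbolic-ref HEAD refs/heads/main
git config user.name "Example"
git config user.email "example@example.com"

# Helper: add one file and commit it on the current branch.
commit_file() {
  echo "$1" > "$1.txt"
  git add "$1.txt"
  git commit -q -m "Add $1"
}

commit_file base            # main
git checkout -q -b pr-1     # PR N targets main
commit_file schema
git checkout -q -b pr-2     # PR N+1 targets pr-1
commit_file api

# PR N lands on main as a squash merge (assumption), so main gains
# a new commit that is not an ancestor of pr-2.
git checkout -q main
git merge -q --squash pr-1
git commit -q -m "Add schema (squashed)"

# Replay only the commits after pr-1 onto main.
git rebase -q --onto main pr-1 pr-2

# With a real remote, the remaining steps would be:
#   git push --force-with-lease origin pr-2
#   gh pr edit <number> --base main

commits_ahead="$(git rev-list --count main..pr-2)"
echo "pr-2 is $commits_ahead commit(s) ahead of main"
```

If PR N were merged with a regular merge commit instead, a plain `git rebase main` on the next branch would usually be enough, since git skips commits that are already reachable from main.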

5. Tests First (RED → GREEN)

We write failing tests before implementation. This is explicit in our project docs:

Phase 1: Tests (RED) — write tests that prove the desired contract
Phase 2+: Implementation (GREEN) — make them pass

Policy tests are the "source of truth" for access control. HTTP endpoint tests are minimal "wiring checks" that verify the policy is correctly hooked up. This means behavioral rules live in one place, and if you change a policy, you update one test file.

Test hygiene is strict:

Every test has a cleanup function
Test behavior, not code structure
Use simple, descriptive test names like feed_unauth_returns_public_only

6. Clear Project Checklists

Every project doc has atomic checklist items. Not "build the API" — that's too vague. Instead:

[ ] Add TraceRelationship and TraceRelationshipType in @traces/shared
[ ] Expose relatedTo?: TraceRelationship on shared trace read types
[ ] Map flat DB fields to nested shared relatedTo shape in API responses
[ ] Add returns validators to all public queries touched

These are concrete enough that an AI agent (or a human) can tick them off without ambiguity.
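Because the items are plain checkboxes, progress can be read mechanically. A minimal sketch, assuming items are written as markdown task-list lines starting with `- [ ]` or `- [x]`; the heading and which items are ticked in the sample are invented for illustration.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Sample checklist (hypothetical progress state).
doc="$(mktemp)"
cat > "$doc" <<'EOF'
## Phase 1
- [x] Add TraceRelationship and TraceRelationshipType in @traces/shared
- [x] Expose relatedTo?: TraceRelationship on shared trace read types
- [ ] Map flat DB fields to nested shared relatedTo shape in API responses
- [ ] Add returns validators to all public queries touched
EOF

# grep -c exits non-zero when the count is 0, so tolerate that under `set -e`.
done_count="$(grep -c '^- \[x\] ' "$doc" || true)"
open_count="$(grep -c '^- \[ \] ' "$doc" || true)"

# First unchecked item, with the checkbox prefix stripped.
next_item="$(grep -m 1 '^- \[ \] ' "$doc" | sed 's/^- \[ \] //' || true)"

echo "done: $done_count, open: $open_count"
echo "next: ${next_item:-nothing left}"
```

An agent resuming a project could run something like this first to find the next unchecked item before reading the rest of the doc.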

7. Never Bundle Cross-Surface Changes

A golden rule: don't ship API + CLI + frontend changes in the same release unless explicitly permitted.

For example, the Share-to-Traces Skills project had four separate releases:

Release A: Types/API hardening only
Release B: CLI headless share command
Release C: Agent skills wiring
Release D: Follow mode (live updates)

Each is revertible independently. If a CLI bug is found, you revert Release B without touching the API or frontend.

8. Failing Test Spikes (Intentionally Non-Mergeable)

Some projects include a "failing test spike" phase — a branch with tests that intentionally fail. This locks in the expected behavior before implementation details are finalized.

From the Baton project:

"This phase is allowed to be a non-mergeable spike branch with failing tests."

This is a contract-first approach. Once the tests describe the desired behavior, the implementation becomes a matter of making them green.

9. Explicit Worktree + Session State

When resuming a project after a break, the project doc and worktree location make it trivial to pick up where you left off. The doc is the baton. It contains:

What was the last completed phase
What's in progress
What decisions are still open
What the current branch name is

This means weeks can pass between sessions on a project, and you can resume without re-reading the entire codebase.

10. Verification Is Part of the Plan

Every project doc ends with exact verification commands:

```bash
# Run feed tests
cd /Users/tarunsachdeva/code/traces/api
bun test tests/v1/feed.test.ts --env-file .env.local

# Manual verification
curl http://localhost:3211/v1/feed
curl -H "Authorization: Bearer <token>" http://localhost:3211/v1/feed
```

There's no ambiguity about "how do we know this works?" The answer is in the doc.

What This Looks Like in Practice

When we started the Namespace Improvements project, the first thing we did was create docs/projects/namespace-improvements.md with locked decisions ("invite expiry: 7 days", "session switching mutates existing session"). Then we created a worktree at ~/code/traces-worktrees/feat-org-namespaces. Then we wrote the schema and shared types (PR #61). Then session switching with 20 tests (PR #62). Then invites with 41 tests (PR #63). Each PR was reviewed, merged, and the next was rebased. The frontend UI came last (PR #117), 6 PRs after the API work started.

The result: a complex feature (multi-tenant namespaces with invites, roles, and org creation) shipped incrementally, with no big-bang releases, and every intermediate state was deployable.

The Principles Behind It

These aren't arbitrary rules. They emerge from a few core beliefs:

Clarity over speed. A clear plan with atomic steps is faster than rushing into code.
Contracts over assumptions. Types and tests define the contract. Implementation follows.
Reversibility. Every change should be undoable without cascading breakage.
Isolation. Worktrees and decoupled surfaces prevent one project's mess from blocking everything else.
Documentation as execution. The project doc isn't a proposal — it's the source of truth for what we're building, why, and in what order.

If you're building something with AI agents as teammates, this structure is even more important. Agents need context. A well-maintained project doc is context they can read and act on. A clean worktree is a sandbox they can work in without fear. Atomic phases are tasks they can complete. And stacked PRs are contributions they can make without overwhelming your main branch.

This is how we build Traces. If you're working on something similar, steal what works and ignore what doesn't. The best process is the one you actually use.

Summarize large project workflow in Traces
