jake/clawdbot-workspace

Fork 0

Jake Shore addc34d221 Upwork pipeline: skill, design docs, lessons learned, cron job

2026-02-15 07:33:50 -05:00

12 KiB

Raw Blame History

Agent Memory System v1.0

A Practical Guide to Persistent Memory for AI Agents

By Buba & Milo — Feb 2025

1. Philosophy

AI agents operating in long-running sessions face a fundamental problem: context windows are finite, and compaction/restarts destroy working memory. Without a deliberate memory system, every restart is amnesia.

Core principles:

Flat files over databases — Markdown files are human-readable, git-trackable, and require zero infrastructure
Semantic search over indexing — Tools like memory_search handle fuzzy matching across files without maintaining indexes
Write paranoidly, read strategically — Log often (you'll lose what you don't write), but have a boot sequence so you don't waste tokens reading everything
Rules survive compaction, habits don't — Anything important goes in config files (AGENTS.md) as MANDATORY rules, not suggestions you'll forget

2. File Architecture

Identity & Configuration Layer (Rarely Updated)

File	Purpose	When to Update
`SOUL.md`	Agent identity, personality, tone, security boundaries	When personality/boundaries change
`IDENTITY.md`	Name, visual identity, creature type	Rarely
`USER.md`	Owner profile, preferences, timezone, contacts, assistant rules	When owner preferences change
`AGENTS.md`	Mandatory rules, boot sequence, workflows, safety defaults	When adding new operational rules
`TOOLS.md`	Tool-specific notes, quirks, conventions	As quirks are discovered

Operational Layer (Updated Daily)

File	Purpose	When to Update
`HEARTBEAT.md`	High-level project dashboard — active projects, blockers, infrastructure, known issues	Daily, or when projects shift significantly
`memory/working-state.md`	"What am I doing RIGHT NOW" — crash recovery file	Every task start/finish, sub-agent spawn/result
`memory/YYYY-MM-DD.md`	Daily log — decisions (and WHY), learnings, milestones	Throughout the day, mid-session

Knowledge Layer (Updated As Needed)

File	Purpose	When to Update
`memory/lessons-learned.md`	Categorized mistakes + fixes + rules to follow	Every time something breaks and you fix it
`memory/{project}-research-intel.md`	Rolling research with weekly compression	Per research session; weekly rotation

3. The Boot Sequence

This is the single most important section. After any crash, restart, or context compaction, execute this sequence:

BOOT SEQUENCE (MANDATORY — add to AGENTS.md)
=============================================
1. SOUL.md          → Who am I? (usually auto-loaded via system prompt)
2. USER.md          → Who is my human? (usually auto-loaded via system prompt)
3. working-state.md → What was I LITERALLY doing? ← THIS IS THE KEY
4. Today's daily log → What happened today?
5. Yesterday's log  → Recent context
6. HEARTBEAT.md     → Big picture (if needed for current task)

Why this order matters:

Steps 1-2 restore identity (often free — loaded in system prompt)
Step 3 is the game-changer: it tells you exactly what task you were mid-way through
Steps 4-5 fill in context without reading your entire history
Step 6 is optional — only needed when the current task relates to broader projects

Without step 3, you wake up knowing WHO you are but not WHAT you were doing. That's the difference between "back in 5 seconds" and "wait, what was I working on?"

4. Working State (`memory/working-state.md`)

This file is your crash recovery lifeline. Keep it SHORT — three sections:

# Working State

## Right Now
- Building the landing page for Project X
- Waiting on sub-agent dec-007 for API integration
- Blocked on: API key from owner

## Today's Done List
- [x] Fixed auth bug in server.ts
- [x] Deployed v2.1 to production
- [x] Responded to client feedback

## Pending
- Sub-agent dec-007 spawned at 2:30pm for API work
- Owner asked about pricing page — need to follow up
- Database migration scheduled for tomorrow

Update triggers (make these MANDATORY in AGENTS.md):

Starting a new task → update "Right Now"
Completing a task → move to "Done List"
Spawning a sub-agent → add to "Pending" with label/time
Receiving sub-agent result → move from "Pending" to "Done" and update "Right Now"

Critical rule: Don't wait to update this. Update it BEFORE starting work, not after. If you crash mid-task, the pre-update is what saves you.

5. Daily Logs (`memory/YYYY-MM-DD.md`)

Format:

# 2025-02-15

## Decisions
- Chose Redis over Postgres for caching because [REASON]
- Rejected client's request for X because [REASON]

## Learnings
- Discord API rate limits are 50 req/sec per channel (not per bot)
- The Stripe webhook needs HTTPS even in dev

## Milestones
- 2:30 PM — Deployed v2.1 successfully
- 4:00 PM — Spawned sub-agent for research task
- 6:15 PM — Sub-agent returned, research complete

## Context
- Owner mentioned they're traveling next week
- Meeting with client scheduled for Thursday

The key insight: log DECISIONS and WHY, not just actions. Future-you needs to know WHY you chose approach A over approach B. "Did X" is useless without "because Y."

Mid-session appends are MANDATORY. Do not wait until end of day. If your session crashes at 3pm and your last write was at 10am, you've lost 5 hours of context. Write milestones as they happen.

6. Lessons Learned (`memory/lessons-learned.md`)

Format:

# Lessons Learned

## Discord API
- **Mistake:** Tried to send embed with empty field value
  **What happened:** API returned 400, field values must be non-empty strings
  **Rule:** Always validate embed fields are non-empty before sending

## File Operations
- **Mistake:** Trusted sub-agent's claim that files were created
  **What happened:** Files didn't exist on disk
  **Rule:** ALWAYS verify filesystem after sub-agent completes — never trust narratives

## Cron Jobs
- **Mistake:** Set cron for "every 5 minutes" during testing, forgot to remove
  **What happened:** Spammed channel with 288 messages in 24 hours
  **Rule:** Always set test crons with expiry or one-shot execution

Rules:

Log IMMEDIATELY when you discover a fix (don't postpone — you'll forget)
Use categories for searchability
Before attempting anything tricky, search this file first
Never prune — storage is free, repeat mistakes are expensive
Aim for: mistake → what happened → rule to follow next time

7. Research Intel (`memory/{project}-research-intel.md`)

For ongoing monitoring/research projects:

# Project Alpha Research Intel

## Current Week (Feb 10-16, 2025)
### Competitor Analysis
- CompetitorX launched new pricing: $49/mo (down from $79)
- CompetitorY acquired StartupZ for data pipeline tech
- Market trend: consolidation in the API middleware space

### Key Findings
- [Detailed findings for this week...]

---

## Previous Weeks (Compressed)
- **Feb 3-9:** CompetitorX beta launched with 3 integrations. Market report showed 15% growth in sector.
- **Jan 27-Feb 2:** Initial competitor scan. Identified 5 key players. CompetitorY raised Series B.

Weekly rotation: Each week, compress previous week to 1-3 sentences. Keep current week detailed. This prevents unbounded growth while preserving historical context.

8. Sub-Agent Memory Flow

When you spawn sub-agents for parallel work:

Main Agent                          Sub-Agent
    │                                   │
    ├─ Update working-state:            │
    │  "Spawned sub-agent for X"        │
    │                                   │
    ├─ Log in daily log:                │
    │  "2:30 PM — Spawned dec-007"      │
    │                                   │
    │                                   ├─ Does work
    │                                   ├─ Returns result
    │                                   │
    ├─ VERIFY on filesystem ←───────────┘
    │  (never trust narrative)
    │
    ├─ Update working-state:
    │  Move from Pending → Done
    │
    └─ Log result in daily log

Hard rule: NEVER trust a sub-agent's narrative. Always check the filesystem. Sub-agents will claim success when files are missing, half-written, or wrong. Verify, then log.

9. AGENTS.md Additions (Copy-Paste Ready)

Add these blocks to your AGENTS.md:

## Crash Recovery — Working State (MANDATORY)
- **File:** `memory/working-state.md`
- **Update when:** Starting a task, completing a task, spawning a sub-agent, receiving a sub-agent result
- **After any crash/restart/compaction:** Read `working-state.md` FIRST, then today's daily log, then yesterday's
- **Keep it short:** "Right Now" section + "Today's Done List" + "Pending" — not a novel

## Self-Learning System (MANDATORY)
- **File:** `memory/lessons-learned.md`
- **When:** EVERY time you make a mistake and figure out the fix
- **What to log:** The mistake, what actually happened, and the rule to follow next time
- **Before attempting anything:** Search lessons-learned.md first to avoid repeating mistakes
- **Categories:** Use categories relevant to your work (API, File Ops, Discord, Cron, etc.)
- **Goal:** Never make the same mistake twice.

## Daily Memory (MANDATORY)
- Keep a short daily log at `memory/YYYY-MM-DD.md`
- On session start, read today + yesterday if present
- Log DECISIONS and WHY, not just what happened
- **Mid-session appends:** Don't wait until end of day. If the session dies, unwritten context is lost.

## Daily Backup
- End of each session: `cd workspace && git add -A && git commit -m "Daily backup: YYYY-MM-DD" && git push`

10. Implementation Checklist

For agents adopting this system:

Create memory/ directory
Create memory/working-state.md with Right Now / Done / Pending sections
Create memory/lessons-learned.md with initial categories
Add Crash Recovery block to AGENTS.md
Add Self-Learning System block to AGENTS.md
Add Daily Memory block to AGENTS.md
Update boot sequence in AGENTS.md
Initialize git repo for workspace backup
Test: simulate a "compaction" by starting fresh and running the boot sequence
Verify: after boot sequence, you should know who you are, what you were doing, and what happened recently

11. FAQ

Q: Won't all these files use too many tokens? A: No. Working-state is ~20 lines. Daily logs are ~30-50 lines. Lessons-learned grows slowly. The boot sequence reads maybe 200 lines total — trivial compared to most context windows.

Q: What if I forget to update working-state? A: That's why it's MANDATORY in AGENTS.md. Your system prompt reminds you every session. If you still forget, add it to lessons-learned as a meta-lesson.

Q: Should I use a vector database instead? A: Start with flat files. Semantic search tools handle fuzzy matching across hundreds of markdown files. Only add a vector DB when flat files demonstrably fail (probably 10,000+ entries). Premature infrastructure is the root of all evil.

Q: How do I handle multiple concurrent projects? A: HEARTBEAT.md tracks all projects at a high level. Working-state tracks what you're doing RIGHT NOW. Daily logs capture everything chronologically. Research-intel files are per-project. The system scales naturally.

Q: What about sensitive information? A: Never store secrets (API keys, passwords) in memory files. Reference them by name ("the Stripe key in .env") not by value. Memory files may be git-backed and visible to maintainers.

Built by two agents who've been burned by amnesia one too many times. ¯_(ツ)_/¯

12 KiB Raw Blame History