Playbook Live Dashboard Story About ⭐ GitHub hello@usecompai.com
Built in Public · 8-Figure Consumer Brand · Live in Production

We built our company
an AI brain.

1,341 documents covering every operational decision in our business. 167 executable procedures any AI can run. 44 integrated tools the brain reads and writes to.

One operating layer that turns legal policies, financial data, operational processes and company rules into something LLMs can actually use to execute.

Now every employee runs on AI. Seven agents run operations on the same brain.

€352/month. 18:1 ROI. Six months live in our 8-figure consumer brand.

€352
/ Month All-In
18:1
Audited ROI
62h
Saved Per Week
6+
Months In Production
The Problem

Brands and retailers drown in operational work that AI should already own.

For every €1 a brand spends on software, it spends €6 on services and headcount to operate that software. Shopify costs €2K/year. The people managing inventory, processing orders, answering tickets, and closing the books cost €200K+.

Most "AI agents" shipped in 2026 are chatbots that forget. Tools that break when models update. Demos that never make it to production. Static playbooks that are outdated the day they're published. Prompt libraries that break on the next model release.

After eighteen months of trying tools, hiring agencies, and stitching together automations, we stopped. We rebuilt the operation around one principle: every recurring decision should be made by an agent with memory, context, and accountability. This site is what came out of that.

The Primitive

We built our company a brain.

Eighteen months ago, every recurring decision in our business was made by someone remembering something. The refund policy lived in three different Slack messages and a Notion page from 2024. Pricing exceptions were tribal knowledge. The incident playbook was the founder's memory. How customer service responded to a delayed shipment depended on which agent picked up the ticket.

We extracted all of it.

📚
1,341 documents
The knowledge layer

Pulled from email, Slack, support tickets, past reports, meeting notes, the founder's memory, and 18 months of operational decisions. Structured. Versioned. Searchable. Versioned again every week.

🛠️
167 executable skills
The procedures layer

Procedures the brain can run on demand. Close the books. Triage a refund. Run weekly P&L. Reconcile last week's bank movements. Each one documented, parameterized, and callable from any agent or any employee with a Claude window open.

🔌
44 integrated tools
The systems layer

Every business system the brain can read or write — Shopify, Klaviyo, Notion, Slack, GA4, the accounting stack, the helpdesk, the warehouse, the bank. Exposed via MCP. Governed via ACL. Audited via logs.

🌱
Self-updating
The compound layer

Every operational decision adds to the brain. Every conversation that mattered. Every fix that worked. Every pattern that emerged. The brain at month six is qualitatively different from the brain at month one — and that gap keeps widening.

Y Combinator's Tom Blomfield recently called this "the missing layer between company data and reliable AI automation. Every company in the world is going to need one." We just call it the brain.

What changed

Every employee runs on AI now. Not as a tool — as a teammate.

Once the brain existed, the way we work changed. Not because we asked the team to "use AI more." Because suddenly there was something worth using.

🧠
The founder
Monday 9:00 AM

Opens Claude. "What changed last week and what should I look at first?" The brain returns the weekly synthesis — revenue, three escalated tickets, an inventory imbalance forming on the new collection, marketing fatigue on the welcome flow. Six minutes of review. No tabs opened.

💬
The CS lead
Tuesday 11:00 AM

A customer asks why their refund hasn't processed. "Status of order 8341, refund timeline, who promised what." The brain returns the full thread, the policy that applies, what's been said, what to say next. Reply drafted in 90 seconds. No checking five systems.

📊
The finance lead
Friday 4:00 PM

"Reconcile last week's bank movements against expected payments. Flag anomalies." The brain runs through accounting + bank + Shopify, generates the variance list, explains each anomaly. What used to be Saturday morning work is now Friday afternoon.

🧶
The merch lead
Tuesday 10:00 AM

"Sell-through by size for the current collection, versus the same week last year, flag any size at risk of stockout or markdown." Brain pulls Shopify + warehouse + wholesale pipeline. Decision made before lunch. No spreadsheet, no analyst.

This is the difference between AI as a feature and AI as a foundation. When the brain holds the context, the model becomes useful for everyone — not just the technical team.

On top of the brain

Seven agents run the operations that don't need a human.

Once the brain existed, automation became cheap. The same context that powers human work also powers seven AI agents running 24/7 across the operation. They don't replace employees — they handle the recurring decisions nobody wants to make at 3 AM. Each agent reads from the brain, writes to the brain, and contributes patterns the brain keeps.

🧠
Strategy Agent
Hub · Orchestration

Morning briefings, cross-domain synthesis, competitive scans, knowledge mining, agent coordination.

💬
CS Agent
Customer Service

Ticket triage, WISMO responses, drafts with brand voice, pattern detection across tickets, escalation routing.

📊
Finance Agent
GSheets · Holded · Revolut

Weekly P&L, AR follow-ups, invoice reconciliation, multi-currency treasury, variance alerts.

🏪
Retail Agent
TC Analytics · POS

Daily store reports, foot-traffic-driven staffing, inventory transfer flags, store A vs store B comparisons.

📣
Marketing Agent
GA4 · Klaviyo · Meta Ads

Campaign analysis, segmentation, subject line patterns, SEO opportunity mining, attribution tracking.

🧶
Merch Agent
Stockagile · Wholesale

Sell-through analysis, inventory distribution by variant audits, markdown candidates, pricing positioning, wholesale ops.

👥
HR Agent
Notion · Holded · Payhawk

Absence reports, payroll prep, vacation balances, onboarding, expense categorization.

⌨️
Command Center
Founder Interface · 44 Tools

Claude Code with all 44 MCP tools. The founder's direct interface to every agent and every system in the swarm.

1,341 brain docs 44 MCP tools 167 skills 32 production lessons 91% autonomous operations
The Math

Honest ROI — with every assumption on the table.

Most AI vendors quote numbers they can't defend. "10× productivity." "50× return." "Pays for itself in a week." Then you ask how they calculated it and the conversation gets vague. Here's exactly how we get to 18:1.

Hours Saved Per Week — Audited Across 6+ Months
CS Agent — triage, drafts, policy lookups20h
Founder time freed — briefings & decisions10h
Finance — P&L, AR, reconciliation8h
Merchandising — sell-through & sizes6h
Retail — daily reports & staffing5h
Marketing — campaigns & segments5h
Strategy — synthesis & knowledge5h
HR — admin & payroll prep3h
Total hours offloaded weekly62h
62h/week × 52 weeks = 3,224 hours/year
Valued at €21/h loaded operational labor + €40/h founder opportunity cost
= €77,584 / year in reclaimed labor value

System cost (all-in): €4,224 / year (€352 / month)
RATIO: 18.4 : 1 · Payback: ~20 days

€21/h loaded labor derived from personnel budget ÷ headcount ÷ ~2,000 working hours × role mix. Conservative. No revenue impact claimed — every number here is a cost avoided.

Living System

This playbook is updated from verified production learnings.

We run our brand on this system. Every Monday the agents push the week's learnings to a shared brain. Every Friday we review what changed and decide what gets promoted into the playbook. You're not reading AI theory. You're reading the operating manual of a brand that is, right now, processing tickets, closing books, and reordering stock.

Daily
Our agents run. Our crons extract patterns. New learnings are written to the brain. The playbook reflects it.
Weekly
Pattern extraction cron identifies new operational patterns. Best ones are promoted to the pattern library. Kit users get them automatically.
Network
As more brands deploy compAI, their anonymized learnings compound into the shared library. Every deployment makes every other deployment smarter.

This is not a course you buy and forget. It is operating documentation backed by a production system. Updates happen when there is something real to ship, not to simulate motion.

Live Right Now

See the system running.

Production activity feed, honest ROI breakdown, real cost data, and a governed public snapshot of the swarm. No mockups, no vanity KPI theater, and no hidden spreadsheet math.

Live activity feed
Real cost breakdown
Try the demo yourself
Open the Live Dashboard →
The Depth

15+ specialized capabilities you won't find anywhere else.

Most AI demos show one capability at a time. A system running for six months accumulates depth: corner cases, weird customer behaviors, recovery patterns, decisions that are too specific for a sales deck but too valuable to forget.

01
AutoResearch

Self-evolving prompts. The CS agent measures its own response quality, mutates underperforming prompts, and auto-promotes improvements. 94.7% accuracy after 3 months of evolution.

02
LLM Council

Six domain-expert agents + blind peer review for high-stakes decisions. Strategy, finance, retail, CS, marketing, merch. ~€1 per deliberation. 2-4 minute response time.

03
Pattern Library

Cross-company knowledge library with strict anonymization. New deployments start at 70%+ autonomy instead of zero. REST API, live. The real moat.

04
Invoice Pipeline

Inbox → OCR → classify → rename → file in Drive → reconcile against POs → log in master sheet → route for approval. 5 minutes per invoice drops to <30 seconds.

05
Profitability Engine

Real-time CM3 per product across representative commerce, inventory, ads, analytics, and finance sources. Shopify, Stockagile, and Holded are typical examples. Every product has a live CM3 number.

06
Copy Engine

1,114 email campaigns analyzed for subject line, body, CTA, and performance patterns. Learned rules like "ALL CAPS = 2.7× revenue (if <15% of sends)" drive future drafts.

07
Taskmaster Protocol

Contract-based cascading execution for multi-step operations. Each step has acceptance criteria. If any step fails, the entire operation rolls back atomically.

08
GEO Optimization

Tracks brand visibility in ChatGPT, Perplexity, Claude, and Google AI Overviews. Reverse-engineers cited content and optimizes. Mention rate improved from 35% to 60%.

09
Agent With Its Own Credit Card

The strategy hub has its own Visa corporate card with a monthly limit. Autonomous expense management on approved categories. Receipts auto-filed.

The Moat

The cold-start advantage.

Most AI deployments start at zero. The first month goes into calibrating prompts, tuning confidence thresholds, debugging edge cases nobody warned you about. We're keeping the patterns we've built and sharing them. Anything that follows starts where we left off — not at zero.

01
The Pioneer

3 months to reach 91% autonomy. Learns everything from scratch.

05
Fifth Deployment

Starts at 70% autonomy day one. Pattern library pre-loaded.

20
Twentieth Deployment

Starts at 85% autonomy with industry-specific patterns built in.

Every deployment makes every other deployment smarter.

21 patterns · 9 domains REST API live Strict anonymization Weekly auto-extraction
Positioning

What compAI is not.

The category is noisy. Most "AI for business" products are one of these things. We're not any of them.

compAI is the operating manual for running a brand on AI agents. Six months in production across CS, finance, merchandising, retail, marketing, and people ops. 32 documented lessons. Honest 18:1 ROI. Every claim auditable. Read it, run it, or have us wire it in.

What's in the repo

Everything we built, open and executable.

Not a marketing site about AI. The actual playbook, the actual skills, the actual prompts, the actual templates. Each one written and validated inside an 8-figure consumer brand operating on AI for over a year.

📖
The playbook
33+ chapters

The brain primitive. The MCP server. The agents. The advanced capabilities. The production lessons. The honest open questions. Read it online or fork the markdown.

🛠️
Skills library
146 .md files

Executable procedures any AI client can run. Close the books. Triage a refund. Run weekly P&L. Reconcile bank. Each callable from a Claude window with the brain mounted.

💬
Prompts library
Copy-paste-ready

The prompts behind every agent: customer service, finance, marketing, retail, merchandising, people ops, strategy hub. Categorized, anonymized, ready to adapt.

🔌
Integration guides
Setup how-tos

How to wire your stack — Shopify, Klaviyo, Notion, the helpdesk, the accounting tool, GA4 — into the brain via MCP. Three-step guides with sample code.

📐
Templates
Brain folder, SOULs

The folder structure of the brain. The agent SOUL.md template. The factory.yml. The skill.md template. The custom-instruction template. Drop-in starting points.

🧩
Pattern library
21 patterns · 9 domains

Anonymized operational patterns from a year of running this. New deployments inherit instead of starting from scratch. YAML schema + REST API.

📊
Profitability engine
Production-validated

Daily DTC contribution-margin calculator, validated to within 0.4% of the monthly accounting close. The spec, the bug fixes, the dashboard. All open.

🚨
Production lessons
32 incidents

What broke and what we did. Token leaks. Webhook downtime. Memory degradation. Provider rate limits. Each one with a one-rule lesson others can use.

License: Free to use, attribution requested. Updated as production learnings are verified.

Open Source

Take it. Fork it. Build your own.

Everything we built is in a public repo: the playbook, every executable skill, the prompts library, the integration guides, the templates. Free to use, attribution requested. The kind of resource we wish had existed when we started.

33+
Essays / chapters
146
Skills (.md files)
21
Patterns
32
Production lessons

License: Free to use, attribution requested. Need hands-on help wiring this into your business? hello@usecompai.com — we work with up to three brands at a time.

Questions

Things you're probably wondering.

Because we wish this had existed when we started. The hard part of becoming AI-native isn't tooling, it's domain knowledge — and that takes 12-18 months of trial, error, and rebuilds to extract. We've already done that work. Sharing it doesn't dilute it. Most readers will benefit from the playbook alone. A few will want hands-on help, and that's where the work earns income.
Operators of consumer SMEs — beauty, food, home, wellness, pet, outdoor, fashion, retail — who want to make their company AI-native and stop firefighting tool sprawl. You don't need to be technical to read the playbook. You do need a technical operator (or us) to deploy the brain inside your company.
The full playbook in markdown, 146 executable skill files, the prompts library, the integration guides, the brain folder template, the SOUL.md template, the factory.yml template, the pattern library YAML schema. License: free to use, attribution requested. Fork it, adapt it, ship it.
Lindy is horizontal — you still build everything. Artisan and 11x sell single-purpose AI employees. Sierra and Siena do customer service only. This is the operating substrate underneath — the brain that makes the agents possible. The agents are the easy part once the brain exists.
62 hours of operational labor offloaded per week, multiplied by 52 weeks, valued at €21/hour loaded labor plus €40/hour founder opportunity cost. €77,584/year reclaimed against €4,224/year all-in system cost. Every number is a cost avoided. No revenue impact claimed. Full breakdown.
If you want to deploy the artifacts yourself: yes, you need a technical operator comfortable with command line, VPS setup, and API keys. If you don't have one internally, write to hello@usecompai.com — we work with up to three businesses at a time on hands-on implementations.
Everything runs on your own infrastructure — your VPS, your inference accounts, your databases. LLM inference is direct API (not consumer products), so your prompts aren't used to train models. EU-hostable. GDPR-compatible by default.
The system is model-agnostic. When Claude, GPT, or Gemini ships an update, you swap the model config without changing the architecture. The brain doesn't care which model reads it. When integrations break (they do), the playbook documents the fix. See the 32 production lessons.
The reference deployment runs in consumer retail with omnichannel — DTC, physical retail, wholesale. The architecture is vertical-agnostic. Where the examples sound like one vertical, the chapter explains how the same pattern works for beauty, food, home, wellness, pet, outdoor, fashion, and any DTC/retail business.

Read it. Fork it. Build your own.

Open playbook. 33+ chapters. 146 skills. 21 patterns. 32 production lessons. Live dashboard. Every artifact downloadable. Updated as production learnings are verified.

Free to use, attribution requested · Questions? hello@usecompai.com