Built in-house · May 2026 release

18 months. One product. One platform.

Memo AI is engineered in-house for Memo Fashion. A focused product built around how this company actually works — not a SaaS reskin, not a ChatGPT wrapper, not outsourced.

Every line of code, every cascade decision, every animation, every edge case — hand-written. No templates. No agency. No off-the-shelf widgets.

From the 9-provider fault-tolerant cascade to the pgvector RAG pipeline, the hands-free voice loop, the in-browser Python interpreter, and the Monday-morning AI digest that lands in directors' inboxes — every detail is designed for Memo Fashion's daily workflow. Refined commit by commit with feedback from the team. Maintained every weekend.

The result is a tool that feels like the polished AI products you'd pay £20–£40 per seat per month for — except it's built specifically for how Memo Fashion works, runs entirely in-house on free tiers, and costs the company nothing extra. The same engine also powers Design Mate (technical-pack drafting) and Deals (forex admin) over a shared API.

18
Months building
v2.0
May 2026 release
~120K
Lines of code
£0
Memo's monthly cost

"Memo Fashion deserves an AI tool that feels like our own — designed for our workflow, branded for our team, and free of subscription fees. So we built one, in-house, and we keep tuning it every weekend."

— Memo Fashion engineering

What's in the box

A non-exhaustive list of what ships in v2.0 — 18 months of focused engineering

Fault-tolerant cascade across 9 providers — Groq, SambaNova, Cerebras, Gemini, OpenRouter, NVIDIA, Tavily, Cloudflare, Anthropic. 40+ keys with automatic <50ms failover.
Hands-free voice loop with VAD silence detection — Whisper STT (~150ms) + Cloudflare MeloTTS, full conversational loop, works in iOS PWA.
RAG knowledge base on pgvector — Gemini text-embedding-004, ~800-token chunking with overlap, top-5 retrieval with cosine ≥0.45 threshold.
Artifacts / Canvas side panel — code rendering, sandbox-isolated HTML preview, copy + download with smart file extensions.
In-browser Python interpreter via Pyodide — CDN-loaded, package auto-install on import, sandboxed, no server cost.
Multi-agent runner with SSE streaming — planner → workers → synthesiser pipeline, abort-on-disconnect to save quota.
MCP-style HTTP tool catalog at /api/mcp — bearer-auth, timing-safe compare, CORS allowlist, rate-limited, 7 read-only tools.
Cron-driven weekly AI digest emailed to directors every Monday — pulls 7 days of activity, AI-generates a 5-section brief.
Memory across chats — auto-extracted facts after every conversation, per-user UI to view, add or delete entries.
Persona overlays — HR, Email, Translator, Coder, Designer, Researcher — layered on the base Memo brand prompt.
Folders + sidebar search highlights — collapsible groups, drag-to-folder via menu, persisted collapsed-state per device.
Tool-calling for Memo data — leave balance, attendance, who's in office, expense status, calendar, directory lookups.
Custom SSE streaming pipeline — no external SDKs, full event types (token / thinking / model_used / done).
Native PDF chat — Gemini 1M-token context reads whole PDFs as inline_data, ~700-page ceiling, full layout preserved.
Image generation — Gemini Nano Banana + Cloudflare FLUX.2 klein 9B/4B + FLUX.1 schnell + Pollinations, with edit verifier.
PDF / Word / Excel export of any chat session, branded with the Memo header.
PWA with safe-area insets, offline page, versioned service worker, install prompt with iOS share-sheet fallback.
Per-user in-memory token-bucket rate limits on STT, TTS, export, KB ingest and multi-agent runs.
Temporary chat mode, persona-aware context isolation, sandboxed HTML preview, sanitised stream chunks.
Supabase Management API integration — migrations applied programmatically, env synced across Vercel envs.