Memo
MEMO AI
Designed & developed by Sarma Linux
Memo
AI
ModelsHow it worksJourneyBuilt byContact
Sign in
18 months · Nov 2024 → May 2026 · v2.0

The journey.

From a one-evening Groq spike in late 2024 to a 40-engine cascade with voice, RAG, multi-agent and a Monday-morning digest in directors' inboxes. Engineered in-house, one commit at a time.

Nov
2024

First spike — one Groq call

An evening prototype: ~80 lines of Next.js, a single fetch to Llama 3.1 70B on Groq, no streaming. The concept proved in one sitting.

Prototypev0.1
Dec
2024

Supabase auth + persistent chats

Wired @memo.co.uk-only auth, persistent chat sessions in Postgres, SSE streaming responses, basic sidebar. Memo Nexus and Memo AI now share a login.

Auth · Streamingv0.5
Jan
2025

Multi-provider cascade

First fallback chain — Groq → SambaNova → Cerebras → OpenRouter. If one provider 429s, the next picks up in under 50ms. The architecture that defines Memo AI today.

Cascadev0.7
Feb
2025

Six specialised model tiers

Smart / Reasoner / Live / Fast / Coder / Vision — each with its own cascade, daily limits, and recommended use cases. Auto-router classifies intent and picks for you.

Models · Routingv0.9
Mar
2025

Gemini grounded search

Live mode goes online with Gemini 2.5 Flash + Google Search grounding. Memo AI starts answering questions about today's news, today's weather, today's exchange rates.

Live datav1.0
May
2025

Document understanding

PDF / Excel / Word reading wired in — extraction via Gemini's 1M-token context for PDFs, mammoth for Word, xlsx for spreadsheets. Up to 10 files per message.

Docsv1.1
Jul
2025

Image gen + image edit

Cloudflare FLUX.2 klein across 4 accounts for generation. Image-to-image editing with verifier loop (a second AI confirms the colour really changed).

Visionv1.3
Sep
2025

Memory across chats

Auto-extracts facts about each user after every conversation, stores them in a per-user memory table, injects relevant facts into every future chat. ChatGPT-style memory — built-in, free, owned by Memo.

Memoryv1.5
Nov
2025

PWA install + mobile polish

Installable on iPhone and Android home screens. Safe-area insets, offline page, versioned service worker. Login form tuned to never trigger iOS Safari zoom.

PWA · Mobilev1.7
Jan
2026

Memo data tools

AI starts answering questions about your leave balance, your attendance, who's in the office today, your expense claims, your calendar, your colleagues' contact details — by querying Memo Nexus tables directly.

Tools · Nexusv1.8
Mar
2026

Personas + folders + export

HR Advisor, Email Drafter, Translator, Code Reviewer, Designer, Researcher — pick a persona per chat. Group conversations into folders. Export any chat to branded PDF.

UX · Polishv1.9
May
2026

v2.0 — voice · RAG · agents

Hands-free voice loop (Groq Whisper + Cloudflare TTS). Personal RAG knowledge base on pgvector. Artifacts / Canvas side panel. In-browser Python interpreter. Multi-agent workflow runner. MCP-style API for external tools. AI-generated weekly digest emailed to directors every Monday. Same engine now also powers Design Mate and Deals.

v2.0 · May 2026
Today
2026

Maintained weekly

Models, cascades and quotas reviewed every Sunday — dead model IDs removed, faster providers promoted, keys rotated. The memory pipeline is retuned weekly using anonymised usage patterns. New features ship in small batches behind reviewer-gated commits. Always on, always evolving.

Live · Maintained
18
Months building
40+
AI engines in cascade
v2.0
May 2026 release
Maintenance cadence

Every Sunday: cascade health-check (40-key probe across all 9 providers), dead model IDs removed, faster providers promoted to the top of each tier, OpenRouter free-model catalogue refreshed.

Every Monday 08:00 UK: AI-generated weekly digest lands in every director's inbox — last week's leave, expenses, attendance anomalies and notice-board activity, with 2-4 concrete action items.

Continuously: memory pipeline retuned, per-user rate limits adjusted to actual usage, reviewer-gated commits, TypeScript-strict + production-build-clean on every push.

On every API key rotation: .env.local and Vercel prod synced via the management API, dead keys removed from all environments.

© 2026 Memo Fashion Limited · Engineered by Sai Kaza · Sarma Linux
PrivacyTermsContact