April 2026 · Powered by DeepSeek · Gemini · Groq · SambaNova · Cerebras

Your work,
amplified.

Memo AI is an intelligent work assistant built exclusively for Memo Fashion. Powered by DeepSeek V3.2 (685 billion parameters), Google Gemini 3, and 34 more AI engines — with live exchange rates, real-time weather, container tracking, image generation, and persistent memory that learns who you are across every conversation.

or

Only @memo.co.uk accounts are permitted

Online
Good morning, Memo
How can Memo AI help you today?
✉️ Write an email
🔍 Search the web
📊 Analyse data
💡 Brainstorm ideas
Ask Memo AI anything…
Production · April 2026

Enterprise AI. Multi-provider architecture.
Built for Memo Fashion.

Memo AI uses a cascade architecture — every message is routed through up to 14 AI engines across seven infrastructure partners. If one engine is at capacity, the next takes over in under 50 milliseconds. You never wait. You never see errors. The system adapts in real time.

0
AI engines
0
API connections
0
Max cascade depth
0ms
Fastest response
How the cascade works
When you send a message, Memo AI tries the best available engine first — like DeepSeek V3.2, a 685-billion parameter model. If that engine is busy, the system automatically switches to the next one (Groq GPT-OSS 120B, then Llama 4 Maverick, then Cerebras Qwen 3 235B...) — up to 14 engines deep. Each switch takes under 50 milliseconds. Multiple connections rotate across providers so no single endpoint is ever overloaded. The result: you always get an answer, always from a capable model, with zero downtime.
🌍
Live exchange rates
🌤
Real-time weather
📦
Container tracking
🧠
Persistent memory
Six specialised models · multi-provider cascade

Pick the brain for the job.

Memo AI routes your question to the right model automatically. Each mode is backed by a cascade of 7–14 AI engines from Groq, SambaNova, Cerebras, Google and OpenRouter — so there's always capacity. Your files stay attached across messages. Generated images are stored permanently. And Memo AI remembers you — your name, your role, your preferences — across every conversation.

Smart
1,000/day · 14-step cascade · 685B MoE
The everyday powerhouse. Polished emails, summaries, deep analysis, brainstorming, translation. Primary engine is DeepSeek V3.2 — a 685-billion parameter model that rivals GPT-4 and Claude Sonnet on reasoning benchmarks. If it's busy, 11 more engines fire automatically.
Cascade (12 engines)SambaNova DeepSeek V3.2 (685B) · Groq GPT-OSS 120B (45ms) · Llama 4 Maverick · DeepSeek V3.1 · Llama 3.3 70B · Cerebras Qwen 3 235B · Qwen 3 32B · OR GPT-OSS 120B :free · Nemotron 120B :free · GLM-4.5 Air :free · Nemotron Nano 30B :free. All free. Connections rotate automatically across all providers.
🧠
Reasoner
500/day · 10-step cascade · deep thinking
For problems that need real thought. Complex code, multi-step maths, legal reasoning, strategy. Uses DeepSeek V3.2 and V3.1 — both trained specifically for chain-of-thought reasoning. Shows its thinking process in a collapsible panel so you can follow the logic.
Cascade (10 engines)SambaNova DeepSeek V3.2 · V3.1 · V3.1-cb · GPT-OSS 120B · Cerebras Qwen 3 235B · Llama 4 Maverick · Llama 3.3 70B · OR GPT-OSS 120B :free · GLM-4.5 Air :free · Nemotron 120B :free. Reasoning traces visible on demand.
🔴
Live
1,000/day · Gemini engines · real-time Google Search
Actually knows what happened today. Current news, weather, stock prices, sports scores, company info. Every answer is grounded in live Google Search results with sources cited at the bottom. Multiple Gemini connections across 3 models ensure consistent availability.
Cascade (4 engines)Gemini 2.5 Flash + Google Search · Gemini 2.5 Flash Lite + Search · Gemini 3 Flash Preview + Search · Groq GPT-OSS 120B + Tavily web search. Gemini engines + Tavily search = ~15,000 searches/day total capacity.
Fast
5,000/day · 41ms first token · 9-step cascade
Instant answers. Quick lookups, one-liner rewrites, “what's the word for…”. First token arrives in 41 milliseconds on Groq GPT-OSS 20B — faster than you can blink. Practically unlimited: 1,500 messages per person per day.
Cascade (7 engines)Groq GPT-OSS 20B (41ms) · Groq Llama 3.1 8B (49ms) · Cerebras Llama 3.1 8B (2,000 tok/sec on WSE-3 chip) · OR Nemotron Nano 9B :free · Liquid LFM 2.5 Thinking :free · GPT-OSS 20B :free · Gemma 3 4B :free.
💻
Coder
800/day · 9-step cascade · DeepSeek V3.2
Built for programming. Generates clean TypeScript, Python, SQL, HTML/CSS. Explains patterns, spots bugs, writes tests, refactors legacy code. Primary engine is DeepSeek V3.2 — the same model that topped the SWE-bench coding leaderboard.
Cascade (8 engines)SambaNova DeepSeek V3.2 · Groq GPT-OSS 120B · Cerebras Qwen 3 235B · DeepSeek V3.1 · Qwen 3 32B · OR GLM-4.5 Air :free · GPT-OSS 120B :free · Qwen3 Coder :free.
👁
Vision
500/day · auto-activates · image editing
Reads screenshots, photos, receipts, handwriting, diagrams, charts. Switches on automatically when you attach an image. Plus: paste any image and say “change to navy velvet” — FLUX.2 klein edits it with instruction-following AI, not random noise. Verified: colour changes actually work.
Cascade (6 engines)Groq Llama-4-Scout 17B (39ms) · Gemini 2.5 Flash · Google Gemma 4 31B :free · Gemma 4 26B :free · Nemotron Nano VL :free · Gemma 3 27B :free. Image editing: Cloudflare FLUX.2 klein 9B/4B across 4 free accounts.
Capabilities · production-ready

Everything you need. Built in.

Every feature below is live right now. Sign in with your @memo.co.uk email and start using them immediately.

✍️
Professional Writing
emails · letters · reports
Write polished professional emails in seconds. Set the tone — formal, friendly, concise. Complete documents ready to send, not drafts with placeholders. Powered by DeepSeek V3.2 (685B params, outscores GPT-4o on writing benchmarks).
🔍
Live Web Search
news · prices · events · real-time
Ask about anything happening right now. UK minimum wage, weather in Milan, GBP to EUR rate, latest fashion week news. Grounded in live Google Search via multiple rotating Gemini connections. Sources cited at the bottom of every answer.
💱
Live Exchange Rates
ECB data · 13 currencies · instant
“Convert 5000 GBP to EUR” → instant answer with live European Central Bank rates. Supports GBP, USD, EUR, INR, CNY, JPY, AED, HKD, AUD, CAD, CHF, SGD, TRY. Powered by the European Central Bank.
🌤
Weather Anywhere
global · 3-day forecast · auto-geocode
“Weather in London” or “do I need an umbrella in Milan?” — current conditions, temperature, humidity, wind, UV index, plus a 3-day forecast. Any city in the world. Powered by Open-Meteo.
📦
Container Tracking
Maersk · MSC · CMA CGM · 6 more
“Where is my container MSCU1234567?” — generates instant tracking links for 9 major shipping lines: Maersk, MSC, CMA CGM, Hapag-Lloyd, COSCO, Evergreen, ONE, Yang Ming, ZIM. Built for fashion logistics.
🎨
Image Gen & Editing
FLUX.2 klein 9B · ~1.5s per image
Generate images from text: runway shots, tech flats, moodboards, product photography. Edit existing images with natural language — “change to emerald green” actually changes the colour (verified by a second AI model). Powered by Black Forest Labs FLUX.2 klein 9B, hosted on Cloudflare Workers AI.
📎
Document Analysis
PDF · Excel · Word · 10 files
Upload PDFs, spreadsheets, Word docs — up to 10 at once. Memo AI reads them, extracts the text, and answers questions about the content. Invoices, contracts, specs. Files stay attached across messages. Stored securely in Cloudflare R2.
🧠
Memory Across Chats
remembers you · 50 chats · auto-extract
Memo AI learns about you — your name, your department, your manager, your writing preferences. Facts are extracted automatically after each conversation and injected into every future chat. 50 conversations saved per person. The same memory capability as ChatGPT and Claude — built in.
Infrastructure · multi-provider architecture

The engine room.

Seven AI infrastructure partners. Multiple API connections. If one provider is at capacity, the cascade switches to the next in under 50 milliseconds. Staff never see an error — just a different engine label.

🟠
Groq
GPT-OSS 120B · Llama 3.3 · Qwen 3
Primary chat provider. Multiple connections rotating. GPT-OSS 120B (45ms), GPT-OSS 20B (41ms), Llama 3.3 70B, Qwen 3 32B, Llama-4 Scout (vision). Custom LPU chips — fastest inference hardware commercially available.
🟣
SambaNova
DeepSeek V3.2 · V3.1 · Llama 4
Frontier reasoning. DeepSeek V3.2 (685B MoE) is the primary Smart, Reasoner and Coder engine — outscores GPT-4o on MATH-500 (90.2% vs 76.6%) and HumanEval (92.7% vs 90.2%). Free tier, generous daily limits.
🔵
Cerebras
Qwen 3 235B · Llama 8B
World's fastest AI chip (WSE-3 wafer-scale). 2,000 tokens/sec on Llama 3.1 8B. Qwen 3 235B for heavy reasoning. Millions of tokens per day capacity.
🔷
Google Gemini
2.5 Flash · 3 Flash · grounded search
Live mode backbone. Multiple connections across Google Cloud projects. Gemini 2.5 Flash + Flash Lite + Gemini 3 Flash Preview. Google Search grounding built in — every Live answer is backed by real-time web results with cited sources.
🟢
OpenRouter
17 models · deep cascade fallback
Last-resort cascade. 17 verified working :free models including GPT-OSS 120B, Nemotron 120B/30B/9B, GLM-4.5 Air, Gemma 4 31B/26B, Gemma 3 27B/12B, Arcee Trinity, Liquid LFM. If every other provider fails, OpenRouter catches it.
🟡
Cloudflare
4 accounts · FLUX.2 klein · R2 storage
Image generation (FLUX.2 klein 9B & 4B), image editing, and persistent file storage (R2). 4 Cloudflare accounts provide ~400 images/day capacity. Also hosts 30+ text models as additional cascade backup.
🔴
Tavily
Structured web search · research-grade results
Fallback web search when Gemini is at capacity. Returns structured search results with titles, snippets, and URLs. Used by Live mode when Gemini fails, and by Smart mode for search-intent queries.