Six specialised models · multi-provider cascade
Pick the brain for the job.
Memo AI routes your question to the right model automatically. Each mode is backed by a cascade of 7–14 AI engines from Groq, SambaNova, Cerebras, Google and OpenRouter — so there's always capacity. Your files stay attached across messages. Generated images are stored permanently. And Memo AI remembers you — your name, your role, your preferences — across every conversation.
✨
Smart
1,000/day · 11-step cascade · 685B MoE
The everyday powerhouse. Polished emails, summaries, deep analysis, brainstorming, translation. Primary engine is SambaNova DeepSeek V3.2 — a 685-billion-parameter mixture-of-experts model that rivals GPT-4 on reasoning benchmarks. If busy, 10 more engines fire automatically.
Cascade (11 engines)SambaNova DeepSeek V3.2 (685B) · Groq GPT-OSS 120B · SambaNova Llama 4 Maverick · SambaNova DeepSeek V3.1 · Cerebras Qwen 3 235B · Groq Llama 3.3 70B · plus 5 OpenRouter :free models. Connections rotate automatically.
🧠
Reasoner
500/day · 9-step cascade · deep thinking
For problems that need real thought. Complex code, multi-step maths, legal reasoning, strategy. DeepSeek V3.2 leads — the same model that tops SWE-bench reasoning benchmarks. Shows its thinking process in a collapsible panel so you can follow the logic.
Cascade (9 engines)SambaNova DeepSeek V3.2 · DeepSeek V3.1 · DeepSeek V3.1-cb · Groq GPT-OSS 120B · Cerebras Qwen 3 235B · Llama 4 Maverick · Groq Llama 3.3 70B · plus 2 OpenRouter :free. Reasoning traces visible on demand.
🔴
Live
1,000/day · Gemini engines · real-time Google Search
Actually knows what happened today. Current news, weather, stock prices, sports scores, company info. Every answer is grounded in live Google Search results with sources cited at the bottom. Multiple Gemini connections across 3 models ensure consistent availability.
Cascade (4 engines)Gemini 2.5 Flash + Google Search · Gemini 2.5 Flash Lite + Search · Gemini 3 Flash Preview + Search · Groq GPT-OSS 120B + Tavily web search. Gemini engines + Tavily search = ~15,000 searches/day total capacity.
⚡
Fast
5,000/day · 2,000 tok/sec · 9-step cascade
Instant answers. Quick lookups, one-liner rewrites, “what's the word for…”. Cerebras Llama 3.1 8B runs at 2,000 tokens per second on WSE-3 hardware — the fastest public inference anywhere. Practically unlimited.
Cascade (9 engines)Cerebras Llama 3.1 8B (2,000 tok/sec) · Groq GPT-OSS 20B (41ms) · Groq Llama 3.1 8B · OR Nemotron Nano 9B :free · Liquid LFM 2.5 Thinking :free · GPT-OSS 20B :free · Gemma 3 4B · Gemma 3n 4B · Gemma 3n 2B.
💻
Coder
800/day · 9-step cascade · DeepSeek V3.2
Built for programming. Generates clean TypeScript, Python, SQL, HTML/CSS. Explains patterns, spots bugs, writes tests, refactors legacy code. DeepSeek V3.2 leads — the same model that topped the SWE-bench coding leaderboard.
Cascade (9 engines)SambaNova DeepSeek V3.2 · Groq GPT-OSS 120B · Cerebras Qwen 3 235B · DeepSeek V3.1 · Groq Qwen 3 32B · plus 4 OpenRouter :free (GLM-4.5 Air, GPT-OSS 120B, Qwen3 Coder, Arcee Trinity).
👁
Vision
500/day · auto-activates · image editing
Reads screenshots, photos, receipts, handwriting, diagrams, charts. Switches on automatically when you attach an image. Plus: paste any image and say “change to navy velvet” — FLUX.2 klein edits it with instruction-following AI, not random noise. Verified: colour changes actually work.
Cascade (8 engines)SambaNova Llama 4 Maverick · Gemini 2.5 Flash · Gemini 2.5 Flash Lite · Groq Llama-4-Scout 17B · Gemma 4 31B :free · Gemma 4 26B :free · Nemotron Nano VL :free · Gemma 3 27B :free. Image editing: Cloudflare FLUX.2 klein 9B/4B across 4 free accounts.