Six specialised models · multi-provider cascade
Pick the brain for the job.
Memo AI routes your question to the right model automatically. Each mode is backed by a cascade of 7–14 AI engines from Groq, SambaNova, Cerebras, Google and OpenRouter — so there's always capacity. Your files stay attached across messages. Generated images are stored permanently. And Memo AI remembers you — your name, your role, your preferences — across every conversation.
✨
Smart
1,000/day · 14-step cascade · GPT-OSS 120B
The everyday powerhouse. Polished emails, summaries, deep analysis, brainstorming, translation. Primary engine is Cerebras GPT-OSS 120B — the fastest inference anywhere at ~2,000 tokens/sec, same quality as OpenAI's flagship. If it's busy, 13 more engines fire automatically.
Cascade (14 engines)Cerebras GPT-OSS 120B · SambaNova DeepSeek V3.2 (685B) · Cerebras GLM-4.7 (new) · SambaNova MiniMax M2.5 (new) · Groq GPT-OSS 120B · Llama 4 Maverick · DeepSeek V3.1 · Cerebras Qwen 3 235B · Llama 3.3 70B · plus 5 OpenRouter free models. Connections rotate automatically.
🧠
Reasoner
500/day · 12-step cascade · deep thinking
For problems that need real thought. Complex code, multi-step maths, legal reasoning, strategy. DeepSeek V3.2 leads, backed by the brand-new Cerebras GLM-4.7 and SambaNova MiniMax M2.5. Shows its thinking process in a collapsible panel so you can follow the logic.
Cascade (12 engines)SambaNova DeepSeek V3.2 · Cerebras GLM-4.7 · DeepSeek V3.1 · Cerebras GPT-OSS 120B · SambaNova MiniMax M2.5 · DeepSeek V3.1-cb · Qwen 3 235B · Llama 4 Maverick · Groq GPT-OSS 120B · Llama 3.3 70B · plus 2 OpenRouter :free. Reasoning traces visible on demand.
🔴
Live
1,000/day · Gemini engines · real-time Google Search
Actually knows what happened today. Current news, weather, stock prices, sports scores, company info. Every answer is grounded in live Google Search results with sources cited at the bottom. Multiple Gemini connections across 3 models ensure consistent availability.
Cascade (4 engines)Gemini 2.5 Flash + Google Search · Gemini 2.5 Flash Lite + Search · Gemini 3 Flash Preview + Search · Groq GPT-OSS 120B + Tavily web search. Gemini engines + Tavily search = ~15,000 searches/day total capacity.
⚡
Fast
5,000/day · 2,000 tok/sec · 9-step cascade
Instant answers. Quick lookups, one-liner rewrites, “what's the word for…”. Cerebras Llama 3.1 8B runs at 2,000 tokens per second on WSE-3 hardware — the fastest public inference anywhere. Practically unlimited.
Cascade (9 engines)Cerebras Llama 3.1 8B (2,000 tok/sec) · Groq GPT-OSS 20B (41ms) · Groq Llama 3.1 8B · OR Nemotron Nano 9B :free · Liquid LFM 2.5 Thinking :free · GPT-OSS 20B :free · Gemma 3 4B · Gemma 3n 4B · Gemma 3n 2B.
💻
Coder
800/day · 12-step cascade · DeepSeek V3.2
Built for programming. Generates clean TypeScript, Python, SQL, HTML/CSS. Explains patterns, spots bugs, writes tests, refactors legacy code. DeepSeek V3.2 leads, supported now by GLM-4.7 and MiniMax M2.5.
Cascade (12 engines)SambaNova DeepSeek V3.2 · Cerebras GPT-OSS 120B · Cerebras GLM-4.7 · Qwen 3 235B · DeepSeek V3.1 · MiniMax M2.5 · Groq GPT-OSS 120B · Qwen 3 32B · plus 4 OpenRouter :free (GLM-4.5 Air, GPT-OSS 120B, Qwen3 Coder, Arcee Trinity).
👁
Vision
500/day · auto-activates · image editing
Reads screenshots, photos, receipts, handwriting, diagrams, charts. Switches on automatically when you attach an image. Plus: paste any image and say “change to navy velvet” — FLUX.2 klein edits it with instruction-following AI, not random noise. Verified: colour changes actually work.
Cascade (6 engines)Groq Llama-4-Scout 17B (39ms) · Gemini 2.5 Flash · Google Gemma 4 31B :free · Gemma 4 26B :free · Nemotron Nano VL :free · Gemma 3 27B :free. Image editing: Cloudflare FLUX.2 klein 9B/4B across 4 free accounts.