Skip to main content

Model selection

Filo Agent lets you switch between different models within the same workflow: use faster, more efficient models for lightweight tasks, and switch to stronger reasoning, longer context, or multimodal models for more complex work.

FiloMail model selection overview

Quick selection

Start with the kind of work you are doing, then pick the model that matches the need.

Task typeModelBest for
Everyday email and Q&AClaude Haiku 4.5Quick summaries, everyday drafts, simple questions
General complex workClaude Sonnet 4.6Complex emails, long documents, cross-tool analysis
Advanced reasoningGPT-5.5 / GPT-5.4Planning, critical review, professional writing, coding help
Long documents, multimodal work, and large projectsGemini / DeepSeek / GLMLong context, images, videos, screenshots, large codebases

Model comparison

Token usage is a relative level. Final usage can vary based on context length, cache hits, and output length.
ModelProviderToken usageBest for
Claude Haiku 4.5
AnthropicLowerFast everyday summaries, short reply drafts, lightweight Q&A, and low-stakes inbox cleanup.
Claude Sonnet 4.6
AnthropicMediumComplex email workflows, document analysis, cross-tool research, and dependable default agent work.
GPT-5.4
OpenAIMediumProfessional writing, structured reasoning, code help, and tasks that need careful judgment.
GPT-5.5
OpenAIHighestHigh-stakes planning, deep code work, product review, and complex multi-step decisions.
DeepSeek V4 Pro
DeepSeekLowestLarge codebases, automation-heavy tasks, technical synthesis, and cost-sensitive long runs.
GLM 5.2
Z.aiLowestEngineering workflows, tool-heavy execution, long-running agent tasks, and structured operations.
Gemini 3.1 Pro Preview
GoogleMediumPDF understanding, image or screenshot analysis, research workflows, and multimodal review.
Gemini 3.5 Flash
GoogleLowestFast long-context reading, media-heavy inputs, broad exploration, and parallel agent work.

How to choose

Use the fastest capable model for routine work, then move up when the task needs deeper reasoning, more context, or multimodal understanding.