Models
All Zen models -- capabilities, pricing, and recommended use cases
Available Models
List Models
GET https://api.hanzo.ai/v1/modelsReturns all available Zen models.
Zen4 Generation (9 models)
The latest generation. Flagship, reasoning, and code models.
Zen3 Generation (5 models)
Multimodal, vision, safety, and embedding models.
Model Selection Guide
By Task
| Task | Recommended Model |
|---|---|
| General chat | zen4 |
| Maximum reasoning | zen4-ultra |
| Deep reasoning (CoT) | zen4-thinking |
| Code generation | zen4-coder |
| Fast code iteration | zen4-coder-flash |
| Premium code accuracy | zen4-coder-pro |
| Image understanding | zen3-vl |
| Multimodal (text+vision+audio) | zen3-omni |
| Content moderation | zen3-guard |
| Text embeddings | zen3-embedding |
| Edge / mobile | zen3-nano |
| Budget-friendly | zen4-mini |
| Extended context docs | zen4-max |
| High capability | zen4-pro |
By Budget
| Budget | Recommended |
|---|---|
| Free tier ($5) | zen4-mini, zen3-nano |
| Low cost | zen4-mini, zen3-vl, zen4-coder-flash |
| Standard | zen4-pro, zen4-coder, zen4-thinking |
| Premium | zen4, zen4-ultra, zen4-coder-pro |
Open Weights
All Zen models are also available as open weights for self-hosting:
- HuggingFace: huggingface.co/zenlm
- Ollama:
ollama run zen4 - Formats: SafeTensors, GGUF, MLX
Cloud API via Hanzo gives you managed infrastructure, usage tracking, and pay-per-token billing without running your own GPUs.