⚡ Zen LM

Models

All 14 Zen models -- capabilities, pricing, and recommended use cases

Available Models

List Models

GET https://api.hanzo.ai/v1/models

Returns all available Zen models.

Zen4 Generation (9 models)

The latest generation. Flagship, reasoning, and code models.

ModelContextArchitectureInput $/1MOutput $/1M
zen4202K~400B Dense$3.00$9.60
zen4-ultra202K~400B Dense + CoT$3.00$9.60
zen4-pro131K80B (3B active) MoE$2.70$2.70
zen4-max256K1.04T (32B active) MoE$3.60$3.60
zen4-mini40K8B Dense$0.60$0.60
zen4-thinking131K80B (3B active) MoE + CoT$2.70$2.70
zen4-coder262K480B (35B active) MoE$3.60$3.60
zen4-coder-pro262K480B Dense BF16$4.50$4.50
zen4-coder-flash262KDense$1.50$1.50

Zen3 Generation (5 models)

Multimodal, vision, safety, and embedding models.

ModelContextArchitectureInput $/1MOutput $/1M
zen3-omni202K~200B Dense Multimodal$1.80$6.60
zen3-vl131K30B (3B active) MoE VL$0.45$1.80
zen3-nano40K4B Dense$0.30$0.30
zen3-guard40K4B Dense$0.30$0.30
zen3-embedding8KEmbedding (3072 dim)$0.39--

Model Selection Guide

By Task

TaskRecommended Model
General chatzen4
Maximum reasoningzen4-ultra
Deep reasoning (CoT)zen4-thinking
Code generationzen4-coder
Fast code iterationzen4-coder-flash
Premium code accuracyzen4-coder-pro
Image understandingzen3-vl
Multimodal (text+vision+audio)zen3-omni
Content moderationzen3-guard
Text embeddingszen3-embedding
Edge / mobilezen3-nano
Budget-friendlyzen4-mini
Extended context docszen4-max
High capabilityzen4-pro

By Budget

BudgetRecommended
Free tier ($5)zen4-mini, zen3-nano
Low costzen4-mini, zen3-vl, zen4-coder-flash
Standardzen4-pro, zen4-coder, zen4-thinking
Premiumzen4, zen4-ultra, zen4-coder-pro

Open Weights

All Zen models are also available as open weights for self-hosting:

Cloud API via Hanzo gives you managed infrastructure, usage tracking, and pay-per-token billing without running your own GPUs.

On this page