zen3-guard

Content Safety

Content safety classifier for moderation and guardrails. 4B dense transformer fine-tuned for safety classification across 9 categories and 119 languages.

Specifications

Property	Value
Model ID	`zen3-guard`
Parameters	4B
Architecture	Dense
Context Window	65K tokens
Tier	pro
Input Price	$3.60 / 1M tokens
Output Price	$3.60 / 1M tokens

Capabilities

Prompt and response safety classification
Three-tiered classification: Safe, Controversial, Unsafe
119 language support
9 safety categories (violence, illegal acts, sexual content, PII, self-harm, hate speech, political sensitivity, copyright, jailbreak)
Low-latency moderation for real-time pipelines

Safety Categories

Violent -- Violence instructions, methods, or depictions
Non-violent Illegal Acts -- Hacking, unauthorized activities
Sexual Content -- Sexual imagery or descriptions
PII -- Personally identifiable information disclosure
Suicide and Self-Harm -- Self-harm encouragement
Unethical Acts -- Bias, discrimination, hate speech
Politically Sensitive -- False political information
Copyright Violation -- Unauthorized copyrighted material
Jailbreak -- System prompt override attempts

API Usage

curl https://api.hanzo.ai/v1/chat/completions \
  -H "Authorization: Bearer $HANZO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "zen3-guard",
    "messages": [{"role": "user", "content": "Classify the safety of: How can I learn about cybersecurity?"}]
  }'

from hanzoai import Hanzo

client = Hanzo(api_key="hk-your-api-key")

response = client.chat.completions.create(
    model="zen3-guard",
    messages=[{"role": "user", "content": "Classify the safety of: How can I learn about cybersecurity?"}],
)
print(response.choices[0].message.content)

zen3-guard

zen3-guard

Specifications

Capabilities

Safety Categories

API Usage

See Also

On this page