Claude Haiku 4.5

Model Information

Display Name: Claude Haiku 4.5

API Model ID: anthropic/claude-haiku-4.5

Category: Text To Text

Description: Claude Haiku 4.5 is Anthropic's fastest and most affordable model, optimized for speed and efficiency. Ideal for high-throughput tasks, real-time applications, and cost-sensitive workloads requiring quick responses. **Key Features:** - 200K token context window - Up to 8K output tokens - Fastest response times in the Claude family - Native function/tool calling - JSON mode for structured outputs - Prompt caching for efficiency **Capabilities:** - Fast text generation and chat - Code completion and quick fixes - Data extraction and classification - Document summarization - Simple reasoning tasks - Structured data generation - High-volume batch processing **Best For:** - High-volume, low-latency applications - Chat and conversational AI - Quick summarization and classification - Cost-effective inference at scale - Real-time user-facing features **Technical Specs:** - Model ID: claude-haiku-4-5-20251001 - Context Window: 200,000 tokens - Max Output: 8,192 tokens - Modalities: Text input/output - API: Anthropic Messages API - Extended Thinking: Not supported - Tool Use: Native function calling

Context Window: 200,000 tokens

Max Output: 8,192 tokens

How to Use This Model

To use Claude Haiku 4.5 via the HInow.ai API, use the model ID: anthropic/claude-haiku-4.5

API Request Example (Chat/Text)


POST https://api.hinow.ai/v1/chat/completions
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json

{
  "model": "anthropic/claude-haiku-4.5",
  "messages": [
    {"role": "user", "content": "Your message here"}
  ]
}
              

Pricing

  • input: $1.10
  • output: $5.50

Available Parameters

  • temperature: Controls randomness (0-1). Default: 1 (Options: 0, 0.3, 0.5, 0.7, 1.0)
  • top_p: Nucleus sampling (0-1). Default: 0.9 (Options: 0.1, 0.5, 0.7, 0.9, 0.95, 1.0)
  • max_tokens: Max tokens to generate (1-8192) (Options: 256, 512, 1024, 2048, 4096, 8192)

Quick Reference

To use this model, set: "model": "anthropic/claude-haiku-4.5"

Featured: No

Documentation: https://hinow.ai/models/anthropic/claude-haiku-4.5

API Endpoint: https://api.hinow.ai/v1

Back to Models

Claude Haiku 4.5

anthropic/claude-haiku-4.5

$1.10 / $5.50
per 1M tokens (in/out)

About

Claude Haiku 4.5 is Anthropic's fastest and most affordable model, optimized for speed and efficiency. Ideal for high-throughput tasks, real-time applications, and cost-sensitive workloads requiring quick responses.

Key Features:

  • 200K token context window
  • Up to 8K output tokens
  • Fastest response times in the Claude family
  • Native function/tool calling
  • JSON mode for structured outputs
  • Prompt caching for efficiency

Capabilities:

  • Fast text generation and chat
  • Code completion and quick fixes
  • Data extraction and classification
  • Document summarization
  • Simple reasoning tasks
  • Structured data generation
  • High-volume batch processing

Best For:

  • High-volume, low-latency applications
  • Chat and conversational AI
  • Quick summarization and classification
  • Cost-effective inference at scale
  • Real-time user-facing features

Technical Specs:

  • Model ID: claude-haiku-4-5-20251001
  • Context Window: 200,000 tokens
  • Max Output: 8,192 tokens
  • Modalities: Text input/output
  • API: Anthropic Messages API
  • Extended Thinking: Not supported
  • Tool Use: Native function calling

Capabilities

Text To Text
Context200K tokens
Max Output8K tokens

Parameters

temperature

Controls randomness (0-1). Default: 1

00.30.50.71.0
top_p

Nucleus sampling (0-1). Default: 0.9

0.10.50.70.90.951.0
max_tokens

Max tokens to generate (1-8192)

2565121024204840968192

Code Examples

curl -X POST https://api.hinow.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $HINOW_API_KEY" \
  -d '{
    "model": "anthropic/claude-haiku-4.5",
    "messages": [
      {"role": "user", "content": "Hello! How are you?"}
    ],
    "parameters": {
      "temperature": "0",
      "top_p": "0.1",
      "max_tokens": "256"
    }
  }'