Gemma 3 4B IT

Model Information

Display Name: Gemma 3 4B IT

API Model ID: google/gemma-3-4b-it

Category: Image To Text

Description: Gemma 3 4B is Google's lightweight multimodal model, perfect for fast inference and resource-constrained environments.

Key Features:

  • 128K token context window
  • Multimodal: processes text and images (vision)
  • Function calling and structured outputs (JSON)
  • Support for 140+ languages
  • Optimized for speed and efficiency

Capabilities:

  • Fast text generation and chat
  • Image understanding (OCR, analysis)
  • Code generation
  • Function/tool calling

Best For:

  • Real-time applications
  • Mobile/edge deployment
  • High-throughput scenarios
  • Cost-sensitive workloads

Technical Specs:

  • Parameters: 4 billion
  • Precision: BF16

Context Window: 128,000 tokens

Max Output: 8,192 tokens

How to Use This Model

To use Gemma 3 4B IT via the HInow.ai API, use the model ID: google/gemma-3-4b-it

API Request Example (Chat/Text)


POST https://api.hinow.ai/v1/chat/completions
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json

{
  "model": "google/gemma-3-4b-it",
  "messages": [
    {"role": "user", "content": "Your message here"}
  ]
}
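
The request above can be sketched in Python with only the standard library. This is a minimal illustration, not an official client: the helper name `build_chat_request` is hypothetical, and because the response format is not described on this page, the sketch only builds the request and leaves sending it to the caller.

```python
import json
import urllib.request

API_URL = "https://api.hinow.ai/v1/chat/completions"
MODEL_ID = "google/gemma-3-4b-it"


def build_chat_request(api_key: str, user_message: str) -> urllib.request.Request:
    """Build (but do not send) the chat completion POST request shown above."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    request = build_chat_request("YOUR_API_KEY", "Your message here")
    # Sending is left to the caller, e.g. urllib.request.urlopen(request)
    print(request.get_method(), request.full_url)
```

Keeping request construction separate from sending makes it easy to inspect the headers and body before any network call is made.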
              

API Request Example (Image Input)


POST https://api.hinow.ai/v1/responses
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json

{
  "model": "google/gemma-3-4b-it",
  "image_url": "https://example.com/image.jpg",
  "prompt": "Describe this image"
}

Pricing

  • input: $0.03 per 1M tokens
  • output: $0.06 per 1M tokens
  • image: $0.01
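
As a rough illustration of how these prices combine, the sketch below estimates the cost of a single request. It uses the per-1M-token rates quoted on this page for input and output ($0.030 / $0.060). The unit of the image price is not stated, so treating it as a per-image charge is an assumption, and the function name `estimate_cost_usd` is hypothetical.

```python
INPUT_USD_PER_MILLION = 0.03   # input tokens, per 1M tokens
OUTPUT_USD_PER_MILLION = 0.06  # output tokens, per 1M tokens
IMAGE_USD_EACH = 0.01          # assumption: the listed image price is per image


def estimate_cost_usd(input_tokens: int, output_tokens: int, images: int = 0) -> float:
    """Estimate the USD cost of one request from token and image counts."""
    token_cost = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    ) / 1_000_000
    return token_cost + images * IMAGE_USD_EACH


if __name__ == "__main__":
    # 2,000 input tokens, 500 output tokens, and one image
    print(f"{estimate_cost_usd(2_000, 500, images=1):.5f}")  # prints 0.01009
```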

Available Parameters

  • temperature: Controls randomness (0-2). Default: 0.7 (Options: 0, 0.3, 0.5, 0.7, 1.0, 1.5, 2.0)
  • top_p: Nucleus sampling (0-1). Default: 0.9 (Options: 0.1, 0.5, 0.7, 0.9, 0.95, 1.0)
  • max_tokens: Max tokens to generate (1-8192) (Options: 256, 512, 1024, 2048, 4096, 8192)
  • response_format: Output format (Options: text, json_object, json_schema)
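
The ranges listed above can be checked on the client before a request is sent. The following is a minimal sketch under that assumption: `validate_parameters` is a hypothetical helper, not part of any HInow.ai SDK, and it only enforces the limits stated in this list.

```python
ALLOWED_RESPONSE_FORMATS = {"text", "json_object", "json_schema"}


def validate_parameters(params: dict) -> dict:
    """Check params against the documented ranges and return a copy."""
    checked = dict(params)
    if "temperature" in checked and not 0 <= checked["temperature"] <= 2:
        raise ValueError("temperature must be between 0 and 2")
    if "top_p" in checked and not 0 <= checked["top_p"] <= 1:
        raise ValueError("top_p must be between 0 and 1")
    if "max_tokens" in checked:
        max_tokens = checked["max_tokens"]
        if not isinstance(max_tokens, int) or not 1 <= max_tokens <= 8192:
            raise ValueError("max_tokens must be an integer from 1 to 8192")
    if checked.get("response_format", "text") not in ALLOWED_RESPONSE_FORMATS:
        raise ValueError("response_format must be text, json_object, or json_schema")
    return checked


if __name__ == "__main__":
    print(validate_parameters({"temperature": 0.7, "top_p": 0.9, "max_tokens": 1024}))
```

Only keys that are present are checked, so omitted parameters fall back to whatever defaults the API applies.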

Quick Reference

To use this model, set: "model": "google/gemma-3-4b-it"

Featured: Yes

Documentation: https://hinow.ai/models/google/gemma-3-4b-it

API Endpoint: https://api.hinow.ai/v1

Price: $0.030 / $0.060 per 1M tokens (input/output)

Capabilities: Image To Text, Text To Text

Code Examples

curl -X POST https://api.hinow.ai/v1/responses \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $HINOW_API_KEY" \
  -d '{
    "model": "google/gemma-3-4b-it",
    "image_url": "https://example.com/image.jpg",
    "prompt": "Describe this image",
    "parameters": {
      "temperature": "0",
      "top_p": "0.1",
      "max_tokens": "256",
      "response_format": "text"
    }
  }'
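
A Python equivalent of the curl call above might look like the sketch below. It mirrors the same endpoint and JSON fields and reads the key from the `HINOW_API_KEY` environment variable, as the curl example does. The helper name `build_image_request` is hypothetical. This page does not say whether parameter values should be sent as numbers or strings, so the sketch forwards whatever the caller passes, and it builds the request without sending it because the response format is not documented here.

```python
import json
import os
import urllib.request
from typing import Optional

RESPONSES_URL = "https://api.hinow.ai/v1/responses"


def build_image_request(
    image_url: str,
    prompt: str,
    api_key: Optional[str] = None,
    **parameters,
) -> urllib.request.Request:
    """Build (but do not send) the image-description request shown above."""
    key = api_key or os.environ.get("HINOW_API_KEY")
    if not key:
        raise RuntimeError("Set HINOW_API_KEY or pass api_key explicitly")
    payload = {
        "model": "google/gemma-3-4b-it",
        "image_url": image_url,
        "prompt": prompt,
        "parameters": parameters,  # forwarded unchanged
    }
    return urllib.request.Request(
        RESPONSES_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {key}",
        },
        method="POST",
    )
```

A call such as `build_image_request("https://example.com/image.jpg", "Describe this image", max_tokens=256)` returns a request object that can be passed to `urllib.request.urlopen`.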