Gemma 3 4B IT
Model Information
Display Name: Gemma 3 4B IT
API Model ID: google/gemma-3-4b-it
Category: Image To Text
Description: Gemma 3 4B is Google's lightweight multimodal model, perfect for fast inference and resource-constrained environments. **Key Features:** - 128K token context window - Multimodal: processes text and images (vision) - Function calling and structured outputs (JSON) - Support for 140+ languages - Optimized for speed and efficiency **Capabilities:** - Fast text generation and chat - Image understanding (OCR, analysis) - Code generation - Function/tool calling **Best For:** - Real-time applications - Mobile/edge deployment - High-throughput scenarios - Cost-sensitive workloads **Technical Specs:** - Parameters: 4 billion - Precision: BF16
Context Window: 128,000 tokens
Max Output: 8,192 tokens
How to Use This Model
To use Gemma 3 4B IT via the HInow.ai API, use the model ID: google/gemma-3-4b-it
API Request Example (Chat/Text)
POST https://api.hinow.ai/v1/chat/completions
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json
{
"model": "google/gemma-3-4b-it",
"messages": [
{"role": "user", "content": "Your message here"}
]
}
API Request Example (Image Generation)
POST https://api.hinow.ai/v1/images/generations
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json
{
"model": "google/gemma-3-4b-it",
"prompt": "Your image description here"
}
Pricing
- input: $0.03
- output: $0.06
- image: $0.01
Available Parameters
- temperature: Controls randomness (0-2). Default: 0.7 (Options: 0, 0.3, 0.5, 0.7, 1.0, 1.5, 2.0)
- top_p: Nucleus sampling (0-1). Default: 0.9 (Options: 0.1, 0.5, 0.7, 0.9, 0.95, 1.0)
- max_tokens: Max tokens to generate (1-8192) (Options: 256, 512, 1024, 2048, 4096, 8192)
- response_format: Output format (Options: text, json_object, json_schema)
Quick Reference
To use this model, set: "model": "google/gemma-3-4b-it"
Featured: Yes
Documentation: https://hinow.ai/models/google/gemma-3-4b-it
API Endpoint: https://api.hinow.ai/v1


