Gemini 3.5 Flash

Model Information

Display Name: Gemini 3.5 Flash

API Model ID: google/gemini-3.5-flash

Category: Image To Text

Description: Gemini 3.5 Flash is Google's latest and most capable Flash-tier model, released at Google I/O 2026. It delivers near-Pro-level intelligence at Flash-tier cost with 4x faster output than competing frontier models. **Key Features:** - 1M token context window (1,048,576 tokens) - Up to 65K output tokens - Vision: text, image, audio, video, PDF input - Native function calling and structured outputs - Thinking/reasoning mode with configurable levels - Code execution and grounding with Google Search - Agentic workflows and MCP tool use **Capabilities:** - Complex coding and software engineering - Advanced reasoning and analysis - Multimodal understanding (text, images, video, audio) - Long document processing (1M context) - Function/tool calling and orchestration - Structured data extraction - Real-time search grounding - Autonomous agent execution **Best For:** - Complex coding and agentic tasks - Multimodal applications at scale - High-throughput production workloads - Long-context document analysis - Tasks requiring reasoning at lower cost **Technical Specs:** - Model ID: gemini-3.5-flash - Context Window: 1,048,576 tokens (1M) - Max Output: 65,536 tokens - Modalities: Text, image, audio, video, PDF input - API: Google AI (generativelanguage.googleapis.com) - Thinking Mode: Supported (configurable levels) - Tool Use: Native function calling - Knowledge Cutoff: January 2026

Context Window: 1,048,576 tokens

Max Output: 65,536 tokens

How to Use This Model

To use Gemini 3.5 Flash via the HInow.ai API, use the model ID: google/gemini-3.5-flash

API Request Example (Chat/Text)


POST https://api.hinow.ai/v1/chat/completions
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json

{
  "model": "google/gemini-3.5-flash",
  "messages": [
    {"role": "user", "content": "Your message here"}
  ]
}
              

API Request Example (Image Generation)


POST https://api.hinow.ai/v1/images
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json

{
  "model": "google/gemini-3.5-flash",
  "prompt": "Your image description here"
}
              

Pricing

  • input: $1.65
  • output: $9.90

Available Parameters

  • temperature: Controls randomness (0-2). Default: 1 (Options: 0, 0.3, 0.5, 0.7, 1.0, 1.5, 2.0)
  • top_p: Nucleus sampling (0-1). Default: 0.95 (Options: 0.1, 0.5, 0.7, 0.9, 0.95, 1.0)
  • max_tokens: Max tokens to generate (1-65536) (Options: 256, 512, 1024, 2048, 4096, 8192, 16384, 32768, 65536)
  • response_format: Output format (Options: text, json_object)

Quick Reference

To use this model, set: "model": "google/gemini-3.5-flash"

Featured: Yes

Documentation: https://hinow.ai/models/google/gemini-3.5-flash

API Endpoint: https://api.hinow.ai/v1

Back to Models

Gemini 3.5 Flash

Featured

google/gemini-3.5-flash

$1.65 / $9.90
per 1M tokens (in/out)

About

Gemini 3.5 Flash is Google's latest and most capable Flash-tier model, released at Google I/O 2026. It delivers near-Pro-level intelligence at Flash-tier cost with 4x faster output than competing frontier models.

Key Features:

  • 1M token context window (1,048,576 tokens)
  • Up to 65K output tokens
  • Vision: text, image, audio, video, PDF input
  • Native function calling and structured outputs
  • Thinking/reasoning mode with configurable levels
  • Code execution and grounding with Google Search
  • Agentic workflows and MCP tool use

Capabilities:

  • Complex coding and software engineering
  • Advanced reasoning and analysis
  • Multimodal understanding (text, images, video, audio)
  • Long document processing (1M context)
  • Function/tool calling and orchestration
  • Structured data extraction
  • Real-time search grounding
  • Autonomous agent execution

Best For:

  • Complex coding and agentic tasks
  • Multimodal applications at scale
  • High-throughput production workloads
  • Long-context document analysis
  • Tasks requiring reasoning at lower cost

Technical Specs:

  • Model ID: gemini-3.5-flash
  • Context Window: 1,048,576 tokens (1M)
  • Max Output: 65,536 tokens
  • Modalities: Text, image, audio, video, PDF input
  • API: Google AI (generativelanguage.googleapis.com)
  • Thinking Mode: Supported (configurable levels)
  • Tool Use: Native function calling
  • Knowledge Cutoff: January 2026

Capabilities

Image To TextText To Text
Context1049K tokens
Max Output66K tokens

Parameters

temperature

Controls randomness (0-2). Default: 1

00.30.50.71.01.52.0
top_p

Nucleus sampling (0-1). Default: 0.95

0.10.50.70.90.951.0
max_tokens

Max tokens to generate (1-65536)

2565121024204840968192163843276865536
response_format

Output format

textjson_object

Code Examples

curl -X POST https://api.hinow.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $HINOW_API_KEY" \
  -d '{
    "model": "google/gemini-3.5-flash",
    "messages": [
      {
        "role": "user",
        "content": [
          {"type": "text", "text": "Describe this image"},
          {"type": "image_url", "image_url": {"url": "https://example.com/image.jpg"}}
        ]
      }
    ],
    "parameters": {
      "temperature": "0",
      "top_p": "0.1",
      "max_tokens": "256",
      "response_format": "text"
    }
  }'