Documentation Index Fetch the complete documentation index at: https://docs.gravitex.ai/llms.txt
Use this file to discover all available pages before exploring further.
One API. 100+ Models.
GravitexAI is the unified AI model platform built for agents. A single OpenAI SDK–compatible interface routes you to 100+ frontier models — no per-provider plumbing, no subscriptions, pay-as-you-go.
Better prices, higher uptime, no subscriptions. Distributed infrastructure with automatic failover routes around outages, while organization-level data policies give you fine-grained control over which providers can see your data.
⭐ Featured Models — This Week
Model Provider Tokens this week Best for Claude Opus 4.7 ⭐Anthropic 638.4B Production code, complex agents, long-horizon tasks Gemini 3.5 Flash Google 494.0B High-throughput chat, low-latency tool calls GPT-5.5 OpenAI 141.4B Reasoning, writing, all-round coding
For the complete catalog and live pricing, visit the GravitexAI model catalog .
Trusted by the World’s Leading Providers
GravitexAI integrates with the top model providers — all behind the same OpenAI-compatible API:
Anthropic · OpenAI · Google · Qwen · BytePlus (Doubao) · DeepSeek · GLM · MiniMax · MoonShot (Kimi)
Model Catalog
🎭 Anthropic Claude
Model Model ID Context Highlights Recommended For Claude Opus 4.7 ⭐claude-opus-4-7 1M Anthropic’s latest flagship with state-of-the-art coding & agent ability Complex reasoning, production code, agents Claude Opus 4.6 claude-opus-4-6 1M Anthropic’s flagship intelligence model; sets the bar for production code and complex agents Conversation, reasoning, coding, writing Claude Sonnet 4.6 claude-sonnet-4-6 1M Latest Sonnet mainline with strong all-round capabilities Conversation, thinking, writing, coding Claude Sonnet 4.5 claude-sonnet-4-5-20250929 200K Improved tool use, memory and context handling Conversation, thinking, writing, coding Claude Haiku 4.5 claude-haiku-4-5-20251001 200K Anthropic’s most efficient model — near-frontier performance, fast and cheap High-throughput chat, light tasks Claude Opus 4.5 claude-opus-4-5-20251101 200K Previous flagship, stable and reliable Conversation, thinking, writing, coding
🤖 OpenAI
GPT Series
Model Model ID Context Highlights Recommended For GPT-5.5 ⭐gpt-5.5 1M OpenAI’s latest mainline with stronger reasoning and overall capability Conversation, thinking, coding, writing GPT 5.4 gpt-5.4 1M New-generation mainline for complex professional tasks Conversation, thinking, coding, writing GPT 5.4 Pro gpt-5.4-pro 1M High-performance variant for difficult reasoning and engineering Conversation, thinking, coding GPT 5.4 Mini gpt-5.4-mini 1M Balanced performance and cost Conversation, writing, coding GPT 5.4 Nano gpt-5.4-nano 1M Lightweight low-cost variant for high-throughput Conversation, writing GPT 5.3 Chat gpt-5.3-chat 400K Stable general chat model Conversation, writing
Image Generation
Model Model ID Highlights GPT-Image-2 ⭐gpt-image-2 OpenAI’s next-generation image model
🌟 Google Gemini
Model Model ID Context Highlights Recommended For Gemini 3.5 Flash ⭐gemini-3.5-flash 1M Latest Gemini Flash — high throughput, low latency Tool calls, high-concurrency chat Gemini 3.1 Pro Preview gemini-3.1-pro-preview 1M Google’s upgraded core intelligence model for complex tasks Conversation, reasoning, writing, coding Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview 1M High-value fast model for high-throughput workloads Conversation, writing Gemini 3.1 Flash Image Preview gemini-3.1-flash-image-preview 1M Multimodal variant focused on image understanding & generation Vision, text-to-image, multimodal Gemini 3 Pro Image Preview gemini-3-pro-image-preview 1M Gemini image-focused variant Text-to-image, image generation Gemini 3 Flash Preview gemini-3-flash-preview 1M Fast response model for interactive and tool workflows Conversation, writing, coding
🐉 Alibaba Qwen
Model Model ID Context Highlights Recommended For Qwen3.5 Plus ⭐qwen3.5-plus See console Latest Qwen general model with strong all-round capabilities Conversation, reasoning, writing, coding Qwen3.5 Flash qwen3.5-flash See console Fast and low-cost variant for high-throughput Conversation, writing Qwen3 Max qwen3-max See console High-performance variant for complex generation and reasoning Conversation, reasoning, coding Qwen3 Coder Plus qwen3-coder-plus 1M Qwen3-based coding model with strong agent and tool capabilities Coding, tool use Qwen Image Plus qwen-image-plus 32K Qwen image model with strong text rendering Text-to-image, image generation
🌋 BytePlus Doubao Seed
Model Model ID Context Highlights Recommended For Doubao Seed 2.0 Pro ⭐doubao-seed-2-0-pro-260215 See console Latest mainline Doubao with stronger overall capability Conversation, reasoning, coding Doubao Seed 2.0 Code Preview doubao-seed-2-0-code-preview-260215 See console Preview model optimized for coding tasks Coding, tool use Doubao Seed 2.0 Lite doubao-seed-2-0-lite-260215 See console Lightweight model for cost-sensitive usage Conversation, writing Doubao Seed 2.0 Mini doubao-seed-2-0-mini-260215 See console Smaller fast model for low-latency scenarios Conversation, fast response Doubao Seed 1.8 doubao-seed-1-8-251228 See console Stable general model Conversation, writing
🔍 DeepSeek
Model Model ID Context Highlights Recommended For DeepSeek V3.2 251201 ⭐deepseek-v3-2-251201 128K Latest DeepSeek hybrid-reasoning model with stronger overall performance Conversation, thinking, writing, coding DeepSeek V3 250324 deepseek-v3-250324 128K Stable and cost-effective general model General use DeepSeek R1 250528 deepseek-r1-250528 64K Reasoning-focused model for logic tasks Math, reasoning
💎 Zhipu GLM
Model Model ID Context Highlights Recommended For glm-5 glm-5 See console Latest Zhipu chat model Conversation, writing, coding glm-4.7 glm-4.7 200K Latest flagship — stronger coding and multi-step reasoning, 355B params Conversation, long-horizon planning, coding, tool use
✨ MiniMax
Model Model ID Context Highlights Recommended For MiniMax M2 ⭐minimax-m2 See console MiniMax’s latest mainline model with strong all-round performance Conversation, thinking, writing, coding MiniMax Text-01 minimax-text-01 See console Long-context MoE model for complex workloads Long text, complex reasoning abab6.5s abab6.5s See console Cost-effective chat model High-throughput chat, writing
🌙 Moonshot Kimi
Model Model ID Context Highlights Recommended For kimi-k2.5 ⭐kimi-k2.5 256K Moonshot’s most capable model — native multimodal (vision + text), thinking and non-thinking modes Conversation, thinking Kimi K2 250905 kimi-k2-250905 256K MoE architecture, 1T params (32B active), strong coding & agent ability Conversation, thinking, writing, coding
One API for any model Fully OpenAI SDK–compatible. Swap providers without changing a single line of code.
Higher availability Distributed infrastructure with automatic failover routes around outages.
Price & performance Edge infrastructure for minimal latency. Transparent token-level pricing, pay only for what you use.
Custom data policies Fine-grained privacy controls — pick which providers may access your data.
💰 Pricing
Billing
Pay-as-you-go : Billed by actual Token usage
No minimum : Use what you pay for; balance never expires
Real-time billing : Deducted immediately after each call
Price Advantages
Direct upstream routing with competitive pricing vs. provider-direct
Contact us for bulk pricing
🛠️ Usage Guide
Model Selection
Coding
Writing
Fast Response
Image Generation
Long Text
Top performance : Claude Opus 4.7, Claude Opus 4.6, GPT-5.5, Claude Sonnet 4.6
Cost-effective : Claude Haiku 4.5, GPT 5.4 Mini, DeepSeek V3.2, glm-5
Alternative : Gemini 3.5 Flash, Qwen3 Coder Plus, kimi-k2.5, doubao-seed-2-0-code-preview-260215
Primary : GPT-5.5, Claude Opus 4.7, Claude Sonnet 4.6, glm-5
Alternative : Gemini 3.5 Flash, kimi-k2.5, MiniMax M2, doubao-seed-2-0-pro-260215
Primary : Claude Haiku 4.5, Gemini 3.5 Flash, Gemini 3.1 Flash Lite Preview
Alternative : glm-5, GPT 5.4 Nano, doubao-seed-2-0-lite-260215, abab6.5s
Recommended : GPT-Image-2, Qwen Image Plus, Gemini 3.1 Flash Image Preview
Ultra-long context : Gemini 3.5 Flash (1M), Claude Opus 4.7 (1M), GPT-5.5 (1M)
Coding : Claude 4.x series, kimi-k2.5, Qwen3 Coder Plus
Cost Optimization
Tiered usage : Use cheaper models for simple tasks, flagships for complex tasks
Test first : Iterate with smaller models before scaling up
Batch processing : Use Mini / Lite variants for bulk tasks
Cache reuse : Cache repeated query results
API Docs OpenAI SDK usage and detailed API spec
Quick Start Get your API key and ship your first request