Introduction

GravitexAI audio APIs fall into two groups:
  • OpenAI format: /v1/audio/speech (text-to-speech, TTS), /v1/audio/transcriptions (speech-to-text, STT), and /v1/audio/translations. These endpoints are compatible with the OpenAI Audio API.
  • Gemini native format: POST /v1beta/models/{model}:generateContent with responseModalities: ["AUDIO"] and speechConfig (e.g. gemini-2.5-flash-preview-tts).
Base URL: https://api.gravitex.ai. For Gemini auth, see Gemini native format.

Authentication

  • Authorization (string, required): Bearer token, e.g. Bearer sk-xxxxxxxxxx. Used for both OpenAI-format and Gemini-format requests.
  • x-goog-api-key (string, optional): Gemini-format requests only, e.g. x-goog-api-key: sk-xxxxxxxxxx
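
The two headers above can be assembled with a small helper. This is a minimal sketch, not an official client: the function name auth_headers and its include_goog_header flag are hypothetical, and it always sends Authorization because that header is the one marked required.

```python
def auth_headers(api_key: str, include_goog_header: bool = False) -> dict[str, str]:
    """Build the auth headers described above (hypothetical helper)."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    # Authorization is documented as required for both request formats.
    headers = {"Authorization": f"Bearer {api_key}"}
    if include_goog_header:
        # Documented as optional, and only for Gemini-format requests.
        headers["x-goog-api-key"] = api_key
    return headers
```

For OpenAI-format calls, auth_headers("sk-xxxxxxxxxx") is enough; pass include_goog_header=True only when calling the Gemini-format endpoint.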

Request examples

POST /v1/audio/speech
curl -X POST "https://api.gravitex.ai/v1/audio/speech" \
  -H "Authorization: Bearer sk-xxxxxxxxxx" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tts-1",
    "input": "Hello from GravitexAI.",
    "voice": "alloy",
    "response_format": "mp3"
  }' \
  --output speech.mp3
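
The same request can be sketched in Python with only the standard library. The helper names build_speech_request and synthesize_to_file are hypothetical, and the sketch assumes, as the curl example's --output flag suggests, that the response body is the raw audio.

```python
import json
import urllib.request

BASE_URL = "https://api.gravitex.ai"


def build_speech_request(api_key, text, voice="alloy", model="tts-1",
                         response_format="mp3"):
    """Build (but do not send) a POST /v1/audio/speech request."""
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize_to_file(api_key, text, path="speech.mp3"):
    """Send the request and write the response body to `path`.

    Assumes the response body is the audio itself, mirroring
    `curl --output speech.mp3`.
    """
    request = build_speech_request(api_key, text)
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        out.write(response.read())
```

Keeping request construction separate from sending makes the payload easy to inspect before any network call is made.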

Common parameters

OpenAI format

Speech
  • model: e.g. tts-1, tts-1-hd
  • input: Text to speak (max 4096 chars)
  • voice: alloy, echo, fable, onyx, nova, shimmer
Transcriptions / translations
  • file: Audio file (multipart)
  • model: e.g. whisper-1
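
Because transcriptions and translations take the audio as a multipart upload, the request body differs from the JSON used for speech. The following is a minimal standard-library sketch of that upload; encode_multipart and build_transcription_request are hypothetical names, and the response format is not covered in this section, so the sketch only builds the request.

```python
import urllib.request
import uuid

BASE_URL = "https://api.gravitex.ai"


def encode_multipart(fields, filename, file_bytes,
                     content_type="application/octet-stream"):
    """Encode text fields plus one `file` part as multipart/form-data."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f'Content-Type: {content_type}\r\n\r\n'.encode("utf-8")
    )
    parts.append(file_bytes)
    parts.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_transcription_request(api_key, filename, file_bytes,
                                model="whisper-1", endpoint="transcriptions"):
    """Build (but do not send) a transcription or translation request."""
    if endpoint not in ("transcriptions", "translations"):
        raise ValueError("endpoint must be 'transcriptions' or 'translations'")
    body, content_type = encode_multipart({"model": model}, filename, file_bytes)
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/{endpoint}",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": content_type,
        },
        method="POST",
    )
```

To send it, pass the returned object to urllib.request.urlopen after reading the audio file's bytes from disk.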

Gemini format (TTS)

  • model (path): e.g. gemini-2.5-flash-preview-tts, gemini-2.5-pro-preview-tts
  • contents[].parts[].text: Text or style instructions
  • generationConfig.responseModalities: must include "AUDIO"
  • generationConfig.speechConfig.voiceConfig.prebuiltVoiceConfig.voiceName: e.g. Kore, Puck, Charon
Gemini audio is produced only via generateContent, not /v1/audio/*. See Gemini native format for full parameters.
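
The Gemini parameters listed above nest as follows. This is a sketch assuming only the fields named in this section; the helper name build_gemini_tts_request is hypothetical, and the response shape is not described here, so consult Gemini native format before decoding the audio.

```python
import json
import urllib.request

BASE_URL = "https://api.gravitex.ai"


def build_gemini_tts_request(api_key, text, voice_name="Kore",
                             model="gemini-2.5-flash-preview-tts"):
    """Build (but do not send) a Gemini-format TTS request."""
    body = {
        # Text to speak, optionally with style instructions.
        "contents": [{"parts": [{"text": text}]}],
        "generationConfig": {
            # Must include "AUDIO" for speech output.
            "responseModalities": ["AUDIO"],
            "speechConfig": {
                "voiceConfig": {
                    "prebuiltVoiceConfig": {"voiceName": voice_name}
                }
            },
        },
    }
    return urllib.request.Request(
        f"{BASE_URL}/v1beta/models/{model}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

The model goes in the URL path rather than the JSON body, which is the main structural difference from the OpenAI-format speech request.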