Introduction
GravitexAI audio APIs fall into two groups:
- OpenAI format: /v1/audio/speech (TTS), /v1/audio/transcriptions (STT), /v1/audio/translations, compatible with the OpenAI Audio API.
- Gemini native format: POST /v1beta/models/{model}:generateContent with responseModalities: ["AUDIO"] and speechConfig (e.g. gemini-2.5-flash-preview-tts).
Base URL: https://api.gravitex.ai. For Gemini auth, see Gemini native format.
Authentication
Bearer Token, e.g. Bearer sk-xxxxxxxxxx (OpenAI and Gemini)
Optional for Gemini: x-goog-api-key: sk-xxxxxxxxxx
Request examples
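As a starting point, the two authentication options described above can be sketched as a small helper. This is a minimal sketch: the function name `auth_headers` and its `gemini_key_header` flag are hypothetical, and sending the Bearer token in the `Authorization` header is assumed from standard HTTP practice rather than stated on this page.

```python
def auth_headers(api_key: str, gemini_key_header: bool = False) -> dict:
    """Return HTTP headers carrying the API key.

    Bearer auth is listed for both the OpenAI and Gemini formats;
    x-goog-api-key is the optional alternative for Gemini requests.
    """
    if gemini_key_header:
        # Optional Gemini-style key header
        return {"x-goog-api-key": api_key}
    # Assumption: the Bearer token goes in the standard Authorization header
    return {"Authorization": f"Bearer {api_key}"}
```

Either dictionary can be merged into the headers of any HTTP client call; the key value itself (`sk-xxxxxxxxxx`) is a placeholder.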
POST /v1/audio/speech
Common parameters
OpenAI format
Speech
- model: e.g. tts-1, tts-1-hd
- input: Text to speak (max 4096 chars)
- voice: alloy, echo, fable, onyx, nova, shimmer
Transcription and translation
- file: Audio file (multipart)
- model: e.g. whisper-1
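The OpenAI-format parameters above might be assembled into requests as follows. This is a sketch using only the Python standard library, not an official client: the helper names, the `endpoint` argument, and the `application/octet-stream` part type are illustrative assumptions, and the requests are built but not sent.

```python
import json
import urllib.request
import uuid

BASE_URL = "https://api.gravitex.ai"
MAX_INPUT_CHARS = 4096  # input limit listed in the parameters above


def build_speech_request(api_key, text, model="tts-1", voice="alloy"):
    """Build (but do not send) a POST /v1/audio/speech request."""
    if len(text) > MAX_INPUT_CHARS:
        raise ValueError(f"input exceeds {MAX_INPUT_CHARS} characters")
    body = json.dumps({"model": model, "input": text, "voice": voice})
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/speech",
        data=body.encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def build_transcription_request(api_key, filename, audio_bytes,
                                model="whisper-1", endpoint="transcriptions"):
    """Build a multipart request for /v1/audio/transcriptions or /translations."""
    boundary = uuid.uuid4().hex
    crlf = "\r\n"
    # Two form fields: the model name, then the audio file itself
    head = (
        f"--{boundary}{crlf}"
        f'Content-Disposition: form-data; name="model"{crlf}{crlf}'
        f"{model}{crlf}"
        f"--{boundary}{crlf}"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"{crlf}'
        f"Content-Type: application/octet-stream{crlf}{crlf}"
    ).encode("utf-8")
    tail = f"{crlf}--{boundary}--{crlf}".encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/v1/audio/{endpoint}",
        data=head + audio_bytes + tail,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )
```

Passing either request to `urllib.request.urlopen` would send it. For speech, the response body is expected to be audio bytes, as with the OpenAI Audio API; for translations, pass `endpoint="translations"`.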
Gemini format (TTS)
- model (path): e.g. gemini-2.5-flash-preview-tts, gemini-2.5-pro-preview-tts
- contents[].parts[].text: Text or style instructions
- generationConfig.responseModalities: must include "AUDIO"
- generationConfig.speechConfig.voiceConfig.prebuiltVoiceConfig.voiceName: e.g. Kore, Puck, Charon
Gemini audio is produced only via generateContent, not /v1/audio/*. See Gemini native format for full parameters.
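The Gemini-format fields listed above nest into a single JSON body. The sketch below shows one way to build such a request with the Python standard library; the helper name `build_gemini_tts_request` and its defaults are hypothetical, Bearer auth is assumed to go in the `Authorization` header, and the request is constructed without being sent.

```python
import json
import urllib.request

BASE_URL = "https://api.gravitex.ai"


def build_gemini_tts_request(api_key, text,
                             model="gemini-2.5-flash-preview-tts",
                             voice_name="Kore"):
    """Build (but do not send) a generateContent request that asks for audio."""
    payload = {
        # contents[].parts[].text: the text to speak, or style instructions
        "contents": [{"parts": [{"text": text}]}],
        "generationConfig": {
            # Must include "AUDIO" for speech output
            "responseModalities": ["AUDIO"],
            "speechConfig": {
                "voiceConfig": {
                    "prebuiltVoiceConfig": {"voiceName": voice_name}
                }
            },
        },
    }
    return urllib.request.Request(
        # The model name is part of the path, not the body
        f"{BASE_URL}/v1beta/models/{model}:generateContent",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

How the audio is encoded in the response is not described on this page, so parsing is left out here; the Gemini native format page is the place to confirm it.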