Speech-to-text API providing accurate audio transcription with speaker diarization and language detection. Best for workflows that convert spoken audio to text — meeting transcription, voice command processing, and audio content indexing. Unlike ElevenLabs (text-to-speech), Soniox handles the inverse: converting speech audio into structured text. 6 endpoints available through Lava’s AI Gateway. See the Soniox API docs for full documentation.Documentation Index
Fetch the complete documentation index at: https://lava.so/docs/llms.txt
Use this file to discover all available pages before exploring further.
Supports both managed (Lava’s API keys) and unmanaged (bring your own credentials) mode.
Endpoints
Create a transcription job
POSThttps://api.soniox.com/v1/transcriptions — 3.50 output / 1M tokens
- SDK
- cURL
Get transcript text
GEThttps://api.soniox.com/v1/transcriptions/{id}/transcript — Free
Get transcription status and results
GEThttps://api.soniox.com/v1/transcriptions/{id} — 3.50 output / 1M tokens
Upload an audio file
POSThttps://api.soniox.com/v1/files — Free
- SDK
- cURL
Get file details
GEThttps://api.soniox.com/v1/files/{id} — Free
List available speech models
GEThttps://api.soniox.com/v1/models — Free
- SDK
- cURL
Next Steps
All Providers
Browse all supported AI providers
Forward Proxy
Learn how to construct proxy URLs and authenticate requests