Streaming Transcription
Convert audio to text in real-time using WebSocket connections. Perfect for voice agents and live applications.Quick Start
Use our optimized streaming modelfireworks-asr-large
for the best real-time performance.
For a working example of streaming transcription see the following resources:
For more detailed information, see the full streaming API documentation and the source code
Pre-recorded Transcription
Convert audio files to text. Supports files up to 1GB in formats like MP3, FLAC, and WAV. Transcribe multiple hours of audio in minutes.Quick Start
For a working example of pre-recorded transcription see the Python notebook Available Models:whisper-v3
: Highest accuracy- model=
whisper-v3
- base_url=
https://audio-prod.us-virginia-1.direct.fireworks.ai
- model=
whisper-v3-turbo
: Faster processing- model=
whisper-v3-turbo
- base_url=
https://audio-turbo.us-virginia-1.direct.fireworks.ai
- model=
Pre-recorded Translation
Translate audio from any of our supported languages to English. Supports files up to 1GB in formats like MP3, FLAC, and WAV.Quick Start
Supported Languages
We support 95+ languages including English, Spanish, French, German, Chinese, Japanese, Russian, Portuguese, and many more. See the complete language list.Common Use Cases
- Call Center / Customer Service: Transcribe or translate customer calls
- Note Taking: Transcribe audio for automated note taking
- Voice Agents: Use streaming transcription to create voice assistants, for a seamless voice-agent experience checkout our Voice Agent Platform
Next Steps
- Explore advanced features like speaker diarization and custom prompts
- Checkout our voice agent platform Voice Agent Platform
- Contact us at inquiries@fireworks.ai for dedicated endpoints and enterprise features