speech endpoint that implements the following features based on the TTS model:Important Note: You must inform users that the audio they hear is generated by AI, not human voices.
| Format | Characteristics | Use Cases |
|---|---|---|
| MP3 | Default format | General use |
| Opus | Low latency | Web streaming and communication |
| AAC | Efficient compression | Mobile playback |
| FLAC | Lossless compression | Audio archiving |
| WAV | Uncompressed | Low-latency applications |
| PCM | Raw sampling | 24kHz, 16-bit signed |
Note: Current voices are primarily optimized for English.