Speech-to-text, text-to-speech, and audio understanding models.
Models can support multiple endpoints. We group them by model and list every supported API route.