OpenAI Unveils Advanced Voice Models for Seamless Transcription and Speech
OpenAI has announced three new proprietary voice models: gpt-4o-transcribe, gpt-4o-mini-transcribe, and gpt-4o-mini-tts. These models are designed to excel at transcription and speech in noisy environments, with diverse accents and varying speech speeds across 100+ languages. The gpt-4o-transcribe model boasts a 2.46% error rate in English, significantly lower than OpenAI’s two-year-old Whisper open-source text-to-speech model. The … Read more