After several delays, OpenAI has finally introduced a new voice mode in ChatGPT. The company has begun rolling it out to a small number of ChatGPT Plus subscribers. Powered by the latest GPT-4o AI model, OpenAI's advanced Voice Mode offers features such as real-time responses, a natural-sounding voice, and the ability to sense the user's emotions.
The advanced Voice Mode was originally due to be released as an alpha build in June, but OpenAI delayed the rollout by a month. The company said the new Voice Mode will let users interrupt the chatbot at any time and will offer more natural interaction through voice modulation. OpenAI also shared a short video on X showing how to turn on the feature once it becomes active.
OpenAI’s new Voice Mode: Here’s how it works
The advanced Voice Mode operates through a sophisticated AI model in which the user's voice input is converted into text using speech recognition technology. This text is then processed by ChatGPT's language model to generate a suitable response. The generated text is then converted into a female voice using a text-to-speech model.
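For readers who want to picture the three stages described above, here is a minimal sketch in Python. It is illustrative only: the class `VoicePipeline` and the stub functions `fake_transcribe`, `fake_generate_reply` and `fake_synthesize` are hypothetical names made up for this example, and nothing here reflects OpenAI's actual code or API.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains the three stages: speech-to-text, language model, text-to-speech."""

    transcribe: Callable[[bytes], str]      # audio in -> text (hypothetical stage)
    generate_reply: Callable[[str], str]    # text in -> reply text (hypothetical stage)
    synthesize: Callable[[str], bytes]      # reply text -> audio out (hypothetical stage)

    def respond(self, audio_in: bytes) -> bytes:
        text_in = self.transcribe(audio_in)
        text_out = self.generate_reply(text_in)
        return self.synthesize(text_out)


# Stand-in stages so the sketch runs without any real model.
# Here "audio" is simply UTF-8 bytes, which is an assumption for illustration.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_generate_reply(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_transcribe, fake_generate_reply, fake_synthesize)
reply_audio = pipeline.respond(b"hello")
```

In a real system each stub would be replaced by a call to an actual speech recognition, language, or speech synthesis model; the point of the sketch is only the order in which the stages run.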
The advanced Voice Mode is currently being tested with a small group of ChatGPT Plus users. OpenAI said users selected to receive the mode will be sent an email with instructions and a message in their mobile app. OpenAI plans to add more people on a rolling basis, and all Plus users will receive the Voice Mode by the end of this year.