New Delhi: OpenAI, a major player in the field of artificial intelligence located in San Francisco, has introduced a new version of its visual and spoken chatbot. ChatGPT, the chatbot, has been updated to allow both spoken and visual communication between users.
On Monday, a new version of ChatGPT was introduced with two improvements that make it seem more human.
To begin, it now can converse with its users through synthetic voices that are said to be more human-sounding than those of competing digital assistants. There are five distinct voices available, both male and female. Second, it now can interact with user-uploaded photos.
Use your voice to engage in a back-and-forth conversation with ChatGPT. Speak with it on the go, request a bedtime story, or settle a dinner table debate.
Sound on 🔊 pic.twitter.com/3tuWzX0wtS
— OpenAI (@OpenAI) September 25, 2023
OpenAI announced the expanded capabilities of the popular chatbot through a Twitter post and wrote, “ChatGPT can now see, hear, and speak. Rolling out over next two weeks, Plus users will be able to have voice conversations with ChatGPT (iOS & Android) and to include images in conversations (all platforms).”
ChatGPT can now see, hear, and speak. Rolling out over next two weeks, Plus users will be able to have voice conversations with ChatGPT (iOS & Android) and to include images in conversations (all platforms). https://t.co/uNZjgbR5Bm pic.twitter.com/paG0hMshXb
— OpenAI (@OpenAI) September 25, 2023
ChatGPT Voice
ChatGPT now provides five distinct voices from which to choose while responding to queries. OpenAI claims that it used professional voice actors to generate each voice and that it transcribed speech using its own Whisper speech recognition algorithm.
OpenAI says that their new text-to-speech model can generate human-like audio from only text and a few seconds of speech samples, which paves the way for many new creative and accessibility-focused apps like ChatGPT’s new voice features.
Spotify has teamed up with the AI firm to translate podcasts in the podcaster’s voice into other languages.
ChatGPT to See Images
To fuel Image understanding in ChatGPT, OpenAI leverages the multimodal capabilities of GPT-3.5 and GPT-4. ChatGPT users may now include photographs in their queries, allowing them to ask things like “Explore the contents of my fridge so I can plan a dinner” or “Analyze a complicated graph for work-related data.”
How to Use These New Features
ChatGPT users should enable the Voice Feature by going to the app’s “Settings“ menu and selecting “New Features.” The user must then actively engage in voice chats by tapping the headset icon located in the upper right corner of the home screen and then selecting the desired voice.
Users of the ChatGPT app will have to opt into testing the Voice function before they can use it generally. On the other hand, image search will always be enabled.