OpenAI’s ChatGPT can now see, hear and speak

San Francisco: Sam Altman-run OpenAI on Monday announced that it is rolling out new voice and image features in ChatGPT, allowing the AI chatbot to see, hear and speak.

The capabilities offer a more intuitive interface, allowing users to have voice conversations with ChatGPT or to illustrate their discussions visually, the company said in a statement.

“Voice mode and vision for chatGPT! really worth a try,” Altman posted on X.

The company said it is rolling out voice and images in ChatGPT to Plus and Enterprise users over the next two weeks.

“Voice is coming on iOS and Android (opt-in in your settings) and images will be available on all platforms,” said the Microsoft-backed company.

The new voice capability is powered by a new text-to-speech model, capable of generating human-like audio from just text and a few seconds of sample speech.

“We collaborated with professional voice actors to create each of the voices. We also use Whisper, our open-source speech recognition system, to transcribe your spoken words into text,” said OpenAI.

Image understanding is powered by multimodal GPT-3.5 and GPT-4. These models apply their language reasoning skills to a wide range of images, such as photographs, screenshots, and documents containing both text and images.

The new voice technology opens doors to many creative and accessibility-focused applications.

However, “these capabilities also present new risks, such as the potential for malicious actors to impersonate public figures or commit fraud,” the company noted.

“This is why we are using this technology to power a specific use case — voice chat. Voice chat was created with voice actors we have directly worked with,” it added.

Spotify is using the power of this technology for the pilot of their Voice Translation feature, which helps podcasters expand the reach of their storytelling by translating podcasts into additional languages in the podcasters’ own voices.

“We’ve also taken technical measures to significantly limit ChatGPT’s ability to analyze and make direct statements about people since ChatGPT is not always accurate and these systems should respect individuals’ privacy,” said the company.

With inputs from agencies
