OpenAI announces ChatGPT will soon ‘see, hear, and speak’

Share This Post

ChatGPT will soon offer new features that allow users to engage with it through images and voice recognition, according to an announcement from OpenAI on Sept. 25.

OpenAI announced that users will be able to interact with ChatGPT using voice commands, enabling a more personalized user experience. The company said that this feature is powered by a text-to-speech model that can generate audio from minimal sample speech created by professional voice actors. It said that the feature is also powered by its open-source speech recognition system, Whisper.

The voice features are expected to provide a wider range of use cases, such as assisting in tasks like reading bedtime stories, creating recipes, composing speeches, reciting poems, explaining common phrases, or even resolving “dinner table debates.”

OpenAI added that users will soon be able to provide images to ChatGPT (or select certain parts of images) for interpretation and response.

OpenAI acknowledges risks

OpenAI acknowledged the risk of fraud and impersonation and said that, accordingly, it is limiting voice features to its voice chat platform. It emphasized that it uses professional voice actors not user voices for output audio. OpenAI added that certain other groups are permitted to use voice capabilities for other purposes; Spotify, for example, is translating participating podcasts to new languages in each host’s original voice.

The company noted that image recognition carries privacy risks and said that, in response, it has limited ChatGPT’s ability to make statements about people. It noted that ChatGPT “is not always accurate” but said that general descriptions of images can be useful, citing its earlier work with Be My Eyes, an app for blind and low-vision people.

OpenAI said that it will introduce voice and image features to ChatGPT Plus and Enterprise over the next two weeks. It said that voice features will be available on iOS and Android on an opt-in basis, and that image features will be available on all platforms.

The post OpenAI announces ChatGPT will soon ‘see, hear, and speak’ appeared first on CryptoSlate.

Read Entire Article
spot_img
- Advertisement -spot_img

Related Posts

Race to a Billion Presale Raises Almost $200k: $RACE Token Could Revolutionize Predictive Meme Coin Gaming

The post Race to a Billion Presale Raises Almost $200k: $RACE Token Could Revolutionize Predictive Meme Coin Gaming appeared first on Coinpedia Fintech News Everyday in crypto means a new innovation,

Bitcoin Hits 90K, DOGE Moons on Trump News, Pepe Surges on CEX Listings — Week in Review

Bitcoin hits historic $90k, DOGE skyrockets as Elon Musk takes on Trump’s quest to slash regulations, and PEPE Surges 40% on Robinhood listing in this Week in Review Week in Review The crypto

Polkadot Price Soars 15% In One Day — Here’s Why $7.5 Might Be The Next Target

The cryptocurrency market saw some of its best days over the past week, with several altcoins enjoying the positive climate surrounding the industry at the moment While the top meme coins like

Crypto Staking and Other Crypto Opportunities To Not Miss

The post Crypto Staking and Other Crypto Opportunities To Not Miss appeared first on Coinpedia Fintech News The financial world of 2024 is increasingly embracing the idea of creating smart and easy

Winklevoss Twin Underscores DOGE Initiative To Combat Inflation

The post Winklevoss Twin Underscores DOGE Initiative To Combat Inflation appeared first on Coinpedia Fintech News US President-elect Donald Trump recently announced the creation of a new initiative

McDonald’s Partners With Ethereum NFT Series Doodles To Launch “GM Spread Joy” Event

The post McDonald’s Partners With Ethereum NFT Series Doodles To Launch “GM Spread Joy” Event appeared first on Coinpedia Fintech News McDonald’s announced a brand partnership with the