5/19/2024

Chat GPT Can Now Speak And Sing In Real Time


The AI race has just shifted into high gear, with US artificial intelligence pioneers OpenAI rolling out its new interface that works with audio and vision as well as text. The new model, called GPT-4o, has gone beyond the familiar chat-bot features and is capable of real-time, near-natural voice conversations. The developer OpenAI will also make it available to free users.

ChatGPT was already able to talk to users, but with long pauses to process the data. It often seemed a bit sluggish. This was because the feature required three internal applications, the company explained: transcribing the spoken text, processing and generating, and converting the response to speech. This caused delays.

We talk to computer scientist Mike Cook from the renowned Kings College London about the new Chat GPT-4o development.


0 comments:

Post a Comment

Grace A Comment!