ChatGPT’s highly anticipated Advanced Voice could arrive ‘next week’

by Yaron

OpenAI CEO and co-founder Sam Altman revealed on X (formerly Twitter) Thursday that its Advanced Voice feature will begin rolling out “next week,” though only for a few select ChatGPT-Plus subscribers.

The company plans to “start the alpha with a small group of users to gather feedback and expand based on what we learn.”

alpha rollout starts to plus subscribers next week!

— Sam Altman (@sama) July 25, 2024

Advanced Voice, which does away with the text prompt and enables users to converse directly with the AI as one would another human, was initially announced in May alongside the release of GPT-4o during the company’s Spring Update event. Unlike existing digital assistants like Siri and Google Assistant, which only provide canned answers to user queries, ChatGPT’s Advanced Voice provides human-like responses, nearly latency-free, and in multiple languages.

The GPT-4o model is able to respond to audio inputs in 320 milliseconds on average, which is on par with how quickly humans react to normal conversation. As you can see in the demo video below, the model can converse with multiple users simultaneously, improvise talking points and questions in both English and Portuguese as well as conveying them with human-ish emotions, including “laughter.”Learning a new language with ChatGPT Advanced Voice Mode

There’s no word yet on how the company will choose participants for alpha trial aside from them being $20/month ChatGPT Plus-tier subscribers. The alpha release was originally scheduled for June, though that date was pushed back “to reach our bar to launch” and improve its ability to detect and reject prohibited forms of content, as well as buttress the company’s IT infrastructure to accommodate the anticipated user load increase.

As the company announced in June, the feature’s full rollout won’t happen until at least this fall, and its exact timing will, again, depend on it “meeting our high safety and reliability bar.”

Giving ChatGPT the ability to converse naturally with its users is a huge advancement. Eliminating the need for a context window reduce user hardware requirements and expand the potential integrations and use cases for AI (such as increasing access to users with body mobility or dexterity limitations).

It can also help speed the technology’s adoption by the public by reducing the barrier to entry for less-tech-savvy users who are comfortable with interacting with their computers via “hey Siri” but blanch at the prospect of prompt engineering.Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…

  • Computing

GPT-5: everything we know so far about OpenAI’s next frontier model
A MacBook Pro on a desk with ChatGPT's website showing on its display.

There’s perhaps no product more hotly anticipated in tech right now than GPT-5. Rumors about it have been circulating ever since the release of GPT-4, OpenAI’s groundbreaking foundational model that’s been the basis of everything the company has launched over the past year, such as GPT-4o, Advanced Voice Mode, and the OpenAI o1-preview.

Those are all interesting in their own right, but a true successor to GPT-4 is still yet to come. Now that it’s been over a year a half since GPT-4’s release, buzz around a next-gen model has never been stronger.
When will GPT-5 be released?
OpenAI has continued a rapid rate of progress on its LLMs. GPT-4 debuted on March 14, 2023, which came just four months after GPT-3.5 launched alongside ChatGPT. OpenAI has yet to set a specific release date for GPT-5, though rumors have circulated online that the new model could arrive as soon as late 2024.

Read more

  • Computing

OpenAI could release its next-generation model by December
ChatGPT giving a response about its knowledge cutoff.

OpenAI plans to release its next-generation frontier model, code-named Orion and rumored to actually be GPT-5, by December, according to an exclusive report from The Verge. However, OpenAI boss Sam Altman is already pushing back.

According to “sources familiar with the plan,” Orion will not initially be released to the general public, as the previous GPT-4 variants were. Instead, the company intends to hand the new model over to select businesses and partners, who will then use it as a platform to build their own products and services. This is the same strategy that Nvidia is pursuing with its NVLM 1.0 family of large language models (LLMs).

Read more

  • Computing

Radiohead’s Thom Yorke among thousands of artists who issue AI protest
Thom Yorke on stage.

Leading actors, authors, musicians, and novelists are among 11,500 artists to have put their name to a statement calling for a halt to the unlicensed use of creative works to train generative AI tools like OpenAI’s ChatGPT, describing it as a “threat” to the livelihoods of creators.

The open letter, comprising just 29 words, says: “The unlicensed use of creative works for training generative AI is a major, unjust threat to the livelihoods of the people behind those works, and must not be permitted.”

Related Posts

Leave a Comment