OpenAI just took the shackles off the free version of ChatGPT

by Yaron


OpenAI announced the release of its newest snack-sized generative model, dubbed GPT-4o mini, which is both less resource intensive and cheaper to operate than its standard GPT-4o model, allowing developers to integrate the AI technology into a far wider range of products.

It’s a big upgrade for developers and apps, but it also expands the capabilities and reduces limitations on the free version of ChatGPT. GPT-4o mini is now available to users on the Free, Plus, and Team tiers through the ChatGPT web and app for users and developers starting today, while ChatGPT Enterprise subscribers will gain access next week. GPT-4o mini will replace the company’s existing small model, GPT-3.5 Turbo, for end users beginning today.

The older model is still available to developers through the API if they don’t want to switch to 4o mini just yet. The company says it will retire the older model eventually but has not yet set a date.

GPT-4o has been available to free ChatGPT accounts since May, but there have been limitations around demand. According to the updated FAQ page, GPT-4o proper still has those limitations in place, but you’ll now get downgraded to GPT-4o mini rather than GPT-3.5 when you hit your limit. In theory, that’s a big win for those that haven’t upgraded to ChatGPT Plus.

We’re continuing to make advanced AI accessible to all with the launch of GPT-4o mini, now available in the API and rolling out in ChatGPT today. https://t.co/sTxtOfUapJ

— OpenAI (@OpenAI) July 18, 2024

Per data from Artificial Analysis, OpenAI’s newest AI model scored 82% on the MMLU reasoning benchmark, beating Gemini 1.5 Flash by 3% and Claude 3 Haiku by 7%. For reference, the highest MMLU benchmark to date was set by Gemini Ultra, Google’s top-of-the-line AI, with a score of 90%.

What’s more, OpenAI claims that GPT-4o mini is 60% cheaper to operate than GPT-3.5 Turbo. Developers will pay 15 cents per million input tokens and 60 cents per million output tokens. OpenAI says GPT-4o mini is “the most capable and cost-efficient small model available today,” per CNBC.

Where do those cost savings come from? Well, not every task that can be enhanced by AI needs the full weight and capability of a full-sized model like GPT, Claude or Gemini. Like swatting flies with a sledgehammer, utilizing a standard size LLM for simple but high-volume tasks is overkill and wastes both money and compute resources — which is where small LLMs such as Google’s Gemini 1.5 Flash, Meta’s Llama 3 8b, or Anthropic’s Claude 3 Haiku come in. They’re able to perform these simple, repetitive tasks faster and more cost-efficiently than the larger iterations.

According to OpenAI, GPT-4o mini will have the same size context window, 128,000 tokens (roughly a book’s worth of content), as the full-size version with the same knowledge cutoff as well, October 2023, though the company did not specify the new model’s exact size. The model API currently only offers text and vision capabilities, but video and audio will be coming in the future as well.

The announcement comes just a few weeks after OpenAI provided a long-awaited update on its anticipated, advanced Voice Mode as part of GPT-4o. The company’s update indicated that a smaller alpha release was still to come in late July, with a wider rollout being held for this fall.

Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
  • Computing

Your ChatGPT conversation history is now searchable
ChatGPT chat search

OpenAI debuted a new way to more efficiently manage your growing ChatGPT chat history on Tuesday: a search function for the web app. With it, you’ll be able to quickly surface previous references and chats to cite within your current ChatGPT conversation.

“We’re starting to roll out the ability to search through your chat history on ChatGPT web,” the company announced via a post on X (formerly Twitter). “Now you can quickly & easily bring up a chat to reference, or pick up a chat where you left off.”

Read more

  • Computing

GPT-5: everything we know so far about OpenAI’s next frontier model
A MacBook Pro on a desk with ChatGPT's website showing on its display.

There’s perhaps no product more hotly anticipated in tech right now than GPT-5. Rumors about it have been circulating ever since the release of GPT-4, OpenAI’s groundbreaking foundational model that’s been the basis of everything the company has launched over the past year, such as GPT-4o, Advanced Voice Mode, and the OpenAI o1-preview.

Those are all interesting in their own right, but a true successor to GPT-4 is still yet to come. Now that it’s been over a year a half since GPT-4’s release, buzz around a next-gen model has never been stronger.
When will GPT-5 be released?
OpenAI has continued a rapid rate of progress on its LLMs. GPT-4 debuted on March 14, 2023, which came just four months after GPT-3.5 launched alongside ChatGPT. OpenAI has yet to set a specific release date for GPT-5, though rumors have circulated online that the new model could arrive as soon as late 2024.

Read more

  • Computing

OpenAI could release its next-generation model by December
ChatGPT giving a response about its knowledge cutoff.

OpenAI plans to release its next-generation frontier model, code-named Orion and rumored to actually be GPT-5, by December, according to an exclusive report from The Verge. However, OpenAI boss Sam Altman is already pushing back.

According to “sources familiar with the plan,” Orion will not initially be released to the general public, as the previous GPT-4 variants were. Instead, the company intends to hand the new model over to select businesses and partners, who will then use it as a platform to build their own products and services. This is the same strategy that Nvidia is pursuing with its NVLM 1.0 family of large language models (LLMs).

Related Posts

Leave a Comment