On Thursday, OpenAI officially revealed GPT-5 to the world. The much-hyped presentation was sparse on many particular benchmarks evaluating GPT-5 to its previous fashions, however OpenAI’s employees was adamant: this mannequin is the most effective, most educated, and strongest one up to now.
GPT-5 has its haters
Lots of the customers who’ve been check driving GPT-5 within the 24 hours since, nonetheless, disagree. A visit to r/ChatGPT is sufficient to see the scope of the scenario: The entrance web page is stuffed with posts complaining concerning the present state of the mannequin, together with: “GPT-5 is the biggest [piece] of garbage even as a paid user,” “OpenAI just pulled the biggest bait-and-switch in AI history and I’m done,” and “ChatGPT-5 rollout is an unmitigated disaster.”
Some of the distinguished complaints considerations OpenAI’s resolution to deprecate earlier fashions, one thing the corporate introduced unceremoniously through the GPT-5 presentation. GPT-4o, o3, 4.5, and different fashions are not accessible to make use of. Going ahead, customers will solely have entry to GPT-5 and its subsequent fashions (e.g. GPT-5 mini). Many customers are upset that OpenAI took away earlier fashions in a single day with zero warning, particularly once they really feel the alternative does not supply the identical expertise. Some have even canceled their subscriptions as a result.
I do know folks use ChatGPT for remedy, and I am conscious that individuals have fashioned deep attachments to the expertise, however I am going to admit, I used to be a bit shocked to learn a few of the emotional reactions to dropping entry to those fashions. In one post, a consumer detailed how they relied on particular person fashions for various duties: They’d use 4o for inventive concepts, o3 for logic issues, o3-Professional for deep analysis, and 4.5 for duties associated to writing. Another user talked about how they used 4o to assist with their anxiousness and despair, as, of their view, the mannequin felt “human.” They consider persons are grieving the lack of 4o, which tracks, not less than with some other 4o-specific posts. There are folks on the market who actually like these fashions, and are distraught following their removing.
However past mourning, some customers simply assume GPT-5 is not excellent. In case you ask the mannequin what number of instances the letter “b” happens within the phrase “blueberry,” it reportedly says “three”: as soon as at the start, as soon as within the phrase “blue,” and as soon as in “berry.” This is not essentially a brand new drawback—LLMs have had trouble spelling “strawberry” as well—however its not an excellent search for OpenAI’s “greatest” mannequin ever. One X user highlighted an instance of GPT-5’s incapacity to resolve a “easy linear equation,” versus Google’s Gemini 2.5’s skill to resolve it with out concern, whereas this user posted GPT-5’s era of a map of the USA, with many of the states labeled with gibberish.
Some customers teased OpenAI over its vague benchmarking data. Rhys on X sarcastically posted “these gpt-5 numbers are insane,” and hooked up a graph that charted every GPT model by quantity (GPT-1 lands at “1” on the Y axis, GPT-2 at “2,” and so forth till you attain GPT-5 at “5.”
This Tweet is currently unavailable. It might be loading or has been removed.
There are additionally criticisms of auto-switching, one in all GPT-5’s core options. Free and Plus ChatGPT customers aren’t ready to decide on the precise mannequin, however in OpenAI’s view, that is a very good factor. GPT-5 is meant to be clever sufficient to select the correct mannequin for you primarily based in your question: easy questions use weaker fashions, whereas extra complicated requests use strongest fashions. But when OpenAI is so positive that is a very good factor, why does it nonetheless supply the power to manually change fashions, so long as you pay $200 per month for a Pro plan?
Not everybody agrees that GPT-5 is dangerous, thoughts you. There are customers who seem like having fun with the mannequin, appreciating the concise responses and fast performance. However the majority of discourse I am seeing on social media and boards is impartial to detrimental. Even posts that initially appear constructive find yourself criticizing the mannequin:
What do you assume up to now?
This Tweet is currently unavailable. It might be loading or has been removed.
4o lives on, for now
Since beginning this piece, OpenAI has responded to the backlash. CEO Sam Altman posted a series of updates on X that appear to backtrack a bit on the selections customers have criticized most severely: Price limits will double for ChatGPT Plus customers for now; GPT-5 ought to appear smarter beginning immediately; it is going to be straightforward to see which mannequin is answering a given question; and manually selecting the considering mannequin might be extra easy. Altman additionally acknowledged the preliminary rollout goes slower than anticipated, which is sensible since I nonetheless do not have entry to the brand new mannequin.
However the greatest announcement of the bunch ought to come as welcome information to many customers: 4o is again, not less than for Plus customers. In case you pay $20 a month for ChatGPT, you’ll be able to maintain utilizing 4o in the interim. Altman says the corporate is watching utilization, and can decide on how lengthy it can supply legacy fashions for sooner or later.
I am curious how customers reply going ahead: Will those that canceled resubscribe to maintain utilizing 4o? Then once more, why trouble, if OpenAI is planning on taking away that mannequin once more someday sooner or later? One factor’s for positive: This doubtless is not how OpenAI anticipated GPT-5’s rollout to go.
Disclosure: Ziff Davis, Lifehacker’s dad or mum firm, in April filed a lawsuit towards OpenAI, alleging it infringed Ziff Davis copyrights in coaching and working its AI programs.
Trending Merchandise
