Fortune

OpenAI’s lead over other AI companies has largely vanished, ‘State of AI’ report finds

By Jeremy Kahn

3 days ago

Hello and welcome to Eye on AI. In this edition…AI's fast-falling cost…Google goes nuclear…LLMs may be dumber than you think…and a filmmaker burned by genAI backlash.

Every year for the past seven, Nathan Benaich, the founder and solo general partner at the early-stage AI investment firm Air Street Capital, has produced a magisterial “State of AI” report. Benaich and his collaborators marshal an impressive array of data to provide a great snapshot of the technology’s evolving capabilities, the landscape of companies developing it, a survey of how AI is being deployed, and a critical examination of the challenges still facing the field.

OpenAI's lead mostly vanishes

One of the big takeaways from this year’s report, which was published late last week, is that OpenAI’s lead over other AI labs has largely eroded. Anthropic’s Claude 3.5 Sonnet, Google’s Gemini 1.5, xAI’s Grok 2, and even Meta’s open-source Llama 3.1 405B model have equaled, or narrowly surpassed on some benchmarks, OpenAI’s GPT-4o.

On the other hand, OpenAI still retains an edge for the moment on reasoning tasks with the release of its o1 “Strawberry” model, which Air Street’s report rightly characterized as a weird mix of incredibly strong logical abilities for some tasks and surprisingly weak ones for others. (For more on the fragility of o1’s reasoning abilities, see the “Research” section below.)

Inference costs fall rapidly

Another big takeaway, Benaich told me, is the extent to which the cost of using a trained AI model—an activity known as "inference"—is falling rapidly. There are several reasons for this. One is linked to that first big takeaway: With models less differentiated from one another on capabilities and performance, companies are forced to compete on price.

Another reason is that engineers at companies such as OpenAI and Anthropic, along with their respective hyperscaler partners Microsoft and AWS, are discovering ways to optimize how the largest models run on big GPU clusters. Output from OpenAI’s GPT-4o today costs about one-hundredth as much per token (a token is roughly three-quarters of a word) as output from GPT-4 did when that model debuted in March 2023. Google’s Gemini 1.5 Pro now costs 76% less per output token than it did when that model was launched in February 2024.
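The two price drops above are quoted in different forms, one as a multiple and one as a percentage, which makes them hard to compare at a glance. The short Python sketch below converts between the two using only the figures cited in the report; the function names are illustrative and not taken from any source.

```python
def fold_to_percent_reduction(fold: float) -> float:
    """Convert an 'N times cheaper' figure into a percentage price cut."""
    return (1 - 1 / fold) * 100


def percent_reduction_to_fold(percent: float) -> float:
    """Convert a percentage price cut into an 'N times cheaper' figure."""
    return 1 / (1 - percent / 100)


# GPT-4o vs. GPT-4 at launch: 100 times cheaper per output token
print(round(fold_to_percent_reduction(100), 1))  # 99.0 (percent cheaper)

# Gemini 1.5 Pro: 76% cheaper per output token than at launch
print(round(percent_reduction_to_fold(76), 2))   # 4.17 (times cheaper)
```

Put on the same scale, the GPT-4-to-GPT-4o drop amounts to a 99% price cut, while Gemini 1.5 Pro's 76% cut makes it a little more than four times cheaper than at launch.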

AI researchers have also become good at creating small AI models that can equal the performance of larger LLMs on dialogue, summarization, or even coding, while being much cheaper to run. Taken together, these two trends mean that the economics of implementing AI-based solutions are starting to look much more attractive than they did a year ago. This may ultimately help businesses find the return on investment from generative AI that they have complained has been elusive so far.

Robotics makes a comeback

Another key trend Benaich picks up on is how robotics is coming back into vogue, with robotics companies marrying LLMs and new “world models” to existing tech to make significant progress in making robots more capable and easier (as well as cheaper) to deploy and customize.

Benaich’s State of AI report always ends with some bold predictions for the year ahead (and Benaich grades himself each year on how he’s done). Among the things he got right last year: that a Hollywood production would make use of genAI models for visual effects and that there would be limited progress on international AI governance efforts. Among those he got wrong: that a company would spend more than $1 billion training a single LLM.

Among the report's predictions this year are that an open-source alternative to OpenAI’s o1 will surpass it across a range of benchmarks and that a $10 billion investment from a sovereign state into a U.S. AI company will cause the U.S. government to institute a national security review. We’ll check back next year to see how Benaich did.

Fortune Brainstorm AI takes the pulse of a fast-changing industry

The State of AI report is not the only way to find a fantastic overview of what’s happening in AI. Another great place to gain a vantage point on AI’s rapidly evolving landscape and find out how AI is impacting business is Fortune’s upcoming Brainstorm AI conference in San Francisco. This must-attend annual event is coming up on December 9 and 10, held at the St. Regis Hotel.

This year’s conference will include conversations with, among many others: Amazon’s head scientist for artificial general intelligence, Rohit Prasad, who will update us on how the Everything Store is trying to ensure it doesn’t get left behind in the race to build superpowerful, and super useful, AI; Liz Reid, Google’s vice president of search, who will discuss the future of Google’s signature product in an AI world; Christopher Young, Microsoft’s executive vice president of business development, strategy, and ventures, who will discuss how the tech giant is trying to see around corners to what is coming next for AI; Daniela Braga, the founder and CEO of Defined.ai, who will tell us what it really takes to build AI models that work for customers; and Colin Kaepernick, former Super Bowl quarterback for the San Francisco 49ers and current founder and CEO of Lumi, a company that builds AI-powered tools for content creators, who will speak about his own transformation from professional athlete to entrepreneur, and what AI may mean for influencers, brands, and beyond.

I’ll be there, of course, helping to cochair the discussion with a gaggle of ultra-talented colleagues. I hope you will all consider joining me! And I’m very excited to be able to offer Eye on AI readers a special discounted rate—20% off the regular price of attendance! Just write the code KAHN20 in the Additional Comments section of the application to secure your discount. You can click here to find out more. Follow the link on that page to apply to attend. Remember to use the discount code!

With that, here’s more AI news.

Jeremy Kahn
jeremy.kahn@fortune.com
@jeremyakahn

This story was originally featured on Fortune.com
