Interesting Engineering

OpenAI o1: New AI model launched with ‘PhD-level’ reasoning, math, coding skills

By Kapil Kajal,

6 hours ago

ChatGPT developer OpenAI has announced the launch of a new series of AI reasoning models to solve hard problems on September 12.

Codenamed Strawberry, the new AI models are officially called OpenAI o1.

Interestingly, OpenAI has trained these models to spend more time thinking through problems before responding, much like a person would.

According to OpenAI, the models can reason through complex tasks and solve harder problems than previous models in science, coding, and math.

PhD level skills

OpenAI said that the models learn to refine their thinking process, try different strategies, and recognize their mistakes through training.

In the tests, the new models perform similarly to PhD students on challenging benchmark physics, chemistry, and biology tasks. The model also excels in math and coding.

In a qualifying exam for the International Mathematics Olympiad (IMO), currently available GPT-4o correctly solved only 13% of problems, while the reasoning model scored 83%.

OpenAI evaluated the coding abilities of new models in contests and reached the 89th percentile in Codeforces competitions.

As an early model, it still needs many features that make ChatGPT useful, like browsing the web for information and uploading files and images.

For many common cases, GPT-4o will soon be more capable. However, this is a significant advancement for complex reasoning tasks and represents a new level of AI capability.

Given this, OpenAI is resetting the counter back to 1 and naming this series OpenAI o1.

Harnessing reasoning capabilities

As part of developing these new models, OpenAI has also developed a new safety training approach that harnesses their reasoning capabilities to ensure their adhesion to safety and alignment guidelines.

Safety was measured by conducting the model’s user test on jailbreaking. On one of the hardest jailbreaking tests, GPT-4o scored 22 (on a scale of 0 to 100), while the o1-preview model scored 84.

OpenAI said that these enhanced reasoning capabilities are useful for tackling complex problems in science, coding, math, and similar fields.

For example, o1 can be used by healthcare researchers to annotate cell sequencing data, physicists to generate complicated mathematical formulas needed for quantum optics, and developers in all fields to build and execute multi-step workflows.

The o1 series excels at accurately generating and debugging complex code.

To offer developers a more efficient solution, OpenAI has also released OpenAI o1-mini, a faster, cheaper reasoning model that is particularly effective at coding.

As a smaller model, o1-mini is 80% cheaper than o1-preview, making it a powerful, cost-effective model for applications that require reasoning but not broad world knowledge.

New models available from today

ChatGPT Plus users can access o1 models in ChatGPT starting today.

The model picker allows you to select both o1-preview and o1-mini manually. At launch, the weekly rate limits will be 30 messages for o1-preview and 50 for o1-mini.

OpenAI also works to increase those rates and enable ChatGPT to choose the right model for a given prompt automatically.

Developers who qualify for API usage tier 5 can start prototyping with both models in the API today with a rate limit of 20 RPM.

The API for these models currently doesn’t include function calls, streaming, support for system messages, and other features.

OpenAI also plans to bring o1-mini access to all ChatGPT Free users.

Expand All

Read in NewsBreak

Comments /

Add a Comment

YOU MAY ALSO LIKE

Local News

New stretchable electronics for soft robots to allow embedded computation

Interesting Engineering1 day ago

Life in 3D: 1mm robot folds, flips and moves with just a spark of power

Interesting Engineering13 hours ago

World’s first aerospike engine aircraft to take to the skies in September

Interesting Engineering2 days ago

Every household can get four free COVID-19 tests by mail, starting late September

Northern Kentucky Tribune5 days ago

Chick-fil-A customer says they’ll never order a full meal again after seeing kids’ meal

NewsNinja22 days ago

Mountaintop falling into sea caused mega-tsunami, shook Earth for 9 days

Interesting Engineering8 hours ago

Game-changing energy harvester generates endless electricity from seawater

Interesting Engineering14 hours ago

World’s strongest battery could extend EV range by 70%, make phones credit card-thin

Interesting Engineering1 day ago

Tesvolt Ocean’s ‘Kaptein’ power storage offers high energy density, fast charging

Interesting Engineeringlast hour

SpaceX makes history with world’s 1st private spacewalk 458 miles above Earth

Interesting Engineering14 hours ago

Wonder Jelly

Alameda Post17 days ago

Solar device makes 20L drinking water a day from seawater with 93% efficiency

Interesting Engineering18 hours ago

Microbes make nutrients out of thin air; richer source of protein than beef, fish

Interesting Engineering10 hours ago

New plant produces lithium 500 times faster with 96% recovery rate from brine

Interesting Engineering11 hours ago

Carnival Cruise Line Rushes Passengers Back to Ship as Weather Worsens

J. Souza28 days ago

Meet your next cat: Five cats you can adopt right now

Cats of Kansas City6 hours ago

In a 1st, Japanese eels seen escaping predator’s stomach after getting swallowed

Interesting Engineering2 days ago

Fentanyl-meth combo ravages homeless in Denver, so why aren't there better treatments?

David Heitz5 days ago

Cheap aluminum paste used to build TOPCon solar cells with 22.56% efficiency

Interesting Engineering11 hours ago

Thousands of parents die of overdoses; advocates say their kids need more help

Northern Kentucky Tribune24 days ago

‘World’s largest, most advanced’ dark matter detector to be built by UK

Interesting Engineering1 day ago

In Memory of TV Star Meshach Taylor ('Designing Women'): 10 Years After His Tragic Death From Cancer

Herbie J Pilato11 days ago

New 450Wh/kg solid-state battery to boost range of Mercedes future EVs by 80%

Interesting Engineering2 days ago

The 4 Meanest Signs in the Zodiac

Emily Standley Allard27 days ago

Meet The Tiny 6lb Dog Looking For Love

Dianna Carney15 days ago

US Air Force hypersonic missiles will sweat like humans to stay cool, fly faster

Interesting Engineering1 day ago

Sticky fingers: Colorado ranks seventh in U.S. for retail theft, study shows

David Heitz14 days ago

Big Lots files bankruptcy amid closing 74 stores in California

The HD Post3 days ago

Opinion: Many homeless, addicted people in Denver crave sobriety

David Heitz22 days ago

Japan to build supercomputer 1000 times faster than world’s most powerful machines

Interesting Engineering1 day ago

It’s essential to note our commitment to transparency:

Our Terms of Use acknowledge that our services may not always be error-free, and our Community Standards emphasize our discretion in enforcing policies. As a platform hosting over 100,000 pieces of content published daily, we cannot pre-vet content, but we strive to foster a dynamic environment for free expression and robust discourse through safety guardrails of human and AI moderation.

Comments / 0

Community Policy