Open in App
  • Local
  • U.S.
  • Election
  • Politics
  • Sports
  • Lifestyle
  • Education
  • Real Estate
  • Newsletter
  • Interesting Engineering

    OpenAI o1: New AI model launched with ‘PhD-level’ reasoning, math, coding skills

    By Kapil Kajal,

    6 hours ago

    https://img.particlenews.com/image.php?url=0RWIS4_0vUJtwkS00

    ChatGPT developer OpenAI has announced the launch of a new series of AI reasoning models to solve hard problems on September 12.

    Codenamed Strawberry, the new AI models are officially called OpenAI o1.

    Interestingly, OpenAI has trained these models to spend more time thinking through problems before responding, much like a person would.

    According to OpenAI, the models can reason through complex tasks and solve harder problems than previous models in science, coding, and math.

    PhD level skills

    OpenAI said that the models learn to refine their thinking process, try different strategies, and recognize their mistakes through training.

    In the tests, the new models perform similarly to PhD students on challenging benchmark physics, chemistry, and biology tasks. The model also excels in math and coding.

    In a qualifying exam for the International Mathematics Olympiad (IMO), currently available GPT-4o correctly solved only 13% of problems, while the reasoning model scored 83%.

    OpenAI evaluated the coding abilities of new models in contests and reached the 89th percentile in Codeforces competitions.

    As an early model, it still needs many features that make ChatGPT useful, like browsing the web for information and uploading files and images.

    For many common cases, GPT-4o will soon be more capable. However, this is a significant advancement for complex reasoning tasks and represents a new level of AI capability.

    Given this, OpenAI is resetting the counter back to 1 and naming this series OpenAI o1.

    Harnessing reasoning capabilities

    As part of developing these new models, OpenAI has also developed a new safety training approach that harnesses their reasoning capabilities to ensure their adhesion to safety and alignment guidelines.

    Safety was measured by conducting the model’s user test on jailbreaking. On one of the hardest jailbreaking tests, GPT-4o scored 22 (on a scale of 0 to 100), while the o1-preview model scored 84.

    OpenAI said that these enhanced reasoning capabilities are useful for tackling complex problems in science, coding, math, and similar fields.

    For example, o1 can be used by healthcare researchers to annotate cell sequencing data, physicists to generate complicated mathematical formulas needed for quantum optics, and developers in all fields to build and execute multi-step workflows.

    The o1 series excels at accurately generating and debugging complex code.

    To offer developers a more efficient solution, OpenAI has also released OpenAI o1-mini, a faster, cheaper reasoning model that is particularly effective at coding.

    As a smaller model, o1-mini is 80% cheaper than o1-preview, making it a powerful, cost-effective model for applications that require reasoning but not broad world knowledge.

    New models available from today

    ChatGPT Plus users can access o1 models in ChatGPT starting today.

    The model picker allows you to select both o1-preview and o1-mini manually. At launch, the weekly rate limits will be 30 messages for o1-preview and 50 for o1-mini.

    OpenAI also works to increase those rates and enable ChatGPT to choose the right model for a given prompt automatically.

    Developers who qualify for API usage tier 5 can start prototyping with both models in the API today with a rate limit of 20 RPM.

    The API for these models currently doesn’t include function calls, streaming, support for system messages, and other features.

    OpenAI also plans to bring o1-mini access to all ChatGPT Free users.

    Expand All
    Comments /
    Add a Comment
    YOU MAY ALSO LIKE
    Local News newsLocal News
    Alameda Post17 days ago
    Emily Standley Allard27 days ago

    Comments / 0