AUGAF
  • Home
  • Politics
  • Business
  • National
  • News
  • Finance
  • Technology
  • Sports
  • International
  • CommoditiesNew
  • Contact
No Result
View All Result
  • Home
  • Politics
  • Business
  • National
  • News
  • Finance
  • Technology
  • Sports
  • International
  • CommoditiesNew
  • Contact
No Result
View All Result
AUGAF
No Result
View All Result
Home International

Open AI Introduces A New Series of Reasoning Models for Solving Hard Problems

admin-augaf by admin-augaf
September 13, 2024
in International, Technology
Reading Time: 3 mins read
0
Open AI Introduces A New Series of Reasoning Models for Solving Hard Problems
Share on FacebookShare on TwitterWhatsapp

London September 13 2024: Open AI developed a new series of AI models designed to spend more time thinking before they respond. They can reason through complex tasks and solve harder problems than previous models in science, coding, and math.

Today, we are releasing the first of this series in ChatGPT and our API. This is a preview and we expect regular updates and improvements. Alongside this release, we’re also including evaluations for the next update, currently in development.

How it works

We trained these models to spend more time thinking through problems before they respond, much like a person would. Through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.

In our tests, the next model update performs similarly to PhD students on challenging benchmark tasks in physics, chemistry, and biology. We also found that it excels in math and coding. In a qualifying exam for the International Mathematics Olympiad (IMO), GPT-4o correctly solved only 13% of problems, while the reasoning model scored 83%. Their coding abilities were evaluated in contests and reached the 89th percentile in Codeforces competitions. You can read more about this in our technical research post.

As an early model, it doesn’t yet have many of the features that make ChatGPT useful, like browsing the web for information and uploading files and images. For many common cases GPT-4o will be more capable in the near term.

But for complex reasoning tasks this is a significant advancement and represents a new level of AI capability. Given this, we are resetting the counter back to 1 and naming this series OpenAI o1.

Safety

As part of developing these new models, we have come up with a new safety training approach that harnesses their reasoning capabilities to make them adhere to safety and alignment guidelines. By being able to reason about our safety rules in context, it can apply them more effectively.

One way we measure safety is by testing how well our model continues to follow its safety rules if a user tries to bypass them (known as “jailbreaking”). On one of our hardest jailbreaking tests, GPT-4o scored 22 (on a scale of 0-100) while our o1-preview model scored 84. You can read more about this in the system card and our research post.

To match the new capabilities of these models, we’ve bolstered our safety work, internal governance, and federal government collaboration. This includes rigorous testing and evaluations using our Preparedness Framework(opens in a new window), best-in-class red teaming, and board-level review processes, including by our Safety & Security Committee.

To advance our commitment to AI safety, we recently formalized agreements with the U.S. and U.K. AI Safety Institutes. We’ve begun operationalizing these agreements, including granting the institutes early access to a research version of this model. This was an important first step in our partnership, helping to establish a process for research, evaluation, and testing of future models prior to and following their public release.

Whom it’s for

These enhanced reasoning capabilities may be particularly useful if you’re tackling complex problems in science, coding, math, and similar fields. For example, o1 can be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers in all fields to build and execute multi-step workflows.

OpenAI o1-mini

The o1 series excels at accurately generating and debugging complex code. To offer a more efficient solution for developers, we’re also releasing OpenAI o1-mini, a faster, cheaper reasoning model that is particularly effective at coding. As a smaller model, o1-mini is 80% cheaper than o1-preview, making it a powerful, cost-effective model for applications that require reasoning but not broad world knowledge.

How to use OpenAI o1

ChatGPT Plus and Team users will be able to access o1 models in ChatGPT starting today. Both o1-preview and o1-mini can be selected manually in the model picker, and at launch, weekly rate limits will be 30 messages for o1-preview and 50 for o1-mini. We are working to increase those rates and enable ChatGPT to automatically choose the right model for a given prompt.

Tags: AIChatGpt
admin-augaf

admin-augaf

Related Posts

China Detains Investment Bankers, Takes Passports in Corruption Sweep
International

China Plans Nationwide Subsidies to Boost Birthrate

July 4, 2025
High Alert on River Ravi After India Released Water
Business

Pakistan Tops Sovereign Risk Improvement, Bloomberg Intelligence

June 28, 2025
Early US Intel Assessment Suggests Strikes on Iran Did Not Destroy Nuclear Sites – CNN
International

Early US Intel Assessment Suggests Strikes on Iran Did Not Destroy Nuclear Sites – CNN

June 25, 2025
Fair Global Consult Fair Global Consult Fair Global Consult
ADVERTISEMENT

Recent News

Pakistan Textile Exports increased 26 percent to USD 14.26 billion YoY in 9MFY22: APTMA

Pakistan’s Textile Exports Surge 32% in July, Led by Value-Added Segments

August 22, 2025
Gold

Gold Fields Half-Year Profit Triples on Record Prices

August 22, 2025
Pakistan will get back $900 million payment of Reko Diq dispute if conditions not met

ADB To Provide $410 Million For Reko Diq Project

August 22, 2025
Moody

Moody’s Upgrade Ratings of Five Pakistani Banks

August 20, 2025
EPQL accept PPIB proposal to operate plant on comingled fuel but at its own cost

EPQL Executed Supplemental Agreement to PPA with CPPA for Additional Gas

August 20, 2025

Popular News

  • NSS

    President Prohibit National Savings For Changing Rates on Existing Certificates Retrospectively

    0 shares
    Share 0 Tweet 0
  • Pakistan Rupee Appreciate against Dollar in Interbank as IMF Confirmed Board Review Date

    0 shares
    Share 0 Tweet 0
  • Pakistan Rupee Fall After 13 Days of Successive Gains against Dollar on Lower Remittances and Strengthening of US Dollar

    0 shares
    Share 0 Tweet 0
  • Petrol Prices in Pakistan to Return to July 2023 Levels

    0 shares
    Share 0 Tweet 0
  • Pakistan Central Bank Issued Show Cause Notice to Eight Banks Over Currency Speculation

    0 shares
    Share 0 Tweet 0

Categories

  • Budget
  • Business
  • Culture
  • Finance
  • International
  • National
  • News
  • Politics
  • PTI
  • Sports
  • Technology
AUGAF Logo

Follow us on social media:

Recent News

  • Pakistan’s Textile Exports Surge 32% in July, Led by Value-Added Segments
  • Gold Fields Half-Year Profit Triples on Record Prices
  • ADB To Provide $410 Million For Reko Diq Project

Category

  • Budget
  • Business
  • Culture
  • Finance
  • International
  • National
  • News
  • Politics
  • PTI
  • Sports
  • Technology

Recent News

Pakistan Textile Exports increased 26 percent to USD 14.26 billion YoY in 9MFY22: APTMA

Pakistan’s Textile Exports Surge 32% in July, Led by Value-Added Segments

August 22, 2025
Gold

Gold Fields Half-Year Profit Triples on Record Prices

August 22, 2025
  • Home
  • Politics
  • News
  • Business
  • National
  • Finance
  • Technology
  • International

© 2021 AUGAF.

No Result
View All Result
  • Home
  • Politics
  • Business
  • National
  • News
  • Finance
  • Technology
  • Sports
  • International
  • Commodities
  • Contact

© 2021 AUGAF.