How intelligent is a model that memorizes the answers before an exam? That’s the question facing OpenAI after it unveiled o3 in December and touted the model’s impressive benchmark results. At the time, some pundits hailed it as being close to AGI, the level at which artificial intelligence can match human performance on any task a user requires.
But money changes everything—even math tests, apparently.
OpenAI’s victory lap over o3’s stunning 25.2% score on FrontierMath, a challenging mathematical benchmark developed by Epoch AI, hit a snag when it emerged that the company wasn’t just acing the test: OpenAI had helped write it, too.
“We gratefully acknowledge OpenAI for their support in creating the benchmark,” Epoch AI wrote in an updated footnote on the FrontierMath whitepaper, a disclosure that was enough to raise red flags among enthusiasts.
Worse, OpenAI had not only funded FrontierMath’s development but also had access to its problems and solutions to use as it saw fit. Epoch AI later
Go to Source to See Full Article
Author: Jose Antonio Lanz
