In brief

  • OpenAI’s GDPval benchmark tested real jobs—legal briefs, code, reports—and found AI matching human experts at breakneck speed.
  • Claude and GPT-5 outperformed seasoned professionals in 44 occupations, improving threefold in just over a year.
  • The study showed the first wave of disruption will hit office-based jobs, from coders to lawyers and journalists.

OpenAI unveiled GDPval on Thursday—a benchmark that tries to assess qualitatively whether AI can do your actual job.

These are not hypothetical exam questions, but real deliverables: legal briefs, engineering blueprints, nursing care plans, financial reports—the kind of work, that is, that pays mortgages. The researchers deliberately focused on occupations where at least 60% of tasks are computer-based—roles they describe as “predominantly digital.”

That scope covers professional services such as software developers, lawyers, accountants, and project managers; finance and insurance positions like analysts and customer service reps; and information-sector jobs ranging from journalists and editors to producers and AV technicians. Healthcare administration, white-collar manufacturing roles, and sales or real estate managers also feature prominently.

Go to Source to See Full Article
Author: Jose Antonio Lanz

BTC NewswireAuthor posts

BTC Newswire Crypto News at your Fingertips

Comments are disabled.