AI · OpenAI

OpenAI's GPT-5 Achieves Near-Human Performance on Complex Multi-Step Reasoning Tasks

By AI Research Desk · April 14, 2026 · 5 min read

OpenAI has released benchmark results showing that GPT-5 achieves near-human performance on complex multi-step reasoning tasks across mathematics, law, and medicine.

The model scored 94% on the LSAT, 91% on the USMLE Step 2, and 97% on the AMC 12 mathematics competition. GPT-4 scored 88%, 83%, and 91% respectively, so the gains are 6, 8, and 6 percentage points.

Most impressively, GPT-5 maintained coherent reasoning chains across 15 or more steps, whereas previous models struggled beyond 5 to 7 steps.

OpenAI's research paper attributes the improvement to a new training methodology called 'recursive self-refinement,' where the model iteratively improves its own reasoning during inference.

The implications for professional services are significant. Legal research, medical diagnosis, and financial analysis all require multi-step reasoning, and all could be substantially augmented by these capabilities.

The Sovereign Post
