MMLU-Pro holds steady at 85.0, AIME 2025 slightly improves to 89.3, while GPQA-Diamond dips from 80.7 to 79.9. Coding and agent benchmarks tell a similar story, with Codeforces ratings rising from ...
US startup Anthropic on Monday announced the launch of its new generative artificial intelligence model, Claude Sonnet 4.5, ...
Anthropic evaluated the model’s programming capabilities using a benchmark called SWE-bench Verified. Sonnet 4.5 set a new industry record with an 82% score. The next two highest scores were also ...
The latest upgrade brings the ability to save your progress and create custom agents, with fewer behavioral issues, such as ...
“Companies spent $8.4 billion on API calls to LLMs in just the first half of 2025 — more than double the figure for all of 2024,” said Iddo Gino, founder and chief executive of Datawizz.
Anthropic has released Claude Sonnet 4.5, a new large language model that excels at coding tasks and outperforms competitors' ...
The plan is in its early stages, with Nasscom AI, the industry body's AI initiative, set to start consultations with industry experts and developers in the next couple of weeks. If feedback is ...
Anthropic has released Claude Sonnet 4.5, which it unabashedly refers to as "the best coding model in the world." ...
Preview, a trillion-parameter natural language reasoning model and the first open-source system of its scale. On the ...
Anthropic claims that Claude Sonnet 4.5 scored 77.2 percent on the SWE-bench benchmark, beating GPT-5 and Gemini 2.5 Pro.