News
Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, bug detection, and agent-like problem solving.
In May, Anthropic released Claude Opus 4, which the company dubbed its most powerful model yet and the best coding model in ...
Explore Claude Opus 4.1, Anthropic’s groundbreaking new AI model with advanced coding, multilingual, and problem-solving ...
Anthropic launched Claude Opus 4.1 today, an upgraded version of its flagship AI model that achieves 74.5% accuracy on ...
Anthropic has released Claude Opus 4.1, which is said to deliver better coding and agent performance with improved safety.
Anthropic says Claude Opus 4.1 improves software engineering accuracy to 74.5%. That compares to 62.3% with Claude Sonnet 3.7 ...
Anthropic's Claude Opus 4.1 achieves 74.5% on coding benchmarks, leading the AI market, but faces risk as nearly half its $3.1B API revenue depends on just two customers.
The party is over next month. Anthropic just announced new weekly rate limits to go with their already de facto shrunk limits ...
OpenAI CEO Sam Altman went so far as to call GPT-5 “the best model in the world.” That may be pride or hyperbole, as ...
Anthropic launches automated AI security tools for Claude Code that scan code for vulnerabilities and suggest fixes, ...
ChatGPT and Claude 4 are two of the smartest AI assistants available but they’re built with different strengths. Here’s how they compare to help you choose.
OpenAI has launched a new large language model, GPT-5, but it may not have ruffled too many feathers this time ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results