Anthropic launches Claude 4.5, a powerful AI model that outperforms GPT-5 in coding, aiming to dominate the enterprise ...
Like ACP, AP2 is an open-source protocol designed to let AI agents securely complete purchases. But while ACP emphasizes keeping merchants in control using their existing processors, AP2 focuses on ...
eSelf, a startup developing interactive, photorealistic talking AI video avatars, has introduced a new feature called Share ...
Composite raises $5.6M seed funding to automate repetitive browser tasks with AI agents that transform existing browsers into ...
MMLU-Pro holds steady at 85.0, AIME 2025 slightly improves to 89.3, while GPQA-Diamond dips from 80.7 to 79.9. Coding and agent benchmarks tell a similar story, with Codeforces ratings rising from ...
Will the application of AI reduce staff in pursuit of efficiency, or can we design systems that preserve human dignity, ...
Microsoft unveils new AI agents in GitHub Copilot and Azure Migrate that automate legacy code modernization, helping ...
According to the company, Liquid Nanos deliver performance that rivals far larger models on specialized, agentic workflows ...
Perplexity AI launches comprehensive search API giving developers access to hundreds of billions of web pages, challenging ...
Meta released an agentic testing environment, Agents Research Environment, and a new benchmark called Gaia2 to measure ...
The Parallel-R1 framework uses reinforcement learning to teach models how to explore multiple reasoning paths at once, ...
ChatGPT Pulse is OpenAI's experiment in creating more autonomous, ambient agents for ChatGPT Pro subscribers on mobile.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results