News
MiniMax has introduced Hailuo 02, the second generation of its video AI model, with major upgrades in both performance and price.
According to ByteDance, Seedance 1.0 outperforms existing models in several areas, including how well it follows user prompts, the quality of motion, and image sharpness. On the benchmarking platform ...
Yann LeCun, Meta's chief AI scientist, has taken a direct shot at Anthropic CEO Dario Amodei on Threads, making clear just how sharply the AI community is split over the future of general artificial ...
An internal strategy document reveals how OpenAI plans to turn ChatGPT into a ubiquitous super assistant by mid-2025, serving as a personalized gateway to the entire internet.
OpenAI's computer-using agent is getting an upgrade: The new o3 model is designed to make Operator more precise, more structured and more successful on the web.
Anthropic is treating its new Claude Opus 4 language model as safety-critical after tests revealed some troubling behavior, including escape attempts, blackmail, and autonomous whistleblowing.
Anthropic has released its next generation of AI models, Claude Opus 4 and Claude Sonnet 4, and is introducing new safety measures designed to prevent their use in developing chemical, biological, ...
During a recent Reddit Q&A with the Codex team, OpenAI VP of Research Jerry Tworek described GPT-5 as the company's next foundational model. The goal isn't to launch a radically different system, it ...
OpenAI has released a new benchmark for testing AI systems in healthcare. Called HealthBench, it's designed to evaluate how well language models handle realistic medical conversations. According to ...
An independent project demonstrates how Google's Gemini 2.5 Pro language model can complete the classic Game Boy game Pokémon Blue, albeit with significant technical support.
Researchers say the ranking system favors major providers like OpenAI, Google, and Meta. LMArena disputes the claims.
Researchers at the University of Zurich conducted an unauthorized experiment on the popular Reddit community r/ChangeMyView (CMV), using AI-powered accounts to test the persuasive ability of large ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results