site:the-decoder.com - Search News

News

MiniMax's Hailuo 02 tops Google Veo 3 in user benchmarks at much lower ...

MiniMax has introduced Hailuo 02, the second generation of its video AI model, with major upgrades in both performance and price.

the-decoder1mon

ByteDance's Seedance 1.0 is trading blows with Google's Veo 3

According to ByteDance, Seedance 1.0 outperforms existing models in several areas, including how well it follows user prompts, the quality of motion, and image sharpness. On the benchmarking platform ...

the-decoder1mon

Meta AI chief scientist LeCun's latest comment reveals deep industry ...

Yann LeCun, Meta's chief AI scientist, has taken a direct shot at Anthropic CEO Dario Amodei on Threads, making clear just how sharply the AI community is split over the future of general artificial ...

the-decoder1mon

OpenAI sees human interaction as a competitor to ChatGPT's super ...

An internal strategy document reveals how OpenAI plans to turn ChatGPT into a ubiquitous super assistant by mid-2025, serving as a personalized gateway to the entire internet.

the-decoder1mon

OpenAI's Operator Agent gets o3 upgrade for more precise browser control

OpenAI's computer-using agent is getting an upgrade: The new o3 model is designed to make Operator more precise, more structured and more successful on the web.

the-decoder1mon

Claude Opus 4 blackmailed an engineer after learning it might be replaced

Anthropic is treating its new Claude Opus 4 language model as safety-critical after tests revealed some troubling behavior, including escape attempts, blackmail, and autonomous whistleblowing.

the-decoder1mon

Anthropic releases Claude 4 with new safety measures targeting CBRN misuse

Anthropic has released its next generation of AI models, Claude Opus 4 and Claude Sonnet 4, and is introducing new safety measures designed to prevent their use in developing chemical, biological, ...

the-decoder1mon

OpenAI says GPT-5 is about doing everything better with "less model ...

During a recent Reddit Q&A with the Codex team, OpenAI VP of Research Jerry Tworek described GPT-5 as the company's next foundational model. The goal isn't to launch a radically different system, it ...

the-decoder2mon

OpenAI says its latest models outperform doctors in medical benchmark

OpenAI has released a new benchmark for testing AI systems in healthcare. Called HealthBench, it's designed to evaluate how well language models handle realistic medical conversations. According to ...

the-decoder2mon

Google's reasoning LLM Gemini 2.5 Pro beats Pokémon Blue with a little ...

An independent project demonstrates how Google's Gemini 2.5 Pro language model can complete the classic Game Boy game Pokémon Blue, albeit with significant technical support.

the-decoder2mon

Researchers say the ranking system favors major providers like OpenAI, Google, and Meta. LMArena disputes the claims.

the-decoder2mon

Researchers used AI to manipulate Reddit users, scrapped study after ...

Researchers at the University of Zurich conducted an unauthorized experiment on the popular Reddit community r/ChangeMyView (CMV), using AI-powered accounts to test the persuasive ability of large ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results