AI News: Llm
All articles tagged with "llm". Curated daily from top AI sources.
Chinese AI Startup Releases Groundbreaking Model, Beating GPT-5.5 at Lower Cost
Chinese AI startup Z.ai has released a new large language model called GLM-5.2, which performs better than GPT-5.5 on certain tasks at a lower cost. The model is available for free and can be used locally, which may be appealing to businesses concerned about regulatory changes.
AI Startup Sakana Launches Fugu System to Beat Vendor Lock-in
** Sakana, an AI startup, launches Fugu, a system that delivers top AI performance without relying on single models. Fugu avoids vendor lock-in and geopolitical export controls.
Anthropic's AI Tool Learns Your Company's Rhythm on Slack
A new AI tool is watching Slack messages to learn about your company's workflow. This is a strategic play by Anthropic to capture valuable knowledge and context.
Google makes Interactions API the default interface for Gemini models and agents
** Google has made changes to its Gemini platform, a system for creating conversational AI agents and chatbots. The Interactions API is now the default way to interact with Gemini models and agents, making it easier to use and develop new features. This shift is expected to improve the performance and user experience of Gemini-powered chatbots.
Turn Off AI-Powered Writing in Google Docs
Google Docs users can now disable AI-powered writing features. A new option allows users to opt out of AI-powered suggestions in the document editor.
Claude AI Requires Identity Verification for Some Features
Claude AI, a popular conversational AI model, will now require users to verify their identity to access certain features. This change aims to improve the model's safety and security.
AI App Builder Creates Functional Program in Just 5 Minutes
A new AI app builder claims to create functional programs in just 5 minutes. This is a significant improvement over traditional coding methods.
Anthropic Hints at Price Hike Pause for AI Agent SDK
Anthropic, a prominent AI company, has made a surprising move regarding its Claude Agent SDK. The company was set to introduce a new pricing model that would have significantly increased costs for heavy users. Instead, Anthropic is "pausing" this change.
New AI Framework Boosts Performance by 2.5 Times with Less Computation
Researchers have developed a new AI framework called Arbor that significantly improves the performance of AI-driven research and optimization. This breakthrough is expected to revolutionize the way companies use AI to automate complex engineering tasks.
Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks
Xiaomi's MiMo team has released an open-source AI coding assistant called MiMo Code, which outperforms Anthropic's Claude Code on long tasks. MiMo Code uses a unique approach to remember and recall context, making it a more reliable coding partner.
Adobe Introduces AI Agents for Creative Cloud Apps
Adobe is adding AI agents to its popular Creative Cloud apps, including Photoshop and Premiere. This new feature will allow users to describe what they want, and the software will handle the work.
Powerful AI Model Claude Fable Can't Answer Simple Biology Questions
A new version of the AI model Claude Fable has been released, but it has a surprising weakness in answering basic biology questions. This raises questions about the limits of artificial intelligence.
Google's Gemini-SQL2 Surpasses Text-to-SQL Benchmarks with Record Accuracy
Google's AI research has achieved a breakthrough in converting natural language into executable SQL queries. Gemini-SQL2 outperforms previous models by a significant margin, with 80.04 percent accuracy on the BIRD benchmark.
Google Seeks to Revamp Smart Speakers with New AI Tech
Google is trying to give its smart speakers a fresh start with a new AI-powered assistant. The goal is to make interactions feel more natural and conversational.
Anthropic Releases Most Powerful AI Model Yet, Claude Fable 5
Anthropic, a leading AI company, has just announced its most powerful AI model yet, Claude Fable 5. This new model surpasses its predecessors in areas like software engineering, knowledge work, and vision.
What It's Like to Work with the Latest AI Breakthrough: Mythos
Get ready to meet the latest AI sensation: Mythos. This revolutionary technology is being hailed as a major leap forward in AI capabilities, and its impact is being felt across industries. But what does it actually do?
Google Cloud's New Format Turns Scattered Docs into AI-Friendly Files
Google Cloud has created a new format that helps organize scattered knowledge into a format that's easily understood by AI agents. This format is called the Open Knowledge Format (OKF) and it turns documents into Markdown files.
Open model Kimi K2.7 Code undercuts GPT-5.5 and Claude by up to 12x on price per token
Moonshot AI's new open-weights model, Kimi K2.7 Code, outperforms GPT-5.5 and Claude in price, not necessarily quality. This affordable model is built for programming and offers an alternative to more expensive options.
Anthropic Releases Claude Fable 5: A Major AI Model Update
Anthropic, a leading AI research company, has released an updated version of its AI model, Claude Fable 5. This new version promises significant improvements in language understanding and generation capabilities. The update aims to enhance the model's ability to learn from human feedback and adapt to new tasks.
GPT-5.5 Beats Claude Fable 5 in AI Benchmark Challenge
** Researchers from the University of California, Berkeley, have created a new AI benchmark that tests models on real-world professional workflows. OpenAI's GPT-5.5 has beaten Anthropic's Claude Fable 5 on this challenging test.