VFF - The signal in the noise
News

Xiaomi open-sources MiMo Code, claims edge over Claude on long coding tasks

Read original
Share
Xiaomi open-sources MiMo Code, claims edge over Claude on long coding tasks

Xiaomi has open-sourced MiMo Code V0.1.0, a terminal-native AI coding assistant that claims to outperform Anthropic's Claude Code on long-horizon, multi-step coding tasks (200+ steps) according to internal benchmarks. The tool uses a cross-session memory system with SQLite FTS5 to retain context across extended work sessions, addressing a core limitation of existing AI coding agents. Xiaomi is also offering limited free access to MiMo-V2.5, its flagship model with a million-token context window.

  • Xiaomi released MiMo Code V0.1.0 under MIT license on GitHub, installable via single terminal command
  • MiMo Code scored 82% on SWE-bench Verified, 62% on SWE-bench Pro, and 73% on Terminal Bench 2, versus Claude Code's 79%, 55%, and 69% respectively
  • Core innovation is a four-layer cross-session memory system using SQLite FTS5 that prevents context loss during long coding sessions
  • System deploys independent checkpoint-writer subagent to maintain project state while main agent continues work, plus /dream command for periodic session compression

AI coding agents have struggled with context degradation during extended sessions, forcing developers to repeatedly re-explain project context. MiMo Code's persistent memory architecture addresses this fundamental limitation through structured checkpoints and independent note-taking, potentially enabling more productive long-horizon coding workflows. The open-source release and claimed performance gains over Claude Code signal meaningful progress in agentic AI coding capabilities.

For development teams, reduced context loss means fewer interruptions and re-explanations during complex multi-step tasks, potentially improving developer productivity on large projects. Xiaomi's open-source approach and free tier access to MiMo-V2.5 create competitive pressure on commercial offerings from Anthropic and OpenAI while building developer adoption in the coding AI space.

  • Persistent memory architectures may become table stakes for AI coding agents, shifting competition from raw model capability to agent system design
  • Xiaomi's entry into open-source AI coding tools expands the competitive landscape beyond Anthropic and OpenAI, particularly for developers seeking self-hosted or cost-free options
  • The checkpoint-writer subagent pattern demonstrates a viable approach to solving context window limitations without requiring larger models, potentially applicable to other agentic AI systems

Monitor whether MiMo Code's performance claims hold up in independent testing and real-world developer adoption. Track whether Anthropic and OpenAI respond with similar memory architectures in Claude Code and other offerings. Observe whether Xiaomi's approach to session compression and long-term memory distillation becomes a standard pattern in the agentic AI coding space.

Share

Our Briefing

Weekly signal. No noise. Built for founders, operators, and AI-curious professionals.

No spam. Unsubscribe any time.

Related stories

NVIDIA Blackwell Leads First Agentic AI Benchmark
TrendingNews

NVIDIA Blackwell Leads First Agentic AI Benchmark

Artificial Analysis released AgentPerf, the first benchmark designed specifically for agentic AI workloads, showing NVIDIA's Blackwell Ultra NVL72 platform delivering 20x more agents per megawatt than Hopper-based systems. The benchmark reflects the fundamentally different performance characteristics of agentic AI, which chains dozens to hundreds of LLM calls with tool execution rather than single-turn completions. Results are based on real coding agent trajectories across 12+ programming languages, providing infrastructure providers and enterprises with direct metrics for deployment decisions.

by Shruti Koparkar· NVIDIA Blog (AI)
PixelRAG bypasses text parsing, cuts RAG costs 10x

PixelRAG bypasses text parsing, cuts RAG costs 10x

Researchers from UC Berkeley, Princeton, EPFL, and Databricks introduced PixelRAG, a retrieval system that bypasses traditional text parsing by rendering web pages as screenshots and indexing them directly for vision-language models. Tested on 30 million Wikipedia screenshot tiles, PixelRAG improved accuracy by up to 18.1% over text-based RAG systems and reduced token costs by 10x. The approach addresses fundamental information loss in conventional HTML-to-text conversion pipelines.

· VentureBeat AI
NanoClaw and JFrog Block Malicious Code from AI Agents
TrendingNews

NanoClaw and JFrog Block Malicious Code from AI Agents

NanoClaw and JFrog have launched an integration that routes autonomous AI agents through vetted software registries to block malicious code downloads. The system acts as an automated immune system, intercepting compromised packages and guiding agents to approved alternatives. The partnership offers free access for open-source users and commercial licensing for enterprises, addressing a growing security gap as AI agents autonomously install packages without human oversight.

by carl.franzen@venturebeat.com (Carl Franzen)· VentureBeat AI
Google's 'Faithful Uncertainty' Lets LLMs Hedge Instead of Hallucinate
TrendingNews

Google's 'Faithful Uncertainty' Lets LLMs Hedge Instead of Hallucinate

Google researchers propose 'faithful uncertainty,' a technique that allows large language models to express qualified guesses rather than either confidently hallucinating or refusing to answer. The approach reframes hallucinations as 'confident errors' and enables models to hedge responses appropriately, preserving utility while maintaining trustworthiness. This addresses a core tradeoff in LLM deployment where eliminating factual errors typically forces models to abstain from answering questions they actually know.

by bendee983@gmail.com (Ben Dickson)· VentureBeat AI