News

Xiaomi open-sources MiMo Code, claims edge over Claude on long coding tasks

carl.franzen@venturebeat.com (Carl Franzen)Jun 13, 2026 · 2 days ago

Xiaomi has open-sourced MiMo Code V0.1.0, a terminal-native AI coding assistant that claims to outperform Anthropic's Claude Code on long-horizon, multi-step coding tasks (200+ steps) according to internal benchmarks. The tool uses a cross-session memory system with SQLite FTS5 to retain context across extended work sessions, addressing a core limitation of existing AI coding agents. Xiaomi is also offering limited free access to MiMo-V2.5, its flagship model with a million-token context window.

TL;DR

Xiaomi released MiMo Code V0.1.0 under MIT license on GitHub, installable via single terminal command
MiMo Code scored 82% on SWE-bench Verified, 62% on SWE-bench Pro, and 73% on Terminal Bench 2, versus Claude Code's 79%, 55%, and 69% respectively
Core innovation is a four-layer cross-session memory system using SQLite FTS5 that prevents context loss during long coding sessions
System deploys independent checkpoint-writer subagent to maintain project state while main agent continues work, plus /dream command for periodic session compression

Why It Matters

AI coding agents have struggled with context degradation during extended sessions, forcing developers to repeatedly re-explain project context. MiMo Code's persistent memory architecture addresses this fundamental limitation through structured checkpoints and independent note-taking, potentially enabling more productive long-horizon coding workflows. The open-source release and claimed performance gains over Claude Code signal meaningful progress in agentic AI coding capabilities.

Business Impact

For development teams, reduced context loss means fewer interruptions and re-explanations during complex multi-step tasks, potentially improving developer productivity on large projects. Xiaomi's open-source approach and free tier access to MiMo-V2.5 create competitive pressure on commercial offerings from Anthropic and OpenAI while building developer adoption in the coding AI space.

Key Implications

Persistent memory architectures may become table stakes for AI coding agents, shifting competition from raw model capability to agent system design
Xiaomi's entry into open-source AI coding tools expands the competitive landscape beyond Anthropic and OpenAI, particularly for developers seeking self-hosted or cost-free options
The checkpoint-writer subagent pattern demonstrates a viable approach to solving context window limitations without requiring larger models, potentially applicable to other agentic AI systems

What to Watch

Monitor whether MiMo Code's performance claims hold up in independent testing and real-world developer adoption. Track whether Anthropic and OpenAI respond with similar memory architectures in Claude Code and other offerings. Observe whether Xiaomi's approach to session compression and long-term memory distillation becomes a standard pattern in the agentic AI coding space.

AI Agents Model Releases Open Source Coding / Dev Tools

Our Briefing

Weekly signal. No noise. Built for founders, operators, and AI-curious professionals.

No spam. Unsubscribe any time.

Artificial Analysis released AgentPerf, the first benchmark designed specifically for agentic AI workloads, showing NVIDIA's Blackwell Ultra NVL72 platform delivering 20x more agents per megawatt than Hopper-based systems. The benchmark reflects the fundamentally different performance characteristics of agentic AI, which chains dozens to hundreds of LLM calls with tool execution rather than single-turn completions. Results are based on real coding agent trajectories across 12+ programming languages, providing infrastructure providers and enterprises with direct metrics for deployment decisions.

by Shruti Koparkar1 day ago· NVIDIA Blog (AI)

AI AgentsNews

PixelRAG bypasses text parsing, cuts RAG costs 10x

Researchers from UC Berkeley, Princeton, EPFL, and Databricks introduced PixelRAG, a retrieval system that bypasses traditional text parsing by rendering web pages as screenshots and indexing them directly for vision-language models. Tested on 30 million Wikipedia screenshot tiles, PixelRAG improved accuracy by up to 18.1% over text-based RAG systems and reduced token costs by 10x. The approach addresses fundamental information loss in conventional HTML-to-text conversion pipelines.

2 days ago· VentureBeat AI

AI AgentsTrendingNews

NanoClaw and JFrog Block Malicious Code from AI Agents

NanoClaw and JFrog have launched an integration that routes autonomous AI agents through vetted software registries to block malicious code downloads. The system acts as an automated immune system, intercepting compromised packages and guiding agents to approved alternatives. The partnership offers free access for open-source users and commercial licensing for enterprises, addressing a growing security gap as AI agents autonomously install packages without human oversight.

by carl.franzen@venturebeat.com (Carl Franzen)2 days ago· VentureBeat AI

AI AgentsTrendingNews

Google's 'Faithful Uncertainty' Lets LLMs Hedge Instead of Hallucinate

Google researchers propose 'faithful uncertainty,' a technique that allows large language models to express qualified guesses rather than either confidently hallucinating or refusing to answer. The approach reframes hallucinations as 'confident errors' and enables models to hedge responses appropriately, preserving utility while maintaining trustworthiness. This addresses a core tradeoff in LLM deployment where eliminating factual errors typically forces models to abstain from answering questions they actually know.

by bendee983@gmail.com (Ben Dickson)2 days ago· VentureBeat AI

Xiaomi open-sources MiMo Code, claims edge over Claude on long coding tasks

TL;DR

Why It Matters

Business Impact

Key Implications

What to Watch

Our Briefing

NVIDIA Blackwell Leads First Agentic AI Benchmark

PixelRAG bypasses text parsing, cuts RAG costs 10x

NanoClaw and JFrog Block Malicious Code from AI Agents

Google's 'Faithful Uncertainty' Lets LLMs Hedge Instead of Hallucinate

Related stories

NVIDIA Blackwell Leads First Agentic AI Benchmark

PixelRAG bypasses text parsing, cuts RAG costs 10x

NanoClaw and JFrog Block Malicious Code from AI Agents

Google's 'Faithful Uncertainty' Lets LLMs Hedge Instead of Hallucinate