Skip to main content
Featured Newsletter

BestBlogs Issue #86: Infrastructure

Hey there! Welcome to BestBlogs.dev Issue #86.

One word kept surfacing across every layer this week: infrastructure. On the tenth anniversary of AlphaGo, Demis Hassabis wrote a personal retrospective tracing the arc from Go to protein folding and mathematical discovery, then laid out a clear AGI roadmap: Gemini's multimodal perception is merging with AlphaGo's logical planning, evolving AI from a tool into an "AI co-scientist." On the application side, OpenClaw has officially surpassed React as the most-starred project in GitHub history—no longer just an open-source tool, but an agent operating system sinking into foundational infrastructure. From a solo developer's six-layer governance model, to three generations of enterprise code review, to Jensen Huang's AI "five-layer cake," this week's content collectively answers one question: when AI coding becomes table stakes, your real competitive edge comes from the infrastructure you build.

This week I focused on assembling a personal content workflow using Skills—connecting the full pipeline from content ingestion, curation, deep reading, persona-based content creation, multi-platform publishing, to analytics. The goal is to upgrade fragmented information consumption into a content operating system with a feedback loop. Still iterating, but I can already feel the qualitative shift that comes from wiring tools into a system—which resonates deeply with the core insight running through this week's articles.

Here are 10 highlights worth your attention this week:

🏆 On the tenth anniversary of AlphaGo, Google DeepMind co-founder Demis Hassabis wrote a personal reflection on the Move 37 moment and its decade-long impact. The real legacy isn't beating a human champion—it's validating a general search-and-reasoning methodology that was then transplanted into AlphaFold , FunSearch , and chip design. His AGI roadmap is clear: Gemini's multimodal perception combined with AlphaGo's logical planning, pushing AI from tool to autonomous "co-scientist."

🔮 Two major foundation models debuted this week. Gemini Embedding 2 is Google's first native multimodal embedding model, unifying text, image, audio, and video into a single vector space with 100+ language support and MRL flexible dimension compression—a critical upgrade for multimodal RAG architectures. NVIDIA Nemotron 3 Super fills the open-source gap for agentic reasoning: 120B parameters, 1M context length, and a Mamba-Transformer hybrid architecture delivering 5x throughput gains, making it the best open-source choice for complex, long-horizon multi-agent tasks.

🤖 Two foundational agent component studies are worth bookmarking. Tongyi Lab's open-source Mobile-Agent-v3.5 achieves unified GUI automation across desktop, mobile, and browser through a hybrid data flywheel and reinforcement learning, hitting open-source SOTA on 20+ benchmarks. Microsoft Research's PlugMem distills agent interaction history into structured facts and reusable skills, delivering higher-quality decision context with fewer tokens, outperforming traditional retrieval methods in dialogue and web browsing scenarios.

🦞 Professor Hung-Yi Lee's video "Dissecting the Lobster" deconstructs AI Agent architecture with textbook clarity: system prompts for identity, RAG and compression for breaking context limits, heartbeat mechanisms for 24/7 autonomous operation, and Sub-Agent orchestration for complex task decomposition. Tencent Engineering's hands-on guide complements this with a complete deployment path from hardware selection to multi-agent coordination, along with a key safety warning: any system pursuing high autonomy must plan for the worst-case scenario of full data exposure. Theory meets practice—together they form the best entry point for understanding the OpenClaw ecosystem.

🏗️ After six months of intensive Claude Code usage, Tw93 distilled a six-layer governance model: CLAUDE.md, Tools/MCP, Skills, Hooks, Subagents, and Verifiers. The core insight: agent failures rarely stem from insufficient model capability—they come from context pollution, tool redundancy, and lack of deterministic constraints. HackerNoon's "Scalability Triangle" offers a complementary decision framework—MCP handles dynamic data interaction, Subagents handle task isolation and model routing, Skills handle static knowledge injection—with clear boundaries to prevent over-engineering. Read together, these two pieces are the most systematic treatment of Claude Code engineering practices available.

⚡ OpenAI's Build Hour and Dewu's Spec Coding case study showcase production-grade agent engineering from two perspectives. OpenAI proposes Harness Engineering with seven readability metrics, arguing that embedding agent.md rules in the codebase lets AI independently ship PRs. Dewu's team validated their three-layer specification system (Rules/Code/UI) with 2,754 real tool calls over 10 days: the 36% efficiency gain required systematic upfront investment in specs. The article also candidly documents where AI breaks down in complex CI environments—that honesty makes this field report even more valuable.

🔍 Kuaishou's intelligent Code Review is this week's most instructive enterprise case study. Three generations of architecture evolution—from LLM heuristics to knowledge engine plus deterministic rules, then to agentic autonomous decision-making—pushed code review adoption from 7.9% to 54%, cutting MR turnaround time by nearly 10%. The breakthrough: building 1,100+ hard rules to eliminate AI hallucinations, achieving a paradigm shift from personal assistant to organization-level collaborator. This evolution path offers direct lessons for any team pushing AI engineering into production.

🌐 Founder Park surveyed 500+ OpenClaw -related products on Product Hunt from February, spanning cloud hosting, Skill development, Agent social networks, and competitors. An entire ecosystem has emerged without top-down planning—completely bottom-up. OpenClaw isn't just a tool anymore; it's an operating-system-level platform. Meanwhile, LangChain argues that as implementation costs plummet, the software development bottleneck is shifting from building to reviewing. The talent landscape will bifold into full-stack "builders" and architecture-focused "reviewers," with product sense becoming the core competency across all roles.

🎨 Three articles interrogate the human position in the AI era from different angles. A YC design expert's retrospective on Vibe Coded websites reveals the homogeneity trap: over-reliance on LLMs produces cookie-cutter fade-in animations—AI is an execution lever, not a substitute for thinking. Elys founder Tristan offers another dimension: a person's soul is the sum of all their context, and AI social products must anchor one end to real humans—memory slots and entropy reduction are the true technical moats. Read together, they point to the same conclusion: the more powerful the tools, the more precious human judgment becomes.

📈 Four macro pieces paint a panoramic view of AI. Jensen Huang's bylined essay deconstructs AI into a five-layer cake from energy to applications, arguing that open-source models are the catalyst activating full-stack demand. a16z's top-100 consumer AI report identifies personal memory as the next core moat. "2026 Letter to AI Founders" draws on the printing press, electric motor, and cloud computing to derive a law of profit conservation—when implementation stops being the bottleneck, value migrates to architectural judgment and product intuition. And a 70-page solo PPT deck delivers the most comprehensive data review of the Q1 2026 US-China AI landscape.

Hope this issue sparks some new ideas. Stay curious, and see you next week!

Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning

NVIDIA's Nemotron 3 Super is a 120B MoE model optimized for the agentic era. Utilizing a Hybrid Mamba-Transformer architecture, it supports a 1M context window while delivering 5x higher throughput via Latent MoE and Multi-token Prediction. It bridges the gap between efficiency and reasoning depth, offering an open-source, high-performance solution for complex multi-step workflows and autonomous software development.

魔搭ModelScope社区
mp.weixin.qq.com
03-08
2771 words · 12 min
93
Alibaba Tongyi Lab Open-Sources Mobile-Agent-v3.5: A Truly "Multi-Platform Ready" Native GUI Agent Foundation Model

Tongyi Lab released GUI-Owl-1.5, a native GUI Agent family for desktop, mobile, and browser platforms. By leveraging a Hybrid Data Flywheel and MRPO reinforcement learning, it overcomes stability issues in long-range tasks. Achieving SOTA results across 20+ benchmarks like OSWorld, it offers both Instruct and Thinking variants to support edge-to-cloud collaborative deployment, serving as a powerful open-source foundation for GUI automation.

Microsoft Research Blog
microsoft.com
03-10
967 words · 4 min
92
PlugMem: A single memory system adaptable across AI agent tasks

Microsoft Research presents PlugMem, a general-purpose memory module designed to fix "memory redundancy" in AI agents. By distilling interaction histories into structured knowledge units, it enables high-efficiency, low-cost knowledge reuse. Benchmarks show PlugMem outperforms traditional retrieval methods in QA and web navigation, delivering higher utility for decision-making while consuming significantly fewer context tokens.

Google DeepMind Blog
deepmind.google
03-10
1356 words · 6 min
92
AlphaGo at 10: How AI Innovation Is Paving the Path to AGI

Demis Hassabis reflects on a decade of AI progress since AlphaGo, highlighting its transition from game mastery to scientific breakthroughs like AlphaFold. The core insight is that the search and reasoning principles of AlphaGo are now being integrated into Gemini. This strategic combination aims to build AGI capable of independent scientific hypothesis and solving complex, open-ended global challenges.

Hung-yi Lee
youtube.com
03-09
4207 words · 17 min
93
Dissecting OpenClaw: Understanding the Inner Workings of AI Agents

Professor Hung-yi Lee dissects the engineering of AI Agents: establishing identity via system prompts, managing long-term memory through RAG and context compaction, and enabling 24/7 autonomy with heartbeat mechanisms. The content highlights core workflows like tool calling and sub-agent collaboration while providing practical security advice on permission and physical isolation.

Tw93 Blog
tw93.fun
03-12
10004 words · 41 min
94
What You Don't Know About Claude Code: Architecture, Governance, and Engineering Practices - Tw93

This article provides a deep dive into developer Tw93’s six-month journey with Claude Code. Moving beyond mere prompt engineering, the author introduces a systematic six-layer architectural model—comprising CLAUDE.md, Tools, Skills, Hooks, Subagents, and Verifiers. It offers profound insights into managing context pollution, optimizing Prompt Caching for performance, and establishing verification loops to ensure reliable outputs. For developers aiming to push the boundaries of AI agents and transition from "chatting with AI" to "engineering with agents," this post serves as an essential, high-value technical roadmap.

OpenAI
youtube.com
03-10
14614 words · 59 min
94
Build Hour: API & Codex

OpenAI and Basis reveal the architecture of Agent-driven development. Key takeaways: GPT 5.4's computer use primitives, the seven metrics of Harness Engineering for codebase legibility, and using agent.md to eliminate "AI Slop." The bottom line: Stop treating AI as a typist; treat it as an engineering partner capable of autonomous PR delivery.

HackerNoon
hackernoon.com
03-12
3096 words · 13 min
92

This piece distills Claude Code’s extensibility into a simple logic: use MCP for external data, Subagents for task isolation and cost-saving model routing, and Skills for static knowledge injection. It offers a practical "decision function" to help developers avoid common anti-patterns—like hardcoding static text in MCP or querying databases via Skills—ensuring an optimal balance between performance and cost.

得物技术
mp.weixin.qq.com
03-11
7366 words · 30 min
92
Exploring the Boundaries of AI Programming: Spec Coding Project Practice Based on Claude Code | Dewu Tech

Based on a 10-day trial with 2,754 tool calls, this post explains how the Spec Coding workflow redefines AI development efficiency. The key lies in eliminating AI uncertainty through a three-layer hierarchy (Rules/Code/UI) and using MCP tools to connect external data. It shares a 36% efficiency gain while objectively analyzing AI failure modes in complex CI environments and hidden dependencies. A must-read for advanced AI coding.

快手技术
mp.weixin.qq.com
03-09
9149 words · 37 min
93
From 7.9% to 54% Adoption: The Three-Stage Evolution of Kuaishou's Intelligent Code Review

Kuaishou's Intelligent CR: Boosting adoption from 7.9% to 54% via a 3-gen evolution (Heuristic -> Knowledge-driven -> Agentic). Key innovations include a context-aware engine and 1,100+ rules to eliminate AI hallucinations. It demonstrates how to effectively shorten review cycles by 10% through a hybrid of deterministic engineering and autonomous AI agents.

腾讯技术工程
mp.weixin.qq.com
03-09
10592 words · 43 min
92
Mastering OpenClaw: Core Architecture, Working Principles, and Agent Deployment Steps

A complete OpenClaw handbook covering everything from hardware selection (Mac Mini) to multi-agent orchestration. It dives into memory mechanisms, skill management, and task allocation, featuring real-world cases like automated research and local ComfyUI integration. Critically, it warns users to treat local data security with extreme caution given the framework's operational risks.

LangChain Blog
blog.langchain.com
03-10
1992 words · 8 min
93
How Coding Agents Are Reshaping Engineering, Product and Design

This article explores how AI coding agents are fundamentally reshaping the collaboration between Engineering, Product, and Design (EPD). The author argues that as implementation becomes "cheap," the development bottleneck shifts from writing code to reviewing it and applying systems thinking. The traditional waterfall PRD process is being replaced by prototype-led iterations. Future roles will likely bifurcate into versatile "Builders" or high-level "Reviewers," making "product sense" a mandatory skill for everyone. It is a profound framework for tech professionals looking to adapt their skill sets in an AI-accelerated landscape.

Founder Park
mp.weixin.qq.com
03-06
12364 words · 50 min
92

Tristan, founder of Elys, reveals a new AI social paradigm: treating Context as the sum of a soul and implementing active memory through "memory slots." Key takeaways include the necessity of human involvement for authenticity, using AI to achieve "entropy reduction" in human connections, and the leap from emotional companionship to high-dimensional social networking. A direct look at the technical bottlenecks and aesthetics of AI-era social products.

Y Combinator
youtube.com
03-06
11889 words · 48 min
92
Design Experts Review Vibe Coded Websites

This video offers a sharp critique of the "sameness" in AI-driven startup designs. YC experts highlight how relying solely on LLMs leads to redundant animations, generic "Bento box" layouts, and broken UX patterns like scroll hijacking. The key message: AI is a tool for execution, not a replacement for strategy. Founders must maintain editorial control to ensure their landing pages reflect unique brand identities rather than generic AI-generated templates.

a16z
youtube.com
03-10
14800 words · 60 min
94
The Top 100 Consumer AI Apps | The a16z Show

The a16z report highlights AI's evolution from simple text prompts to sophisticated Agents and desktop ecosystems. While ChatGPT maintains dominance, Claude and Gemini are carving out niches in prosumer and creative segments. Key takeaways: the surge of consumer-grade agents, the battle for AI-native browser entry points, and "Memory" emerging as the ultimate competitive moat.

十字路口Crossing
mp.weixin.qq.com
03-07
9463 words · 38 min
93
2026: Generosity, Cruelty, and the Fog—A Letter to AI Entrepreneurs

Authored by AI builder JiaYuan, this article provides a profound analysis of the "commoditization" of coding in 2026. By chronicling the paradigm shift experienced by figures like Andrej Karpathy, the author connects current AI trends with historical cycles—from the printing press to the electric motor and AWS. The core thesis applies the "Law of Conservation of Attractive Profits," arguing that as implementation costs hit near-zero, value migrates toward system architecture, product intuition, and user empathy. It is a sobering strategic guide for entrepreneurs navigating the "Hell Mode" of competition, emphasizing that the bottleneck is no longer "how to build," but "what to build."

屠龙之术
xiaoyuzhoufm.com
03-09
31173 words · 125 min
94
Vol.89 AI Industry 2025 Annual Summary Supplement (V4 "Can't Wait" Edition) --- 70-page PPT Solo

Veteran investor Zhuang Minghao delivers a high-density breakdown of the global AI landscape in early 2026. This session covers the evolution of US Capex narratives, NVIDIA’s market dominance, the rise of "OpenClaw" and Agentic AI, and the shifting valuations of Chinese AI unicorns like Moonshot and Zhipu post-Spring Festival. It offers a professional synthesis of market data and critical reflections on the "AI eating software" discourse. A must-watch/read for professionals looking to synchronize with the latest AI investment logic and technical trends of Q1 2026.

    BestBlogs Issue #86: Infrastructure | BestBlogs.dev