Featured Newsletter

BestBlogs Issue #86: Infrastructure

Hey there! Welcome to BestBlogs.dev Issue #86.

One word kept surfacing across every layer this week: infrastructure. On the tenth anniversary of AlphaGo, Demis Hassabis wrote a personal retrospective tracing the arc from Go to protein folding and mathematical discovery, then laid out a clear AGI roadmap: Gemini's multimodal perception is merging with AlphaGo's logical planning, evolving AI from a tool into an "AI co-scientist." On the application side, OpenClaw has officially surpassed React as the most-starred project in GitHub history—no longer just an open-source tool, but an agent operating system sinking into foundational infrastructure. From a solo developer's six-layer governance model, to three generations of enterprise code review, to Jensen Huang's AI "five-layer cake," this week's content collectively answers one question: when AI coding becomes table stakes, your real competitive edge comes from the infrastructure you build.

This week I focused on assembling a personal content workflow using Skills—connecting the full pipeline from content ingestion, curation, deep reading, persona-based content creation, multi-platform publishing, to analytics. The goal is to upgrade fragmented information consumption into a content operating system with a feedback loop. Still iterating, but I can already feel the qualitative shift that comes from wiring tools into a system—which resonates deeply with the core insight running through this week's articles.

Here are 10 highlights worth your attention this week:

🏆 On the tenth anniversary of AlphaGo, Google DeepMind co-founder Demis Hassabis wrote a personal reflection on the Move 37 moment and its decade-long impact. The real legacy isn't beating a human champion—it's validating a general search-and-reasoning methodology that was then transplanted into AlphaFold , FunSearch , and chip design. His AGI roadmap is clear: Gemini's multimodal perception combined with AlphaGo's logical planning, pushing AI from tool to autonomous "co-scientist."

🔮 Two major foundation models debuted this week. Gemini Embedding 2 is Google's first native multimodal embedding model, unifying text, image, audio, and video into a single vector space with 100+ language support and MRL flexible dimension compression—a critical upgrade for multimodal RAG architectures. NVIDIA Nemotron 3 Super fills the open-source gap for agentic reasoning: 120B parameters, 1M context length, and a Mamba-Transformer hybrid architecture delivering 5x throughput gains, making it the best open-source choice for complex, long-horizon multi-agent tasks.

🤖 Two foundational agent component studies are worth bookmarking. Tongyi Lab's open-source Mobile-Agent-v3.5 achieves unified GUI automation across desktop, mobile, and browser through a hybrid data flywheel and reinforcement learning, hitting open-source SOTA on 20+ benchmarks. Microsoft Research's PlugMem distills agent interaction history into structured facts and reusable skills, delivering higher-quality decision context with fewer tokens, outperforming traditional retrieval methods in dialogue and web browsing scenarios.

🦞 Professor Hung-Yi Lee's video "Dissecting the Lobster" deconstructs AI Agent architecture with textbook clarity: system prompts for identity, RAG and compression for breaking context limits, heartbeat mechanisms for 24/7 autonomous operation, and Sub-Agent orchestration for complex task decomposition. Tencent Engineering's hands-on guide complements this with a complete deployment path from hardware selection to multi-agent coordination, along with a key safety warning: any system pursuing high autonomy must plan for the worst-case scenario of full data exposure. Theory meets practice—together they form the best entry point for understanding the OpenClaw ecosystem.

🏗️ After six months of intensive Claude Code usage, Tw93 distilled a six-layer governance model: CLAUDE.md, Tools/MCP, Skills, Hooks, Subagents, and Verifiers. The core insight: agent failures rarely stem from insufficient model capability—they come from context pollution, tool redundancy, and lack of deterministic constraints. HackerNoon's "Scalability Triangle" offers a complementary decision framework—MCP handles dynamic data interaction, Subagents handle task isolation and model routing, Skills handle static knowledge injection—with clear boundaries to prevent over-engineering. Read together, these two pieces are the most systematic treatment of Claude Code engineering practices available.

⚡ OpenAI's Build Hour and Dewu's Spec Coding case study showcase production-grade agent engineering from two perspectives. OpenAI proposes Harness Engineering with seven readability metrics, arguing that embedding agent.md rules in the codebase lets AI independently ship PRs. Dewu's team validated their three-layer specification system (Rules/Code/UI) with 2,754 real tool calls over 10 days: the 36% efficiency gain required systematic upfront investment in specs. The article also candidly documents where AI breaks down in complex CI environments—that honesty makes this field report even more valuable.

🔍 Kuaishou's intelligent Code Review is this week's most instructive enterprise case study. Three generations of architecture evolution—from LLM heuristics to knowledge engine plus deterministic rules, then to agentic autonomous decision-making—pushed code review adoption from 7.9% to 54%, cutting MR turnaround time by nearly 10%. The breakthrough: building 1,100+ hard rules to eliminate AI hallucinations, achieving a paradigm shift from personal assistant to organization-level collaborator. This evolution path offers direct lessons for any team pushing AI engineering into production.

🌐 Founder Park surveyed 500+ OpenClaw -related products on Product Hunt from February, spanning cloud hosting, Skill development, Agent social networks, and competitors. An entire ecosystem has emerged without top-down planning—completely bottom-up. OpenClaw isn't just a tool anymore; it's an operating-system-level platform. Meanwhile, LangChain argues that as implementation costs plummet, the software development bottleneck is shifting from building to reviewing. The talent landscape will bifold into full-stack "builders" and architecture-focused "reviewers," with product sense becoming the core competency across all roles.

🎨 Three articles interrogate the human position in the AI era from different angles. A YC design expert's retrospective on Vibe Coded websites reveals the homogeneity trap: over-reliance on LLMs produces cookie-cutter fade-in animations—AI is an execution lever, not a substitute for thinking. Elys founder Tristan offers another dimension: a person's soul is the sum of all their context, and AI social products must anchor one end to real humans—memory slots and entropy reduction are the true technical moats. Read together, they point to the same conclusion: the more powerful the tools, the more precious human judgment becomes.

📈 Four macro pieces paint a panoramic view of AI. Jensen Huang's bylined essay deconstructs AI into a five-layer cake from energy to applications, arguing that open-source models are the catalyst activating full-stack demand. a16z's top-100 consumer AI report identifies personal memory as the next core moat. "2026 Letter to AI Founders" draws on the printing press, electric motor, and cloud computing to derive a law of profit conservation—when implementation stops being the bottleneck, value migrates to architectural judgment and product intuition. And a 70-page solo PPT deck delivers the most comprehensive data review of the Q1 2026 US-China AI landscape.

Hope this issue sparks some new ideas. Stay curious, and see you next week!

Subscribe Now

1Gemini Embedding 2: Our first natively multimodal embedding model
2Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning
3Alibaba Tongyi Lab Open-Sources Mobile-Agent-v3.5: A Truly "Multi-Platform Ready" Native GUI Agent Foundation Model
4PlugMem: A single memory system adaptable across AI agent tasks
5AlphaGo at 10: How AI Innovation Is Paving the Path to AGI
6Dissecting OpenClaw: Understanding the Inner Workings of AI Agents
7What You Don't Know About Claude Code: Architecture, Governance, and Engineering Practices - Tw93
8Build Hour: API & Codex
9The Extensibility Triangle That Stopped Me Over-Engineering Claude Code
10Exploring the Boundaries of AI Programming: Spec Coding Project Practice Based on Claude Code | Dewu Tech
11From 7.9% to 54% Adoption: The Three-Stage Evolution of Kuaishou's Intelligent Code Review
12Mastering OpenClaw: Core Architecture, Working Principles, and Agent Deployment Steps
13How Coding Agents Are Reshaping Engineering， Product and Design
14The OpenClaw Ecosystem is Exploding: We Deconstructed Over 40 Related Products on Product Hunt
15A Conversation with Elys Founder Tristan: The Human Soul is the Sum of All Context; We Have Never Been Truly Connected
16Design Experts Review Vibe Coded Websites
17Full Version | Jensen Huang's Rare Signed Article: AI is a Five-Layer Cake
18The Top 100 Consumer AI Apps | The a16z Show
192026: Generosity, Cruelty, and the Fog—A Letter to AI Entrepreneurs
20Vol.89 AI Industry 2025 Annual Summary Supplement (V4 "Can't Wait" Edition) --- 70-page PPT Solo

Gemini Embedding 2: Our first natively multimodal embedding model

The Keyword (blog.google)

blog.google

03-10

551 words · 3 min

Gemini Embedding 2: Our first natively multimodal embedding model

Gemini Embedding 2 introduces a unified embedding space for text, images, video, and audio. Key features include support for 100+ languages, interleaved multimodal inputs, and flexible dimensions via MRL. It is a powerful upgrade for developers seeking SOTA performance in multimodal RAG and data management.

Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning

NVIDIA Technical Blog

developer.nvidia.com

03-11

2283 words · 10 min

Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning

NVIDIA's Nemotron 3 Super is a 120B MoE model optimized for the agentic era. Utilizing a Hybrid Mamba-Transformer architecture, it supports a 1M context window while delivering 5x higher throughput via Latent MoE and Multi-token Prediction. It bridges the gap between efficiency and reasoning depth, offering an open-source, high-performance solution for complex multi-step workflows and autonomous software development.

Alibaba Tongyi Lab Open-Sources Mobile-Agent-v3.5: A Truly "Multi-Platform Ready" Native GUI Agent Foundation Model

魔搭ModelScope社区

mp.weixin.qq.com

03-08

2771 words · 12 min

Alibaba Tongyi Lab Open-Sources Mobile-Agent-v3.5: A Truly "Multi-Platform Ready" Native GUI Agent Foundation Model

Tongyi Lab released GUI-Owl-1.5, a native GUI Agent family for desktop, mobile, and browser platforms. By leveraging a Hybrid Data Flywheel and MRPO reinforcement learning, it overcomes stability issues in long-range tasks. Achieving SOTA results across 20+ benchmarks like OSWorld, it offers both Instruct and Thinking variants to support edge-to-cloud collaborative deployment, serving as a powerful open-source foundation for GUI automation.

PlugMem: A single memory system adaptable across AI agent tasks

Microsoft Research Blog

microsoft.com

03-10

967 words · 4 min

PlugMem: A single memory system adaptable across AI agent tasks

Microsoft Research presents PlugMem, a general-purpose memory module designed to fix "memory redundancy" in AI agents. By distilling interaction histories into structured knowledge units, it enables high-efficiency, low-cost knowledge reuse. Benchmarks show PlugMem outperforms traditional retrieval methods in QA and web navigation, delivering higher utility for decision-making while consuming significantly fewer context tokens.

AlphaGo at 10: How AI Innovation Is Paving the Path to AGI

Google DeepMind Blog

deepmind.google

03-10

1356 words · 6 min

AlphaGo at 10: How AI Innovation Is Paving the Path to AGI

Demis Hassabis reflects on a decade of AI progress since AlphaGo, highlighting its transition from game mastery to scientific breakthroughs like AlphaFold. The core insight is that the search and reasoning principles of AlphaGo are now being integrated into Gemini. This strategic combination aims to build AGI capable of independent scientific hypothesis and solving complex, open-ended global challenges.

Dissecting OpenClaw: Understanding the Inner Workings of AI Agents

Hung-yi Lee

youtube.com

03-09

4207 words · 17 min

Dissecting OpenClaw: Understanding the Inner Workings of AI Agents

Professor Hung-yi Lee dissects the engineering of AI Agents: establishing identity via system prompts, managing long-term memory through RAG and context compaction, and enabling 24/7 autonomy with heartbeat mechanisms. The content highlights core workflows like tool calling and sub-agent collaboration while providing practical security advice on permission and physical isolation.

What You Don't Know About Claude Code: Architecture, Governance, and Engineering Practices - Tw93

Tw93 Blog

tw93.fun

03-12

10004 words · 41 min

What You Don't Know About Claude Code: Architecture, Governance, and Engineering Practices - Tw93

This article provides a deep dive into developer Tw93’s six-month journey with Claude Code. Moving beyond mere prompt engineering, the author introduces a systematic six-layer architectural model—comprising CLAUDE.md, Tools, Skills, Hooks, Subagents, and Verifiers. It offers profound insights into managing context pollution, optimizing Prompt Caching for performance, and establishing verification loops to ensure reliable outputs. For developers aiming to push the boundaries of AI agents and transition from "chatting with AI" to "engineering with agents," this post serves as an essential, high-value technical roadmap.

Build Hour: API & Codex

OpenAI

youtube.com

03-10

14614 words · 59 min

OpenAI and Basis reveal the architecture of Agent-driven development. Key takeaways: GPT 5.4's computer use primitives, the seven metrics of Harness Engineering for codebase legibility, and using agent.md to eliminate "AI Slop." The bottom line: Stop treating AI as a typist; treat it as an engineering partner capable of autonomous PR delivery.

The Extensibility Triangle That Stopped Me Over-Engineering Claude Code

HackerNoon

hackernoon.com

03-12

3096 words · 13 min

This piece distills Claude Code’s extensibility into a simple logic: use MCP for external data, Subagents for task isolation and cost-saving model routing, and Skills for static knowledge injection. It offers a practical "decision function" to help developers avoid common anti-patterns—like hardcoding static text in MCP or querying databases via Skills—ensuring an optimal balance between performance and cost.

Exploring the Boundaries of AI Programming: Spec Coding Project Practice Based on Claude Code | Dewu Tech

得物技术

mp.weixin.qq.com

03-11

7366 words · 30 min

Exploring the Boundaries of AI Programming: Spec Coding Project Practice Based on Claude Code | Dewu Tech

Based on a 10-day trial with 2,754 tool calls, this post explains how the Spec Coding workflow redefines AI development efficiency. The key lies in eliminating AI uncertainty through a three-layer hierarchy (Rules/Code/UI) and using MCP tools to connect external data. It shares a 36% efficiency gain while objectively analyzing AI failure modes in complex CI environments and hidden dependencies. A must-read for advanced AI coding.

From 7.9% to 54% Adoption: The Three-Stage Evolution of Kuaishou's Intelligent Code Review

快手技术

mp.weixin.qq.com

03-09

9149 words · 37 min

From 7.9% to 54% Adoption: The Three-Stage Evolution of Kuaishou's Intelligent Code Review

Kuaishou's Intelligent CR: Boosting adoption from 7.9% to 54% via a 3-gen evolution (Heuristic -> Knowledge-driven -> Agentic). Key innovations include a context-aware engine and 1,100+ rules to eliminate AI hallucinations. It demonstrates how to effectively shorten review cycles by 10% through a hybrid of deterministic engineering and autonomous AI agents.

Mastering OpenClaw: Core Architecture, Working Principles, and Agent Deployment Steps

腾讯技术工程

mp.weixin.qq.com

03-09

10592 words · 43 min

Mastering OpenClaw: Core Architecture, Working Principles, and Agent Deployment Steps

A complete OpenClaw handbook covering everything from hardware selection (Mac Mini) to multi-agent orchestration. It dives into memory mechanisms, skill management, and task allocation, featuring real-world cases like automated research and local ComfyUI integration. Critically, it warns users to treat local data security with extreme caution given the framework's operational risks.

How Coding Agents Are Reshaping Engineering， Product and Design

LangChain Blog

blog.langchain.com

03-10

1992 words · 8 min

How Coding Agents Are Reshaping Engineering， Product and Design

This article explores how AI coding agents are fundamentally reshaping the collaboration between Engineering, Product, and Design (EPD). The author argues that as implementation becomes "cheap," the development bottleneck shifts from writing code to reviewing it and applying systems thinking. The traditional waterfall PRD process is being replaced by prototype-led iterations. Future roles will likely bifurcate into versatile "Builders" or high-level "Reviewers," making "product sense" a mandatory skill for everyone. It is a profound framework for tech professionals looking to adapt their skill sets in an AI-accelerated landscape.

The OpenClaw Ecosystem is Exploding: We Deconstructed Over 40 Related Products on Product Hunt

Founder Park

mp.weixin.qq.com

03-08

5128 words · 21 min

The OpenClaw Ecosystem is Exploding: We Deconstructed Over 40 Related Products on Product Hunt

OpenClaw has evolved beyond a repository into an OS-level ecosystem reshaping the internet. This report covers five key sectors—social, memory, deployment, security, and hardware—showing how Agents rapidly scale via SKILL.md plugins. It captures the essential shift toward an autonomous Agent-to-Agent commercial loop.

A Conversation with Elys Founder Tristan: The Human Soul is the Sum of All Context; We Have Never Been Truly Connected

Founder Park

mp.weixin.qq.com

03-06

12364 words · 50 min

Tristan, founder of Elys, reveals a new AI social paradigm: treating Context as the sum of a soul and implementing active memory through "memory slots." Key takeaways include the necessity of human involvement for authenticity, using AI to achieve "entropy reduction" in human connections, and the leap from emotional companionship to high-dimensional social networking. A direct look at the technical bottlenecks and aesthetics of AI-era social products.

Design Experts Review Vibe Coded Websites

Y Combinator

youtube.com

03-06

11889 words · 48 min

Design Experts Review Vibe Coded Websites

This video offers a sharp critique of the "sameness" in AI-driven startup designs. YC experts highlight how relying solely on LLMs leads to redundant animations, generic "Bento box" layouts, and broken UX patterns like scroll hijacking. The key message: AI is a tool for execution, not a replacement for strategy. Founders must maintain editorial control to ensure their landing pages reflect unique brand identities rather than generic AI-generated templates.

Full Version | Jensen Huang's Rare Signed Article: AI is a Five-Layer Cake

Web3天空之城

mp.weixin.qq.com

03-11

2224 words · 9 min

Full Version | Jensen Huang's Rare Signed Article: AI is a Five-Layer Cake

Jensen Huang defines AI through a "five-layer cake" architecture: Energy, Chips, Infrastructure, Models, and Applications. He highlights the transition from static software to real-time intelligence, driving a multi-trillion-dollar global investment in "AI factories." Notably, he views open-source models like DeepSeek-R1 as key drivers for infrastructure demand.

The Top 100 Consumer AI Apps | The a16z Show

a16z

youtube.com

03-10

14800 words · 60 min

The Top 100 Consumer AI Apps | The a16z Show

The a16z report highlights AI's evolution from simple text prompts to sophisticated Agents and desktop ecosystems. While ChatGPT maintains dominance, Claude and Gemini are carving out niches in prosumer and creative segments. Key takeaways: the surge of consumer-grade agents, the battle for AI-native browser entry points, and "Memory" emerging as the ultimate competitive moat.

2026: Generosity, Cruelty, and the Fog—A Letter to AI Entrepreneurs

十字路口Crossing

mp.weixin.qq.com

03-07

9463 words · 38 min

2026: Generosity, Cruelty, and the Fog—A Letter to AI Entrepreneurs

Authored by AI builder JiaYuan, this article provides a profound analysis of the "commoditization" of coding in 2026. By chronicling the paradigm shift experienced by figures like Andrej Karpathy, the author connects current AI trends with historical cycles—from the printing press to the electric motor and AWS. The core thesis applies the "Law of Conservation of Attractive Profits," arguing that as implementation costs hit near-zero, value migrates toward system architecture, product intuition, and user empathy. It is a sobering strategic guide for entrepreneurs navigating the "Hell Mode" of competition, emphasizing that the bottleneck is no longer "how to build," but "what to build."

Vol.89 AI Industry 2025 Annual Summary Supplement (V4 "Can't Wait" Edition) --- 70-page PPT Solo

屠龙之术

xiaoyuzhoufm.com

03-09

31173 words · 125 min

Vol.89 AI Industry 2025 Annual Summary Supplement (V4 "Can't Wait" Edition) --- 70-page PPT Solo

Veteran investor Zhuang Minghao delivers a high-density breakdown of the global AI landscape in early 2026. This session covers the evolution of US Capex narratives, NVIDIA’s market dominance, the rise of "OpenClaw" and Agentic AI, and the shifting valuations of Chinese AI unicorns like Moonshot and Zhipu post-Spring Festival. It offers a professional synthesis of market data and critical reflections on the "AI eating software" discourse. A must-watch/read for professionals looking to synchronize with the latest AI investment logic and technical trends of Q1 2026.

BestBlogs Issue #86: Infrastructure

Contents