Featured Newsletter

BestBlogs Issue #73: Outliers and Regression

Hello everyone, and welcome to BestBlogs.dev Issue 73.

This week, I’ve given the BestBlogs newsletter template a fresh design. To help lighten reading load, I have strictly capped the selection at just 20 essential articles, each accompanied by a dedicated "Why Read" note. My hope is that these small tweaks will help you cut through the noise and efficiently zero in on the content that truly matters amidst your busy schedule.

It has been an incredibly dense week for AI, defined by a mix of rapid tooling iterations and profound strategic debates. On the model front, Google , OpenAI , and xAI all played major cards with Gemini 3 , GPT-5.1 , and Grok 4.1 respectively—each pushing the boundaries of reasoning depth, latency, and developer experience. Parallel to these releases, we saw some heavyweight intellectual exchanges: from Elon Musk and Jensen Huang debating the physical limits of compute, to Fei-Fei Li unveiling her vision for spatial intelligence, and Microsoft executive Wei Qing reflecting on organizational transformation. Whether you are a hands-on builder or a big-picture strategist, this week has something for you.

Here are the 10 standout highlights from this week:

🌐 Google has unleashed the Gemini 3 ecosystem, introducing a Deep Think mode to boost long-chain reasoning capabilities, alongside the intelligent Antigravity IDE and a natural-language CLI tool for developers.

⚡ GPT-5.1 offers developers a new versatile option. It defaults to a low-latency "no-reasoning" mode, integrates web search directly into the API for the first time, and extends Prompt caching duration to 24 hours.

🏆 Grok 4.1 is dominating the leaderboards. By scaling up RLHF by an order of magnitude and utilizing agentic reward models, it has made significant strides in reducing hallucinations and improving emotional interaction.

🎙️ In a deep retrospective on Microsoft’s cultural turnaround, executive Wei Qing outlines his decision-making frameworks and argues that in the human-machine relationship, humans provide the "outliers" while machines handle the "regression."

🏭 Elon Musk and Jensen Huang shared the stage to dissect the "AI Factory" concept. They predicted that due to terrestrial energy and heat constraints, future compute clusters might eventually migrate to space-based solar satellites.

🌍 Fei-Fei Li ’s team at World Labs launched Marble , the first model capable of generating fully navigable 3D worlds from simple prompts, marking a major leap toward true World Models.

🎨 Nano Banana Pro leverages Gemini 3 ’s multimodal reasoning to solve "logical hallucinations" in image generation, while seamlessly integrating with Veo 3 for professional image-to-video workflows.

🛠️ Agent Development Insights: This week features a guide on 12 context engineering practices to boost performance, plus a deep dive arguing that Claude Skills are essentially upgrading "prompt engineering" into "process engineering."

🧠 Addressing the "goldfish memory" of current AI, EverMemOS proposes a brain-inspired four-layer architecture—covering Agent, Memory, Index, and Interface layers—to give machines a persistent, accumulating "soul."

🛍️ Application Layer moves: Alibaba’s Qwen App enters public beta testing "AI+X" e-commerce monetization, while Slack founder Stewart Butterfield shares masterclass product philosophies, including the metaphor of "tilting the umbrella."

I hope this curated list brings you new insights and inspiration. Stay curious, and I'll see you next week!

Subscribe Now

1Gemini-3 and its Ecosystem: A Deep Dive
2Introducing GPT-5.1 for developers
3Musk Quietly Releases Grok 4.1, Leading All Leaderboards in the Large Model Arena
4Large Language Models: A Three-Stage Learning Process
5The Godmother of AI on jobs， robots & why world models are next | Dr. Fei-Fei Li
65 things to try with Gemini 3 Pro in Gemini CLI
7Agent Revolution! Understanding the Core Development Pipeline of AI Agents
8Claude Skills: More Than Just Storing Prompts in a Folder?
9From The Legend of Zelda to AI Agent: The Information Layering Design Philosophy Behind Claude Skills
10AI Memory Revolution: How EverMemOS Gives Machines Real Intelligence
11Nano Banana Pro Released: Google Intensifies Competition with Gemini 3 and Veo 3
12The Tao Fangbo I Know and His Second Me Project
13Alibaba's Qianwen APP Launches Public Beta, Aiming to be Chinese ChatGPT | Hands-on Review
14Mastering Lark App Mode: A Step-by-Step Guide
15Mental models for building products people love ft. Stewart Butterfield
16Satya Nadella describes how lessons from Microsoft’s history apply to today’s boom
17Musk and Huang Discuss the Future of AI: A Comprehensive Analysis (10,000+ Words, Video Included)
18E42 Meng Yan Talks with Wei Qing: The Silent Protagonist
19Zhang Fan on AI's ToB Potential: A Vision from the Former Zhipu COO and YL Intelligence Founder/CEO
20Exclusive Sharing from a Leading Company's CIO: How AI Reshapes the Future Decade for Developers

Gemini-3 and its Ecosystem: A Deep Dive

赛博禅心

mp.weixin.qq.com

11-18

2638 words · 11 min

This article provides a comprehensive breakdown of Google's Gemini 3 and its supporting ecosystem architecture. Key coverage includes: Gemini 3 Pro's 1501 Elo benchmark performance on LMArena; Deep Think mode with Thought Signatures and Thinking Levels for enhanced long-chain reasoning; Antigravity as a task-oriented IDE for the Agent era supporting multi-Agent collaboration and autonomous operations; Gemini CLI for natural language to Shell conversion; Generative UI for dynamic interface generation in search; and ecosystem integration through Android Studio Otter and Firebase AI Logic SDK.

Introducing GPT-5.1 for developers

Simon Willison's Weblog

simonwillison.net

11-13

402 words · 2 min

Simon Willison provides a detailed analysis of GPT-5.1's developer features. The core update is "none" reasoning mode becoming the default, optimized for low-latency scenarios with improved tool calling, coding, and instruction following, plus first-time API-level web search integration. Adaptive reasoning is another highlight—the model dynamically adjusts thinking depth based on task complexity, offering fast responses for simple tasks to reduce costs while maintaining deep reasoning for complex ones. Extended prompt cache retention extends caching to 24 hours at no extra cost by moving caches from GPU memory to local storage. The article also covers new built-in tools like apply_patch, valuable for building LLM-powered code editing applications.

Musk Quietly Releases Grok 4.1, Leading All Leaderboards in the Large Model Arena

量子位

qbitai.com

11-18

1378 words · 6 min

Musk Quietly Releases Grok 4.1, Leading All Leaderboards in the Large Model Arena

Grok 4.1 tops LLM Arena leaderboards with thinking and non-thinking modes at positions one and two. Key breakthrough: RLHF scaled by an order of magnitude with agentic reward models. Significant improvements in emotional interaction, creative writing, and hallucination reduction.

Large Language Models: A Three-Stage Learning Process

Hung-yi Lee

youtube.com

11-17

9972 words · 40 min

Large Language Models: A Three-Stage Learning Process

Professor Hung-yi Lee masterfully breaks down the complete LLM learning pipeline from pre-training to alignment with his signature accessible teaching style. The lecture brilliantly uses the analogy of "preschool, school, entering society" to make complex technical concepts intuitive. Beyond covering the staggering scale of 15T tokens and practical Chinchilla scaling laws, it reveals a profound insight: SFT and RLHF don't teach new knowledge but rather unlock the potential within pre-trained models. Essential viewing for developers and researchers wanting to understand the mechanics behind ChatGPT and similar models.

The Godmother of AI on jobs， robots & why world models are next | Dr. Fei-Fei Li

Lenny's Podcast

youtube.com

11-16

20450 words · 82 min

The Godmother of AI on jobs， robots & why world models are next | Dr. Fei-Fei Li

In this podcast, Dr. Fei-Fei Li, the "Godmother of AI," systematically traces artificial intelligence's evolution from the AI winter to today's deep learning revolution. She provides deep insights into how ImageNet, combined with neural networks and GPUs, formed the "golden recipe" for modern AI, while emphasizing world models as the next critical frontier.

Li introduces Marble, World Labs' newly launched product—the world's first model that generates navigable 3D worlds from simple prompts, already showing value in virtual production, game development, and robotic simulation. She also shares profound thoughts on human-centered AI and offers career advice for young AI professionals. This is a rare opportunity to understand AI's underlying logic and future direction from both historical and forward-looking perspectives.

5 things to try with Gemini 3 Pro in Gemini CLI

Google Developers Blog

developers.googleblog.com

11-18

1601 words · 7 min

5 things to try with Gemini 3 Pro in Gemini CLI

Google has integrated Gemini 3 Pro, its most intelligent AI model, into Gemini CLI, bringing a new development experience directly to the terminal. The article demonstrates this combination's capabilities through five practical scenarios: generating complete web applications with 3D graphics from a single prompt, converting hand-drawn UI sketches into frontend code, executing complex Git commands via natural language, auto-generating user documentation from codebases, and orchestrating debugging workflows across multiple cloud services. Gemini 3 Pro's core strengths lie in its advanced reasoning and multimodal understanding, accurately interpreting complex instructions while synthesizing text, images, and code.

Agent Revolution! Understanding the Core Development Pipeline of AI Agents

腾讯云开发者

mp.weixin.qq.com

11-18

13524 words · 55 min

Agent Revolution! Understanding the Core Development Pipeline of AI Agents

A comprehensive guide to the AI Agent development lifecycle. The core highlight is identifying Context Engineering as the key to Agent performance, offering 12 specific optimization practices. It also deeply analyzes the engineering pros and cons of the MCP protocol (such as connection stability and logging challenges) and compares frameworks like AutoGen, LangGraph, and Crew AI.

Claude Skills: More Than Just Storing Prompts in a Folder?

刘小排r

mp.weixin.qq.com

11-14

6097 words · 25 min

Claude Skills: More Than Just Storing Prompts in a Folder?

E42 Meng Yan Talks with Wei Qing: The Silent Protagonist

This nearly four-hour deep conversation features Wei Qing analyzing Microsoft's cultural transformation from the Ballmer to Nadella era as an insider—the fundamental shift from know-it-all to learn-it-all, and how the "three-error method" (acknowledge, understand, correct) reshaped the innovation DNA of a 100,000-person organization.

The real value lies in Wei's thinking frameworks: the "want-can-should-may" four-dimensional model for tech decisions, the "five beliefs" theory emphasizing unity of faith and action, and the SCBIG model integrating systems and inverse thinking. Most profound is his insight on human-machine relations—human value lies in providing outliers while machines inherently regress to means, and civilization's direction depends on what data we feed into the corpus.

Zhang Fan on AI's ToB Potential: A Vision from the Former Zhipu COO and YL Intelligence Founder/CEO

十字路口Crossing

xiaoyuzhoufm.com

11-16

2013 words · 9 min

Zhang Fan on AI's ToB Potential: A Vision from the Former Zhipu COO and YL Intelligence Founder/CEO

Former Zhipu AI COO Zhang Fan systematically presents his fundamental thinking on AI ToB entrepreneurship. He argues AI should be viewed as digital employees rather than software tools, targeting the labor market instead of the software market.

His core methodology is "commercial reinforcement learning": defining business objectives and feedback mechanisms to let AI evolve in real business environments and transform into role-specific productivity. Zhang emphasizes enterprises need to build core barriers combining "50% business advantages + 50% model amplification" and advises entrepreneurs to elevate their understanding of AI's "model nature" as a strategic rather than technical issue.

Exclusive Sharing from a Leading Company's CIO: How AI Reshapes the Future Decade for Developers

InfoQ 中文

mp.weixin.qq.com

11-20

7972 words · 32 min

Exclusive Sharing from a Leading Company's CIO: How AI Reshapes the Future Decade for Developers

Alibaba Cloud CIO Jiang Linquan analyzes developer transformation in the AI era. Key insights: measure efficiency by end-to-end person-months, not code volume; AI lowers full-stack barriers, creating hybrid roles like "Product Design Frontend" and "Architecture Backend"; R&D should achieve self-contained efficiency first before business transformation. Knowledge serves as AI's "fuel," and developers need left-shift thinking, curiosity, and resilience.

BestBlogs Issue #73: Outliers and Regression

Contents