BestBlogs Issue #80: From Chat to Action

Hello! Welcome to Issue 80 of BestBlogs.dev's AI article recommendations.

This week's theme is From Chat to Action.

At the Tsinghua AGI-Next Summit, Zhipu's Tang Jie put it bluntly: after DeepSeek's breakthrough, the Chat era is essentially over—the next frontier is getting things done. Yang Zhilin framed the destination of this paradigm shift as the Agentic Intelligence Era, where models evolve from passive text generators into autonomous agents capable of planning and decision-making.

This isn't hype. This week, Alibaba's Qianwen App integrated over 400 services, enabling users to order food, book flights, and check social security with a single command. Claude Cowork brought agent capabilities to the desktop. Cursor and Claude both released official agent best practices. From foundation models to consumer products, the entire industry is answering the same question: how can AI actually get things done for people?

Here are the 10 highlights worth your attention this week:

🎤 The Tsinghua AGI-Next Summit brought together China's AI elite for what may be the year's most information-dense technical dialogue. Tang Jie traced Zhipu's decade-long journey from cognitive intelligence to agents, arguing that Intelligence Efficiency will define the next phase of competition. Yang Zhilin shared Kimi's technical roadmap for the first time, centered on Token Efficiency and Long Context, and revealed key details about the Muon optimizer and the KimiLinear architecture. Lin Junyang candidly put China's chances of overtaking the US at around 20%, but noted that necessity breeds innovation and that hardware-software integration may be the breakthrough path. Yao Shunyu joined remotely and observed a clear divergence between enterprise (toB) and consumer (toC) markets: on the enterprise side, higher model intelligence translates directly into greater productivity value.

📊 A comprehensive annual AI review synthesizing 200 papers declares 2025 the end of the Scaling Law brute-force era. Technical focus has shifted to four core domains: fluid reasoning, long-term memory, spatial intelligence, and meta-learning.

🔍 The Qwen team released the Qwen3-VL-Embedding and Reranker series, filling a gap in high-performance multimodal retrieval tools for the open-source community. The two-stage pipeline—dual-tower embedding for recall plus cross-encoder for reranking—sets new open-source records on benchmarks like MMEB-v2. Essential reading for developers building multimodal RAG systems.

🤖 Qianwen App received a landmark update, integrating Taobao, Fliggy, and over 400 Alibaba ecosystem services to transform into a full-featured agent. Users can now complete food delivery, flight bookings, and government services with a single command, powered by Qwen3-Max and the MCP protocol. Meanwhile, Simon Willison tested Anthropic's Claude Cowork, demonstrating its potential for desktop automation workflows while cautioning users about prompt injection risks.

🛠️ Cursor officially released agent best practices, moving beyond basic prompt input to advocate a plan-first-code-later strategy. The guide covers .cursor/rules for global configuration, SKILL.md for dynamic capabilities, and Hooks for automation loops. Alibaba Cloud developers published a deep dive on Claude Skills, clarifying the distinction between Skills and MCP and providing a complete progression from official best practices to real-world implementation.

🧩 LangChain published a detailed comparison of four multi-agent architecture patterns: subagents, skills, handoffs, and routers. Through quantitative analysis of model calls, latency, and token consumption, the guide offers a clear decision framework. The core advice: stick with simple single-agent designs until you hit clear scaling bottlenecks.

📐 A Trae technical expert at ByteDance dissected Agentic Coding from first principles. The key insight: improving AI collaboration isn't about unlimited context, but about short conversation patterns and compounding engineering. Tencent's team shared a retrospective on three months with Speckit, proposing a new architecture based on context engineering and compounding engineering that decouples agents from skills for automated knowledge accumulation and retrieval.

🎬 The creators of the viral Louvre Kitten AI video shared an extensive behind-the-scenes breakdown covering the entire workflow: concept development, character selection, storyboarding, and art direction. The core insight: upfront human planning, such as hand-drawn storyboards, remains essential, because AI amplifies rather than replaces creative vision.

💼 OpenAI and Google engineers shared lessons from deploying over 50 AI products, focusing on the challenges of non-determinism. The core framework balances agency versus control: start with low-agency, high-control V1 versions and iterate through continuous calibration. "Pain is the new moat"—a truth worth pondering for any team moving from prototype to production.

🎙️ Two podcasts explored value reconstruction in the AI era from different angles. Oasis Capital's Zhang Jinjian reflected on three years of going all-in on AI, proposing that the defining challenge of the next decade is building subjectivity: in an age where AI amplifies individual traits, being authentically yourself is no longer sentimental advice but the only survival strategy. Another episode examined how AI Coding is transforming software from a high-value asset into a low-cost commodity, shifting competitive moats from what you can build to who controls distribution and trust.

From chat to action, from conversation to execution—the signals at the start of 2026 are unmistakable. Model companies are competing on intelligence efficiency, application layers are racing to deploy real-world solutions, and the real competition has only just begun. Stay curious, and see you next week!

1. New Benchmark in Multimodal Retrieval: Qwen3-VL-Embedding & Reranker Now Open Source!
2. After $500 Million Financing, Yang Zhilin Shares Kimi's Technical Focus for the First Time (Full Speech Included)
3. After Reading 200 Papers: Analyzing DeepMind, Meta, DeepSeek - What AGI Narratives Are Chinese and American Giants Describing? | 2025 AI Annual Review
4. Deconstructing Agentic Coding from First Principles: From Theory to Practice
5. Cognitive Reconstruction: After Three Months with Speckit, I Abandoned It - Escaping the Dilemma of Powerful Tools That Are Hard to Use Well
6. Choosing the Right Multi-Agent Architecture
7. Best Practices for Rapid Development of High-Quality Claude Agent Skills
8. Cursor Agent Best Practices
9. First impressions of Claude Cowork, Anthropic’s general agent
10. AI Starts to "Take Action", Alibaba's Qwen Leads the World
11. What OpenAI & Google engineers learned deploying 50+ AI products in production
12. The Louvre Kitten AI Video: A 10,000-Word Creative Experience Sharing That Went Viral Online, Possibly Their Most Unreserved One Yet
13. Yao Shunyu Lectures Face-to-Face with Tang Jie, Yang Zhilin, and Lin Junyang! Four Schema Heroes Debate Heroes at Zhongguancun
14. The First Three Years of All-in AI | A Conversation with Zhang Jinjian, Partner at Vitalbridge
15. AI Coding: Manifesting Reality Through Words—What Will Remain Valuable in the Future?

New Benchmark in Multimodal Retrieval: Qwen3-VL-Embedding & Reranker Now Open Source!

通义大模型

mp.weixin.qq.com

01-08

1978 words · 8 min

The Alibaba Qwen team has officially released the Qwen3-VL-Embedding and Qwen3-VL-Reranker model series, addressing a critical gap in high-performance open-source multimodal retrieval tools. Built upon Qwen3-VL, these models enable a unified semantic space for text, images, videos, and visual documents like charts or UI components. By leveraging a two-stage workflow—utilizing a dual-tower Embedding model for efficient recall and a single-tower Reranker with cross-attention for precision—this series has set new benchmarks on leaderboards like MMEB-v2. It serves as an essential infrastructure for developers building multimodal RAG systems or tackling complex cross-modal search challenges.
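For readers new to the recall-then-rerank workflow described above, here is a minimal sketch in plain Python. It does not use the Qwen3-VL models or their API: `embed` is a toy bag-of-words stand-in for a dual-tower embedding model, `rerank_score` is a toy stand-in for a cross-encoder reranker, and the corpus is made up for illustration. Only the two-stage shape is the point.

```python
import math
from collections import Counter

def embed(text):
    """Toy stand-in for a dual-tower embedding model: bag-of-words counts.
    Queries and documents are embedded independently of each other."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def recall(query, doc_vectors, k):
    """Stage 1: cheap vector similarity over precomputed document vectors."""
    q = embed(query)
    ranked = sorted(range(len(doc_vectors)), key=lambda i: cosine(q, doc_vectors[i]), reverse=True)
    return ranked[:k]

def rerank_score(query, doc):
    """Toy stand-in for a cross-encoder: scores the (query, doc) pair jointly,
    here by counting shared adjacent word pairs."""
    def bigrams(text):
        words = text.lower().split()
        return set(zip(words, words[1:]))
    return len(bigrams(query) & bigrams(doc))

def search(query, corpus, k=3, top_n=1):
    """Recall k candidates, then rerank only those candidates."""
    doc_vectors = [embed(d) for d in corpus]  # in practice, built once offline
    candidates = recall(query, doc_vectors, k)
    reranked = sorted(candidates, key=lambda i: rerank_score(query, corpus[i]), reverse=True)
    return [corpus[i] for i in reranked[:top_n]]

# Hypothetical captions standing in for multimodal documents.
corpus = [
    "chart sales red",
    "a red sales chart for the third quarter",
    "photo of a cat on a sofa",
    "blue sales report",
]

print(search("red sales chart", corpus))
```

In this toy run the recall stage ranks the scrambled caption first because word counts ignore order, and the pairwise reranker then promotes the caption that actually contains the query phrase. That is the division of labor the two-stage design relies on: a fast, order-blind first pass, followed by a slower, more precise second pass over a short candidate list.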

After $500 Million Financing, Yang Zhilin Shares Kimi's Technical Focus for the First Time (Full Speech Included)

腾讯科技

mp.weixin.qq.com

01-10

6254 words · 26 min

In a recent keynote, Yang Zhilin, founder of Moonshot AI, unveiled Kimi's 2025 roadmap, centered on "Agentic Intelligence." He emphasizes that beyond the standard Scaling Law, improving "Token Efficiency" and "Long Context" capabilities is vital for evolving models from passive tools into active agents. Key technical breakthroughs include a new second-order optimizer that doubles token efficiency and the KimiLinear architecture, which solves the performance degradation of linear attention in long sequences. Yang also shares his philosophy on "technical taste," viewing AGI as a key to expanding the limits of human civilization.

After Reading 200 Papers: Analyzing DeepMind, Meta, DeepSeek - What AGI Narratives Are Chinese and American Giants Describing? | 2025 AI Annual Review

腾讯科技

mp.weixin.qq.com

01-12

18539 words · 75 min

This article provides a profound retrospective of AI technological evolution in 2025 and an outlook for 2026. The author argues that 2025 marked the end of the "Brute Force" era dominated by simple Scaling Laws, shifting focus toward four pillars: Fluid Reasoning, Long-term Memory, Spatial Intelligence, and Meta-learning. By analyzing cutting-edge research such as Test-Time Compute (TTC), Titans architecture, and world models, the text illustrates AI's transition from being "encyclopedic" to truly "intelligent." Whether you are interested in RL engineering optimizations or the next generation of non-frozen model architectures, this systematic overview and its extensive bibliography offer invaluable insights.

Deconstructing Agentic Coding from First Principles: From Theory to Practice

字节跳动技术团队

mp.weixin.qq.com

01-12

18348 words · 74 min

This in-depth article by a Trae technical expert explores the first principles of Agentic Coding, shifting the focus from "larger context windows" to optimized human-AI collaboration. It dissects the autoregressive nature of LLMs, the constraints of attention mechanisms, and how Reinforcement Learning enables models to "act." The author advocates for "Short Dialogues" and "Compounding Engineering"—the practice of accumulating project-specific knowledge into reusable assets. By emphasizing that good developer experience (DX) benefits AI as much as humans, the article provides a robust framework for mastering AI coding tools. It is an essential read for developers aiming to transition from AI users to AI orchestrators.

Cognitive Reconstruction: After Three Months with Speckit, I Abandoned It - Escaping the Dilemma of Powerful Tools That Are Hard to Use Well

腾讯技术工程

mp.weixin.qq.com

01-09

20445 words · 82 min

This article explores the paradigm shift from Spec-Driven Development (SDD) to advanced "AI Engineering." By analyzing the limitations of tools like speckit and openspec in complex corporate environments, the author proposes a new architecture centered on Context Engineering and Compounding Engineering. The core focus is decoupling Agents and Skills to enable automated knowledge accumulation and retrieval, effectively breaking the cycle of constant marginal costs in AI-assisted programming to achieve "compound interest" in R&D efficiency.

Choosing the Right Multi-Agent Architecture

LangChain Blog

blog.langchain.com

01-14

1552 words · 7 min

This article provides a comprehensive guide on transitioning from single-agent setups to multi-agent architectures. It categorizes and compares four fundamental patterns—Subagents, Skills, Handoffs, and Routers—analyzing their performance in terms of latency, token cost, and parallelization. The piece emphasizes a pragmatic approach: start with a single agent and only graduate to complex multi-agent systems when facing clear context management or team coordination constraints. By providing quantitative benchmarks and decision frameworks, it serves as an essential resource for AI developers looking to build scalable and efficient agentic applications.
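As a rough illustration of one of the four patterns named above, the router, here is a minimal sketch in plain Python. It assumes a router is a step that classifies the input and dispatches it to specialized agents; it is not LangChain's API. The agent functions, the `ROUTES` table, and the keyword-based classifier are all hypothetical stand-ins (a real router would likely use a model call to classify).

```python
from typing import Callable, Dict, List, Set, Tuple

Agent = Callable[[str], str]

# Hypothetical specialized agents; each would wrap a model call in practice.
def billing_agent(query: str) -> str:
    return f"[billing] {query}"

def docs_agent(query: str) -> str:
    return f"[docs] {query}"

def general_agent(query: str) -> str:
    return f"[general] {query}"

# Hypothetical routing table: route name -> (trigger keywords, agent).
ROUTES: Dict[str, Tuple[Set[str], Agent]] = {
    "billing": ({"invoice", "refund", "payment"}, billing_agent),
    "docs": ({"install", "configure", "api"}, docs_agent),
}

def route(query: str) -> List[str]:
    """Classify the query: return every route whose keywords appear in it."""
    words = set(query.lower().split())
    return [name for name, (keywords, _) in ROUTES.items() if words & keywords]

def answer(query: str) -> str:
    """Dispatch to the matched agents, or fall back to a single general agent."""
    names = route(query)
    if not names:
        return general_agent(query)
    replies = [ROUTES[name][1](query) for name in names]
    return "\n".join(replies)

print(answer("How do I configure the API"))
```

The fallback branch mirrors the article's pragmatic advice: when nothing calls for a specialist, a single general agent handles the request, and the routing layer only earns its keep once the specialized paths are clearly needed.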

Best Practices for Rapid Development of High-Quality Claude Agent Skills

阿里云开发者

mp.weixin.qq.com

01-16

4767 words · 20 min

This article provides a deep dive into Anthropic's newly released "Skills" feature, offering a comprehensive implementation guide for developers. By comparing Skills (logic/best practice encapsulation) with MCP (tool/API connectivity), the author clarifies their distinct roles and introduces core methodologies like "progressive disclosure" and "AI-driven development." The piece goes beyond summarizing official best practices by showcasing a real-world "Prompt Optimizer" use case, demonstrating how to leverage context engineering to generate high-quality skills rapidly. It serves as an essential manual for developers looking to enhance agent autonomy within the Claude desktop, Claude Code, or API environments.

Cursor Agent Best Practices

宝玉的分享

baoyu.io

01-12

5014 words · 21 min

This comprehensive guide from the Cursor Team outlines a systematic approach to collaborating with programming agents. Moving beyond basic prompting, it introduces the "Plan-before-Code" strategy and details how to leverage .cursor/rules for global constraints and SKILL.md for dynamic capabilities. The article also explores advanced features like automated hooks, parallel multi-agent worktrees, and cloud-based agents for asynchronous tasks.

First impressions of Claude Cowork, Anthropic’s general agent

Simon Willison's Weblog

simonwillison.net

01-12

1158 words · 5 min

Anthropic’s Claude Cowork brings general-purpose agent capabilities to the desktop, allowing users to automate file tasks and research within a secure sandbox. While Simon Willison highlights its impressive productivity gains, he warns that the threat of prompt injection remains a critical concern for users handling sensitive data in this new agentic era.

AI Starts to "Take Action", Alibaba's Qwen Leads the World

量子位

qbitai.com

01-15

4751 words · 20 min

Alibaba's Qwen App has evolved into a full-stack Agent, integrating 400+ services including Taobao and Fliggy. Leveraging the Qwen3-Max model and MCP/A2A protocols, it now handles complex end-to-end tasks like booking travel and bulk shopping via simple voice commands. This marks a shift from AI as a talker to a doer, creating the first large-scale "search-to-fulfillment" AI assistant in the industry.

What OpenAI & Google engineers learned deploying 50+ AI products in production

Lenny's Podcast

youtube.com

01-11

10596 words · 43 min

This episode features AI experts Aishwarya Reganti and Kiriti Badam, who dive deep into why traditional software playbooks fail in the age of LLMs. They introduce the Agency-Control Trade-off, explaining that increasing an agent's autonomy inevitably reduces human control, necessitating a gradual deployment strategy. The conversation outlines the CC/CD (Continuous Calibration, Continuous Development) framework, moving beyond "Vibes vs. Evals" to a data-driven feedback loop. With insights from deploying 50+ AI products at companies like OpenAI and Google, this is an essential guide for teams struggling with reliability and seeking a realistic path toward autonomous agents.
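The agency-versus-control idea can be pictured as a dial that a team turns up over successive releases. The sketch below is one possible reading, not the guests' framework: the three `Agency` levels, their names, and the `handle` function are assumptions made for illustration.

```python
from enum import IntEnum

class Agency(IntEnum):
    """Hypothetical autonomy levels, from high control to high agency."""
    SUGGEST_ONLY = 1       # V1: the agent drafts, a human executes
    ACT_WITH_APPROVAL = 2  # the agent executes once a human approves
    AUTONOMOUS = 3         # the agent executes without review

def handle(action: str, level: Agency, approved: bool = False) -> str:
    """Decide what happens to a proposed action at a given agency level."""
    if level == Agency.SUGGEST_ONLY:
        return f"suggested: {action}"
    if level == Agency.ACT_WITH_APPROVAL and not approved:
        return f"pending approval: {action}"
    return f"executed: {action}"

# The same proposed action at each level of the dial.
for level in Agency:
    print(level.name, "->", handle("send refund email", level))
```

Starting a product at `SUGGEST_ONLY` and raising the level only after observed behavior supports it is one way to express the gradual deployment strategy the episode recommends.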

The Louvre Kitten AI Video: A 10,000-Word Creative Experience Sharing That Went Viral Online, Possibly Their Most Unreserved One Yet

数字生命卡兹克

mp.weixin.qq.com

01-16

9620 words · 39 min

In this article, the video's two main creators, Haixin and Aweng, walk through the complete workflow behind The Louvre Kitten AI video. They cover concept development, character selection (including why white and orange cats were ultimately chosen), setting the film's tone, music production (using Suno AI for generation and variations), storyboard design (with an emphasis on information density, rhythm, and emotional progression), and art style control (applying an Islamic art style, choosing Nano Banana Pro as the AI model, and prompt engineering), as well as techniques for iterating on complex scenes and live-action compositing. The creators stress that upfront human planning, such as hand-drawn storyboards, remains essential in AI-assisted creation. They also describe how to use AI tools with care to reach high-quality visuals: combining different AI functions to streamline the workflow, trimming unnecessary prompt length, and iterating quickly to correct errors. Beyond the technical details, the article reveals the thinking behind the artistic choices.

Yao Shunyu Lectures Face-to-Face with Tang Jie, Yang Zhilin, and Lin Junyang! Four Schema Heroes Debate Heroes at Zhongguancun

量子位

qbitai.com

01-11

39241 words · 157 min

This comprehensive review captures the strategic insights from China's AI pioneers at the AGI-Next summit. Featuring leaders from Zhipu, Moonshot, and Alibaba, the article explores the pivotal shift from "Chat" to "Actionable Agents." It delves into technical breakthroughs such as Muon optimizers and Linear Attention mechanisms, designed to enhance Scaling Law efficiency under compute constraints. The text provides a high-density roadmap for 2025, addressing model architecture evolution, industry bifurcation, and the competitive landscape between China and the US. It is an essential read for professionals seeking to understand the underlying logic and future trajectory of LLMs.

The First Three Years of All-in AI | A Conversation with Zhang Jinjian, Partner at Vitalbridge

42章经

xiaoyuzhoufm.com

01-10

1129 words · 5 min

In this podcast episode, Zhang Jinjian, Partner at Oasis Capital (Vitalbridge), looks back over the three-year AI investment cycle since 2023. Prompted by the landmark public listings of Zhipu and MiniMax, the discussion explores the technical convergence of LLMs and Embodied AI, described as the "North and South Slope" paths to AGI. Moving beyond market analysis, Zhang introduces "Subjectivity" as the central theme for the next decade. He argues that as AI amplifies individual traits, "living as oneself" shifts from a romantic ideal to the only viable survival strategy.

AI Coding: Manifesting Reality Through Words—What Will Remain Valuable in the Future?

AI炼金术

xiaoyuzhoufm.com

01-15

1726 words · 7 min

This episode explores how AI Coding turns software into a disposable commodity. Key insights: AI-driven "dialogue-to-execution" workflows will eliminate middle-layer tools and shrink organizations into 3-5 person powerhouses. As production becomes effortless, the true competitive moats shift from technical building capacity to distribution channels and established user trust.
