**Pat Grady:** 大家早上好。各位感觉怎么样?好的。好的。再热情一点。嘿,感谢大家来到这里。我们真的非常感激。我们举办这个活动,是作为对社区的一种服务,因为我们正在经历一个重要的时代,能够充当一个让大家聚在一起的聚集点,对我们来说是一种荣幸。这次是我们迄今为止最好的议程,也是迄今为止最好的参会阵容。所以我们首先要说的就是感谢。我们知道大家都忙得不可开交。感谢你们今天来到这里。那么,我们确实准备了一个非常出色的议程。照惯例,穿着无可挑剔的 Sonya、Constantine 和我会先说几句话。你们知道,我们有幸身处大量有趣的人的大量对话之中。所以每年一次,我们会试着把这些做一些提炼,然后把我们听到的东西分享给大家。我先说几句关于整体校准的话,然后 Sonya 会讲讲我们今天看到的情况,接着 Constantine 会讲讲我们认为明天可能会发生什么。
说到校准,我们先把视野拉远,回到这些基于硅的晶体管——正是它们让这个地区得名"硅谷"——它们被构建成系统,通过网络连接,以互联网的形式走向公众,支撑了社交媒体和云计算等应用,最终出现在我们口袋里的移动设备上,而这些设备今天能够做到与魔法无异的事情——那就是 AI。我们之所以喜欢展示这张幻灯片——在座来过往届的朋友以前见过——是因为它提醒我们,所有这些浪潮都是叠加的。我们需要所有这几十年的演进,才能拥有今天的算力、带宽、数据和人才,才能充分把握这个时刻。
现在,这波 AI 浪潮有三点不同。第一,它是迄今为止最大的一波浪潮。这话大体上没错。但有一点更具体的是,它是第一波同时覆盖软件和服务的浪潮。上面一行展示的是云转型的前 15 年,软件的总可寻址市场(TAM)从大约 3500 亿涨到了 6500 亿,其中云计算大概占了 4000 亿。下面一行是全新的。这是看起来现在也可以获取的服务收入。10 万亿是一个很方便的整数。我们不知道到底是 10 万亿还是 5 万亿还是 50 万亿。但我们知道,光是美国的法律服务市场就有 4000 亿。那只是一个垂直领域、一个地区,就已经和整个软件市场一样大了。所以这个机会是巨大的。
第二点,这是迄今为止最快的一波浪潮。我想大家都能感受到。这意味着——请注意看这页的 AI 这边——这些空白地带正在被很快地填满。这些 logo 是因为云计算、移动互联网以及现在的 AI 这些构造性转变而收入超过 10 亿美元的公司。按照目前的速度,很快还会有更多。
第三点,也可能是最有意思的一点——这个我借用了我的合伙人 Constantine 的观点——技术领域基本上有两种革命。一种是通信革命(revolution of communication),关于信息如何分发。在座大多数人只经历过通信革命。互联网、云计算、移动互联网,这些都是关于信息分发的。这些是通信革命。AI 不一样。AI 是这一种。AI 是计算革命(revolution of computation)。它关于的是信息如何被处理。这听起来像是在咬文嚼字,但这是根本不同形态的浪潮。也许最直观的感受方式是想想这个事实:脚下的地板一直在动。每个人赖以构建的技术基础,每天都在变化,因为新的能力不断涌现。在过去几年里,我们经历了三个主要的拐点。第一个,ChatGPT 时刻,2022 年 11 月,全世界看到了预训练(pre-training)的力量。第二个,大约两年后,o1 模型,推理(reasoning)能力,突然之间第二条扩展定律(scaling law)围绕推理时计算(inference time compute)出现了。第三个,就在最近,Claude Code,Opus 4.5,现在是 4.7。全世界看到了长时程 agent 的力量。虽然这三个点看起来像是一条连续线上的三个节点,但第二个和第三个之间其实有一个硬断层。这是一个不连续的变化。如果我们可以大胆一点的话,我们会说——这就是 AGI。
你看,我是学经济学的。我们是风险投资人。我们不打算给 AGI 提出一个技术性定义,好吧?我们研究创始人和市场,以及两者碰撞产生的东西——也就是商业。但我们确实研究商业。所以从商业角度、从实用角度、从功能角度来看,如果你可以派遣一个 agent 去做一项工作,它能从失败中恢复并坚持到工作完成——我不知道,这感觉就很像 AGI 了。即使你不认为这是 AGI——这完全没问题。顺便说一下,Sonya 会在她那部分更多地谈这个。即使你不认为这是 AGI,我想我们都能看到——汽车已经来了。过去几年,我们有很多更快的马——就是那些让你提高 10% 或 40% 生产力的应用,但并没有从根本上改变你的工作方式。现在我们开始看到汽车了——让你提高 10 倍或 40 倍生产力的应用,绝对会改变你的工作方式。改变你工作的本质。改变你组织的本质。汽车已经来了。
这位是 Sequoia 的创始人 Don Valentine。他以只问一个问题而闻名:So what?所以呢?这些东西为什么重要?重要的原因是,就在最近几个月,竞赛已经开始了,而且这是一种和我们习惯的不一样的竞赛。开汽车的方式和骑马的方式不一样。造汽车的方式和照顾马的方式也不一样。所以这是一种非常不同的竞赛。我们今天想把大家聚在一起的原因之一,就是没有人拥有所有答案。我们在一起的时间越多,我们就越能学习,希望能弄清楚这一切将何去何从。而且我们需要尽快这样做,因为有很多东西在悬而未决——仅从商业角度来看,就有 10 万亿美元等着争夺。我们有实验室从技术推出(tech-out)的方向切入,也有创业公司在上面构建、从客户回溯(customer-back)的方向切入。这个房间里确实有所有主要实验室的代表,但你们大多数人是在上面构建的。所以我们花点时间聊聊 customer-back 这种方式。
我们对那些在实验室之上构建的人的建议——这是免费的建议,所以值你为它付出的每一分钱——我们的建议是要变"MAD"。我们并不真的需要你们生气。你想生气也行。如果那是驱动你的东西,那很酷,去生气吧。但这只是一个方便的缩写,代表护城河(Modes)、可供性(Affordance)和扩散(Diffusion),这是在模型之上构建时的三个战略支柱。
首先说护城河。纯属好玩,有人还记得去年的这张幻灯片吗?一个人记得,他是我的合伙人。好的,酷。嗯,我们不打算——好的,提醒一下,这张幻灯片展示的是商品化周期(merchandising cycle),就是把一个想法变成一个满意的客户所需的价值链中的各个环节。我们实际上不会逐一讲解每个环节。我想在这里表达的观点是:如果你从技术推出的角度切入,价值链的每个环节你会用一种方式来处理。如果你从客户回溯的角度切入,每个环节你会用另一种方式来处理。
现在,这里有一个反直觉的地方。在计算革命中,也就是关于信息处理的革命,你会本能地想往下看,因为不断有很酷的新东西出来。但你实际上应该做的——为了构建护城河——是往上看。因为你的客户的变化速度远远没有技术能力变化那么快。你构建的东西明天可能就过时了。但你围绕客户包裹得越紧密,那就会更持久一些。这不是说产品和技术不重要。它极其重要,一般来说最好的产品会赢。但在一个因为技术能力变化太快而导致产品变化太快的世界里,在思考护城河的时候,我们会鼓励你尽可能地从客户回溯出发,想想你可以用哪些方式把自己围绕在那些客户周围。
好的。MAD 中的 A 代表可供性(Affordance)。这是一个我们从设计领域借来的术语。锤子是一个具有"可供性"的物体。我有一个 2 岁的儿子。如果我给他一把锤子,他知道该怎么用。他会拿起来开始到处敲。这就是我们不给他锤子的原因,好吧?一个具有可供性的物体是不需要解释的。人们就是知道该怎么用它。Claude Code 极其强大。你去给一个普通的财富 500 强企业员工打开一个终端,看看他们能走多远。虽然它很强大,但它并没有提供那么多可供性。这不是在批评 Anthropic,但这对于任何想在上面构建的人来说是一个机会——为你的特定客户和他们的特定问题创造阻力最小的路径,让他们极其简单地就能找到达成业务目标的方法。这就是可供性的概念。
最后,MAD 中的 D 是扩散(Diffusion)。扩散鸿沟就是应用层公司的机会。技术能力扩散到市场中的速度,远远慢于这些能力被创造出来的速度。每一天,当基础模型的进化速度比你的普通财富 500 强企业更快时,那个鸿沟就更大了,那个机会也更大了。所以在护城河方面,尝试从客户回溯的角度思考;在可供性方面,尝试为你的客户创造阻力最小的路径;而那个扩散鸿沟就代表着你的机会。
如果之前那张白色空间开始被填满的幻灯片让在座任何人感到沮丧的话——请记住,没有领先是安全的。赛车界有句话:晴天你没法一次超过 15 辆车,但雨天你可以。而现在,基础模型正在倾盆大雨般地输出新能力,这意味着没有领先是安全的。但这也意味着任何人都可以赢。活在这个时代,多好啊。说到这里,我把话筒交给 Sonya。
**Pat Grady:** Good morning. How's everybody doing? All right. All right. A little bit better. Hey, thank you all for being here. We really appreciate it. We do t
his as a service to the community because we are living through important times and it's an honor for us to be able to serve as a bit of a gathering p
lace for people to come together. And this is by far the best agenda we put together and by far the best set of attendees. And so we just want to star
t out by saying thank you. We know you all crazy busy. Thank you for being here today. So, we do have a pretty exceptional agenda put together. As per
usual, Sonia in the impeccable outfit, Constantine and I are going to say a couple of words to start. You know, we have we have the privilege of bein
g in the midst of a lot of conversations with a lot of interesting people. And so, once a year, we like to try to synthesize that a little bit and sha
re back with you what it is we've been hearing. And so I'll say a few words of overall calibration and then Sony will say a bit about what we see toda
y and then Constantine will say a bit about what we think might be coming tomorrow. So for calibration, we're going to start by zooming out going back
to these siliconebased transistors which gave this area its name that got built into systems connected by networks that went public in the form of th
e internet supported applications like social media in the cloud eventually showed up on our pockets in mobile devices that today are capable of doing
something indistinguishable from magic which is AI. The reason we like to show this slide, and those of you who've been here in the past have seen th
is before, is because it reminds us that all of these waves are additive. And we sort of needed all of these decades of evolution to have the compute,
the bandwidth, the data, the talent to make the most of this moment. Now, this AI wave is a little bit different in three ways. First, it's the bigge
st wave yet. And that's generally true. But there is something more specifically true about this wave, which is it is the first one that is both softw
are and services. The top row shows the first 15 years of the cloud transition where the TAM for software went from about 350 billion to 650 billion a
nd cloud grew to be about 400 billion of that. The bottom row is what is brand new. This is the services revenue that seems to also be available now.
10 trillion is a conveniently round number. We don't know if it's 10 trillion or 5 trillion or 50 trillion. We do know that legal services in the US a
lone is a $400 billion market. That is one vertical and one geo and it's the same as all of software. So this opportunity is immense. Point number two
, fastest wave yet. I think we can all feel this. What it means is that this white space, and I direct your attention to the AI side of this page, thi
s whites space is getting filled pretty fast. These logos are the companies that got to a billion plus of revenue as a result of the cloud, mobile, an
d now AI tectonic shifts. And at current course and speed, there are more coming soon. Point number three, which is probably the most interesting one
and I borrow this from my partner Constantine is that there are two basic kinds of revolutions in technology. There are revolutions of communication w
hich are about the way information is distributed. Most of the people in this room have only lived through revolutions in communication. The internet,
the cloud, mobile, those are all about information distribution. Those are revolutions in communication. AI is different. AI is this one. AI is a rev
olution in computation. It's about how information is processed. And that might sound like semantics, but these are fundamentally different shapes of
waves. And maybe the most visceral way to feel this is to think about the fact that the floor keeps moving underfoot. The technology foundation on whi
ch everybody builds changes every day when new capabilities come out. And we've had three major inflection points over the last handful of years. Firs
t one chat GP chat GPT moment November 2022 the world saw the power of pre-training. Second one couple years later 01 model reasoning all of a sudden
a second scaling law emerges around inference time compute. Third one, just recently cloud code, nopus45, now 47. The world saw the power of long hori
zon agents. And while these look like three points on a continuum, it's kind of a hard break between two and three. It's a little bit of a discontinuo
us change. And if we may be so bold, we would say that this is AGI. And look, I'm an ecom major. We're venture capitalists. not about to propose a tec
hnical definition for AGI, okay? We study founders and markets and the collision thereof, which is businesses, but we do study businesses. And so from
a commercial standpoint, from a practical standpoint, from a functional standpoint, if you can dispatch an agent to do a job and it can recover from
failure and persist until that job is done, I don't know. That feels pretty much like AGI. Even if you don't think it's AGI, which is fine. By the way
, Sonia will talk a lot more about this in her part. Even if you don't think it's AGI, I think we can all see that the cars have arrived. Last few yea
rs, we've had a lot of faster horses, applications that made you 10 or 40% more productive, but didn't fundamentally change the way you work. Now we'r
e starting to see cars, applications that make you 10 or 40x more productive and absolutely change the way that you work. Change the nature of your wo
rk. Change the nature of your organization. Cars have arrived. This is the founder of Sequoia, Don Valentine. He was famous for asking one question. S
o what? Why does all this stuff matter? What matters is because just in the last few months the race has begun and it's a different kind of race than
what we're used to. The way you drive a car is different than the way you ride a horse. The way you build a car is different than the way you take car
e of a horse. So, it's a very different sort of race. And one of the reasons that we wanted to gather everybody here today is because nobody has all t
he answers. And the more time we can spend together, the more we can learn and hopefully figure out where all this stuff is headed. And it's important
that we do so as soon as possible because there's a lot at stake just from a commercial perspective. There's 10 trillion dollars up for grabs. We got
labs coming at it from a tech out approach. We got startups building on top coming at it from more of a customerback approach. We do have all of the
labs represented in this room, but most of you are building on top. And so we'll take a minute and uh talk about that customerback approach. So our ad
vice for those of you who are building on top of the labs, it's free advice and so it's worth every penny you paid for it. Our our advice would be to
get mad and we don't we don't actually need you to be angry. You can be angry if you want. If that's what drives you, that's cool. Go ahead and be ang
ry. But this is just a convenient acronym for modes, affordance, and diffusion, which are three characteristics or three pillars of a strategy for bui
lding on top of the models. first on moes just for fun. Anybody remember this slide from last year? One person and he's my partner. Okay, that's cool.
Um, we're not gonna Okay, as a reminder, this slide shows the merchandising cycle, which is the links in the value chain required to take something f
rom an idea to a happy customer. We're not actually going to go through the links in the chain. The point that I want to make here, if you approach th
ings from a techout point of view, each link in the chain gets approached a little bit different. If you approach things from a customer back point of
view, each link in the chain you approach a little bit different. Now, here's a part that's counterintuitive. In a revolution of computation, which i
s about information processing, what you want to do is look down here because there's cool new stuff coming out all the time. What you should actually
do for the sake of building modes is look up here because your customers are not changing nearly as fast as the capabilities are changing. The things
that you build might be irrelevant tomorrow. The degree to which you wrap yourself around your customers is going to be a bit more durable. That's no
t to say that product and technology is not important. It is insanely important and generally speaking, best product wins. But in a world where produc
t changes so fast because capabilities change so fast in thinking about modes, we would encourage you to go as customer back as possible and think abo
ut all the ways you can wrap yourself around those customers. Okay. The A in MAD stands for affordance. This is a term that we borrow from the design
world. A hammer is an object that has a force. I have a 2-year-old son. If I gave him a hammer, he would know what to do with it. He would grab you, s
tart hitting stuff. It's why we don't give him hammers. Okay? An object with affordance is one that doesn't need to be explained. People just know wha
t to do with it. Cloud code is insanely powerful. Go open up a terminal for the average Fortune 500 employee and see how far they get. While it is pow
erful, it does not offer that much affordance. That's not a knock in on anthropic, but it is an opportunity for anybody who wants to build on top and
to create paths of least resistance for your specific customers and their specific problems so that it's just braindead simple for them to figure out
how to get to the outcome that they need for their business. That's the concept of affordance. And then finally, the D in MAD is diffusion. And the di
ffusion gap is the opportunity for companies building at the application layer. The rate at which capabilities are diffusing out into the market is fa
r shy of the rate at which those capabilities are being created. And every day that the foundation models move faster than your average Fortune 500 en
terprise, that gap gets bigger and that opportunity gets bigger. So for modes try to think customer back for affordance try to think about creating th
ose paths of least resistance for your customers and that diffusion gap that represents your opportunity. Unless that slide from earlier with the whit
e space starting to fill up was discouraging for anybody. May we remind you that no lead is safe. There's this expression in racing. You cannot pass 1
5 cars in the sun, but you can pass 15 cars in the rain. And right now, there is a torrential downpour of new capabilities coming out of the foundatio
n models, which means that no lead is safe. But it also means that anybody can win. What a time to be alive. And with that, I'll hand it off to Sonia.
**Sonya Huang:** 谢谢 Pat。我必须说,在观众席看到这么多熟悉的面孔真好。今天这里有一群非常出色的人,我真的很高兴能和你们所有人一起成为这个生态系统的一部分。我这部分的目的是谈谈 AI 现在正在发生什么——对于 2026 年来说,就是 agent。
好,闪回到 2022 年。举个手,有人还记得 AutoGPT 或 BabyAGI 吗?好的。好的。这些项目曾经是 GitHub 上的一夜爆红。它们做的事情是拿 GPT-3,给它一些工具,套在一个循环里,让它朝着一个目标运行。这很有前景,直到你看着那些 agent 一次又一次又一次地失败。有点可爱,有点让人心软,但完全没用。我放这张幻灯片是为了提醒大家,你知道,我们都知道 agent 会来。我们几年前就能看到了。但在 2022 年,模型还没有准备好。
快进到今天。大概在年初前后,有些东西真正发生了变化。突然间,我们周围到处都是 agent,而且它们似乎真的在起作用了。有两个 agent 特别成功。Claude Code 面向技术人群,OpenClaw 和它所有的龙虾兄弟们则把 agent 民主化到了任何一个有手机的人。所以,不管你是硬核工程师还是普通人,结论就是现在任何人都可以创建 agent 了。
我们看到的是,人们在为所有事情构建 agent。有一些很搞笑的东西,比如一个 OpenClaw agent 会去举报你的邻居逃税。请不要这样做。或者——其实也许请这样做。[笑] 有创业类的东西,agent 运营生成式媒体营销来销售建筑服务。然后还有专业层面的。我可以告诉你,Sequoia 内部正在疯狂地竞争谁能构建最好的 agent 来更好地完成我们的工作。
那么,什么是 agent?这里有一个可能的定义。Agent 是一个能感知环境、选择行动、并自主朝目标推进的系统。顺便说一下各位,这个我是自己用 Cdance 做的。我非常自豪。视频模型已经进步了很多。更具体地说,我认为 agent 有三个功能组件。第一是推理和规划的能力。这是基础层的直觉以及随机应变的能力。第二是采取行动的能力。这是工具——搜索、写入、编译。第三是朝目标迭代的能力。这是让 agent 能够在长时间跨度内完成任务的持久性。所以"能动性"(agency)结合了这三样东西。它就是把事情搞定的能力。
如果我们把 agent 拆解成它们的组成部分——模型、工具、框架——每个组件在过去一年都快速进步了。首先,模型是大脑。这是发生的最重要的事情。这个仪表图测量的是一个模型在复杂任务上能持续推进多长时间而不偏离轨道。一年前我们还在几十分钟这个量级,今天已经到了几小时这个量级。所以这是发生的最重要的事情——模型终于变得足够强大,能够在长时程任务上保持性能。
第二,工具是手臂和腿。它们让模型能够使用那些让我们在电脑上高效工作的东西。终端用于文件系统和开发工具、iMessage、Slack、网络搜索、计算机使用,等等。过去二十年我们为人类构建的工具,最终被证明也对 agent 极其有用,可以直接移植过来。有一种常见的论调说 SaaS 已死。我认为恰恰相反——随着使用这些工具的 agent 数量增加,这些工具的价值会爆炸式增长。
模型和工具赋予 agent 能力。框架(harness)赋予它们持久性——保持专注、适应并继续前进的能力。而那个反馈循环现在真的开始转起来了,特别是有了强化学习(Reinforcement Learning)。我们正在把这些 agent 送去"驾校",在 RL 训练场里训练它们,我们正在不同场景下——从机械工程到设计到金融——推动性能提升。我们还看到了自我改进的早期苗头,也就是机器建造机器。比如 Andrej 的另一个研究项目,能够自主地朝着 GPT-2 级别的模型进行研究改进,只需两个小时。
那么,一个到处都是 agent 的世界会是什么样子?Agent 存在于一个 agent 程度的滑动标尺上。以编程为例。2023 年,我们有 Tab 自动补全。一个 AI 在行内辅助一个人类。这是增量式的有用,但根本不是变革性的。现在我们有了 agent 式开发。一个人类与一个 agent 对话,指挥它做什么,也许管理一个 agent 团队。但这种范式正在被进一步推动。我们现在看到的是后台 agent、异步 agent、agent 生成子 agent。我们认为异步 agent 这种范式在量上可能会超过当前的范式,就因为系统中的杠杆效应太大了。然后在最前沿的出血边缘,是我所说的"暗工厂"(dark factories)——完全把人类审核从系统中移除。这听起来很疯狂,但我在生产环境中见过,包括在网络安全公司。只要有足够好的护栏和足够好的工程,这是可能的。
所以我们正在一个 agent 程度的标尺上向上攀升。Agent 正在从在你身边做少量事情的小助手,变成需要被管理的实习生,再变成能自我管理的实习生,最终变成足够值得信赖、可以在没有监督的情况下推送到生产环境的实习生。这不仅仅发生在编程领域,而是发生在所有 agent 领域。
对于在座的创始人来说,最重要的收获是:服务就是新的软件。Pat 从我认识他以来就一直在说这句话。我们的合伙人 Julian,今天也在观众席,发表了一篇很好的文章讨论这个问题。我们早就知道了,但我觉得它现在真的在发生。
在医疗领域,你可以雇一个 agent 来检查你的基因组,给你个性化建议,给你开药,推荐你参加临床试验。在法律领域,你可以雇 agent 代表你谈判合同,甚至进行诉讼,帮你和解。在数学和科学领域,我们看到 agent 可以解决 Erdős 问题或发现新的超导体。这多让人振奋啊?在消费者领域,个人 agent 可以帮你管理收件箱、日历、财务,帮你报税。
我们预计到处都会有 agent。部分原因是雇佣 agent 比雇佣员工容易得多。人类很难扩展。Agent 可以随着算力无限扩展。人类很难让他们保持开心。除了我,我一直都很开心。[笑] Agent 是低维护的。人类很贵,你要付他们工资。Agent 你付 token。通常用 token 完成一项任务的成本比等价的工资要低。今天人类在大多数情况下仍然更聪明,但苦涩教训(the bitter lesson)持续发挥作用,很快 agent 在很多事情上会更聪明。
所以这张幻灯片的要点不是说我们人类要失业了。我认为人类一个独特的特质就是适应能力。但我们确实预计 agent 在应用层的部署将会非常迅速,规模史无前例,因为经济性太清晰了,也因为其固有的可扩展性。
如果你把这些加在一起,agent 的数量正在以某种指数甚至超指数的方式膨胀。我觉得我们即将到达事情变得真正奇怪的那个点。当商业行为发生在 agent 之间会怎样?它们能互相付款吗?当 agent 真的能彼此协商交易条款会怎样?我们会不会有 agent 群体在监管我们,防止网络安全事件或大规模灾难?我们唯一知道的是,世界正在以极快的速度变得奇怪。
所以我最后要引用一下我内心的 Ben Serate 的话。长时程 agent 已经到来。它们所在的曲线非常清晰。对于创始人来说,我觉得每个人都有通过 AI 完成了疯狂困难时间线的例子。Nathan 来自 Zed,他用 Claude Code 在假期里独自一人完成了一个三年期的登月项目。Brett Taylor 在一个周末重建了 Sierra。Notion 团队在短短六周内重写了 800 万行代码。每个人都有这些被压缩的时间线的例子,但我觉得在 AGI 实验室之外,很少有人见过当你把这些被压缩的时间线一个叠一个地堆起来会发生什么。而这就是现在可能实现的事情。所以无论你能想象在未来一百年建造什么,我们认为现在 100 天内就可以实现——多亏了 agent。我把话筒交给 Constantine。
**Sonya Huang:** Thank you, Pat. And can I just say it's so nice to see so many friendly faces in the audience. There is an exceptional group of people here today and
I'm just really happy to be part of this ecosystem with all of you. And so the purpose of my section is to talk about what's happening in AI right no
w which for 2026 is agents. Okay, flashback to 2022. Show of hands. Does anybody here remember AutoGPT or baby AGI? Okay. Okay. So these projects were
overnight hits on GitHub. And what they did was they took GPT3, gave it some tools, wrapped it in a loop, and let it run towards a goal. And it was p
romising until you watch those agents just fail over and over and over again. Kind of cute, kind of endearing, but completely useless. And I put this
slide here to remind us that, you know, we all knew agents were coming. Uh we could have seen it years ago, but back in 2022, the models just weren't
ready yet. Fast forward to today. Something around the turn of the year really changed. Suddenly, we have agents everywhere around us and they seem to
actually be working. Two agents in particular have been home runs. Cloud code for the technical crowd and OpenClaw and all of its lobster brethren, u
h, which democratize agents to anybody with a phone. And so whether you are a hardcore engineer or a normie, the punch line is that anybody can create
agents now. And so what we're seeing is people are building agents for everything. There is silly stuff like an open call agent that will literally s
nitch on your neighbors for tax fraud. Please don't do this. Or actually maybe please do this. Uh there's entrepreneurial stuff, agents running genera
tive media campaigns to sell construction services. And then there's the professional layer. I can tell you there's a huge race internally at Sequoia
for who can build the best agents to do our jobs better. So what does it mean to be an agent? Here is one possible definition. An agent is a system th
at perceives its environment, chooses actions, and progresses autonomously towards a goal. By the way, guys, I made this in C dance by myself. I'm I'm
very proud of it. The video models have come a long way. Uh and more specifically, I view agents as having three functional components. First is the
ability to reason and plan. Uh this is the baseline level of intuition uh and the abil ability to think on the fly. Second is the ability to take acti
ons. This is tools, search, write, compile. And then finally, the ability to iterate towards a goal. This is the persistence that gives agents the abi
lity to accomplish things over long time horizons. And so agency combines these three things. It is simply the ability to get done. If we boil agents
out into their constituent components, the models, the tools, the harnesses, each component has progressed rapidly over the last year. First, the mode
ls are the brain. This is the most important thing that's happened. The meter chart measures uh how long a model can sustain progress on a complex tas
k without going off the rails. And we've gone from the order of tens of minutes a year ago to the order of hours today. And so this is this is the mos
t important thing that's happened. the models are finally getting capable enough to sustain per performance on long horizon tasks. Second, the tools a
re the arms and the legs. These give models access to things that make us productive on a computer. The terminal for file systems and dev tools, iMess
age, Slack, uh web search, computer use, you name it. And the last two decades that we spent building tools for humans have ended up being uh able to
port over to be incredibly useful for for agents as well. And there's a common refrain that SAS is dead. I think to the contrary the value of these to
ols is going to explode as the number of agents using them increases. Models and tools give agents capability. The harness is what gives them persiste
nce. The ability to stay on task, adapt, and keep going. And that feedback loop is now really starting to crank, especially now with reinforcement lea
rning. We're giving these agents uh we're taking them to driving school, giving them uh you know, training them uh in RL gyms, and we're pushing perfo
rmance in different settings from mechanical engineering to design to finance. We're also seeing the early glimmers of self-improvement or the machine
building the machine. uh for example, Andre's other research project uh improves research autonomously towards a GPT2 level model in just two hours.
So what does a world of agents everywhere look like? Agents exist on a sliding scale of agentness. Um and so let's take coding as an example. In 2023,
we had tab autocomplete. One AI assisting a human in line. This was incrementally useful, fundamentally not transformative. We now have agentic devel
opment. One human talking to an agent, instructing it what to do, maybe managing a team of agents. But this paradigm is getting pushed further. We're
now seeing background agents, async agents, agents spawning sub agents. We think that async agents in this whole paradigm is likely to overtake the cu
rrent paradigm in volume just because the amount of leverage in the system. And then finally pushing the bleeding edge of the frontier, what I call da
rk factories. um taking human review out of the system completely. This sounds crazy, but I've seen it happen in production, including with cyber secu
rity companies. It is possible with good enough guardrails and good enough engineering. So, we're progressing up a scale of agenticness. agents are go
ing from little helpers that do a little amount by your side to interns that need to be managed to interns that manage themselves and eventually to in
terns that can be trusty enough to p to push to prod without oversight. And so that's the evolution that's happening not just in coding but across all
of agents. Uh the most important takeaway uh for the founders in this room is that services is the new software. Pat's been saying this for as long a
s I've known him. And our partner Julian, who's in the audience today as well, uh, published a great article on this. And we've known this for a long
time, but I think it's actually happening. So, in medicine, you're able to hire an agent that inspects your genome, gives you personalized recommendat
ions, can prescribe you medication, uh, recommend you clinical trials. In law, you'll be able to hire agents that can negotiate contracts on your beha
lf, even perform litigation, and settle for you. in math and the sciences, we're seeing agents that can solve Airdos problems or discover new supercon
ductors. Like, how thrilling is that? Or in uh in the consumer world, personal agents that can manage your inbox for you, your calendar, your finances
, uh file your taxes. And we expect there's going to be agents everywhere. And that's in part because hiring agents is so much easier than hiring empl
oyees. Humans are hard to scale. Infinites are infinitely scalable with compute. Humans are hard to keep happy. Except for me, I'm always happy. Uh ag
ents are low maintenance. Uh humans are uh humans are expensive. You pay them salaries. Uh you pay agents tokens. Generally, it costs less to accompli
sh a task with tokens than the equivalent in salary. Today, humans are still generally smarter, but the bitter lesson presses on and soon agents will
be smarter at many things. And so, the point of this slide is not that we humans are out of a job. Uh I think a uniquely human trait is adaptability.
But we do expect the deployment of agents across the application layer to be swift and at an unprecedented rate in scale because the economics are so
clear and because of the inherent scalability of fits. So if you add all this up, the number of agents is ballooning on some sort of exponential, mayb
e a super exponential. And I think we're about to hit the point where things get genuinely strange. Uh what happens when commerce happens between agen
ts? Can they pay each other? What happens when agents can actually negotiate the terms of a transaction with each other? Are we going to have swarms o
f agents policing us, preventing things like cyber security or megadan? All we know is the world is getting weird extremely quickly. And so I'll close
by channeling my in my inner Beneserate. Uh long horizon agents are here. The curve that they're on is very clear. And for founders, I think everybod
y has examples of people that are accomplishing insanely hard timelines thanks to AI. So, Nathan from Zed accomplished a three-year moonshot project o
ver the holidays by himself with cloud code. Uh Brett Taylor rebuilt Sierra over a weekend. Um the notion team rewrote 8 million lines of code in just
six weeks. And so everybody has these examples of compressed timelines, but I think very few people outside of the AGI labs have seen what happens wh
en you take these compressed timelines and you stack them on top of each other. And that's what's possible now. And so whatever you can imagine buildi
ng over the next hundred years, we think is now possible in a 100 days thanks to agents. I will pass over to Constantine.
**Constantine Buhayer:** 谢谢 Sonya。好的。非常感谢 Sonya 和 Pat 精彩的概览分析。在这个部分,我们要谈谈接下来会发生什么。目标是——我们都知道我们身处 AI 时代。它会是什么样子?它会有什么感觉?它有什么特征?
在演讲的前半部分,Pat 把技术革命分为了计算和通信两类。我们在这里要做另一种划分——针对工作的类型。有体力工作。这是一个驿马快递上的包裹。这是一颗 Falcon 9 火箭上的卫星。功等于力乘以距离。物理运动。然后是认知工作。这是毕达哥拉斯提出他的定理。这是 DeepMind 解决蛋白质折叠(protein folding)问题。有意识的思考。这些是非常不同类型的工作,但我们相信它们将遵循非常相似的革命模式。
让我们先说体力工作,因为我们已经经历过工业革命了。在人类历史的绝大部分时间里,为人类服务的所有工作或几乎所有工作都是由某种肌肉完成的。人或动物。人搬动东西,或者动物拉着人前行。这张图从 1700 年开始,但实际上可以追溯到数千年前。然后事情开始改变。水力和风力、蒸汽机,然后事情加速了。蒸汽机、内燃机、电动机。到今天 2026 年,你可以估算地球上为人类完成的所有体力工作中,99% 以上是由机器完成的。把你带到这里的飞机、这个房间里所有物品的制造、所有让你此刻拥有人类体验巅峰的运输设施。
好,我们认为类似的模式将在认知领域发生。我们只是处在更早的阶段。在人类历史的大部分时间里,地球上为人类做的所有思考主要都是由人类完成的,也许动物有一点点——牧羊犬追赶羊,对吧?还有上面一薄层机械工作,比如星盘或时钟。在过去几百年里,进步不大,直到电子计算出现。在过去一百年里,想想在任何给定时刻正在发生的数万亿次计算,用来服务你这个人类。所有那些正在发生的认知工作,在任何给定时刻服务着我们。万亿次计算。我们相信神经网络是下一个大浪潮,在不远的将来,地球上 99.9% 的认知将由机器完成。
这个类比是相当鲜明的。好消息是我们已经经历过一次这样的革命。认知革命将会很像工业革命,只是规模大得多、速度快得多。
那么,生活在这个未来会是什么样的?我想用四个小故事来分享一些关于这个未来的启示。
第一个故事。19 世纪中叶,美国想为我们的第一任总统和最伟大的战争英雄乔治·华盛顿建造一座宏伟的纪念碑。于是我们设计了当时世界上最高的建筑——华盛顿国家纪念碑。我们想在顶端覆上世界上最珍贵的金属——100 盎司世界上最珍贵的金属。事实上它珍贵到我们把它放在曼哈顿的 Tiffany's 橱窗里展示。那种金属是铝。在华盛顿国家纪念碑建成后的几十年内,一位年轻的发明家想出了电解法,就是把铝从泥土中分离出来的过程。又过了几十年,铝被用来包裹我们的糖果和三明治,然后被扔进垃圾桶。铝就是智能。电解法就是人工智能。我们即将进入一个世界,在那里一些花了几十年才修炼成的最珍贵的技能——博士级别的技能——可以被瞬间调用,用完之后你就可以揉成一团扔进垃圾桶。
第二个故事,我们正在进入一个异形设计的世界。我们今天看到的世界完全是为人类设计的。你知道,它被优化成了对我们的大脑来说有道理的方式,因为世界上几乎所有的认知都是我们在做。好,当机器来做认知的时候,情况会有些不同。2006 年,NASA 在为一个大型太空卫星任务优化天线。传统上,他们的天线长这样——一个美丽的几何对称图案,在功率约束条件下优化了表面积。这次他们说,我们把它交给计算机,用进化算法——很像强化学习。结果就是这个天线——性能大幅提升,但对人类思维来说并不直观。在这个 AI 时代,当我们把认知交给机器时,我们会得到对我们来说不直观的结果。当 AI 在设计芯片、汽车、建筑时,它们可能看起来完全不同。我们进入的那个世界,我们必须保持开放心态,因为 AI 不会像我们一样思考。它会有异形设计(alien design)。
第三个启示故事是关于新兴科学。注意,不是新兴科学(单数),我们都知道有新兴科学。我说的是新兴科学(复数,sciences)。在工业革命早期,你有 Newcomen 和 Watt 这样伟大的工程师,他们完善了内燃机。基本上就是把石化燃料放进一个活塞,点火,数百万、数十亿颗粒爆炸,推动活塞做功。在将近一百年里,所有这些都是"摸索"(tinkering)。一个工程师说:"啊,这样好一点。"也许能看到类似扩展定律的东西,但那就是工程师在摆弄产品,看看怎么能改进一点。在 120 多年后,Sadi Carnot 出现了,把这些形式化为一门新科学——热力学(thermodynamics)。他说:"等一下。有数百万或数十亿颗粒。我们实际上可以把这一切的规律形式化。"在我们的情况里,有数十亿个神经元,数万亿个 token。现在,我们还处在 AI 的摸索阶段。即使我们觉得它已经是一门被理解的科学了,其实不是。在未来,我们将在接下来的几十年里看到一门和热力学一样根本性的新科学诞生。在座的某个人也许会提出那门科学。那门科学将被写进高中教材。它就是那么根本性的。它将帮助我们掌握 AI。它甚至会帮助我们理解意识。
第四个故事,非理性的艺术(the art of unreason)。在人类历史的绝大部分时间里,数万年间,艺术一直是朝着写实主义的方向进步的。这是大约 25000 年前的洞穴壁画、埃及象形文字、希腊陶器、文艺复兴绘画——一场朝向写实艺术的宏大变革。看看这个差异。跨越数万年,人类的胜利。然后工程来了——达盖尔银版照相法、早期摄影——突然之间,花了几十年人生才完善的技艺、让每一笔都完美的功力,没了。
那么世界如何反应?人们觉得绘画完了。就这样了,机器比任何人都做得更好。艺术结束了。然后发生了什么?人类如何回应?人类的回应是说:这种艺术的目的是要像眼睛看到的那样捕捉瞬间,还是要像心灵和灵魂感受到的那样捕捉瞬间?印象派、表现主义、立体主义、新表现主义。所有这些新的艺术形式,就是人类对科学的这一剧变做出的回应。
2500 年前,希腊哲学家普罗塔哥拉斯(Protagoras)写道:"人是万物的尺度。"他的意思是,在真空中没有任何东西对人类有价值。不是铝、不是艺术、不是智能。它们有价值仅仅是因为体验。AI 能做那些工作。AI 将会做那些工作。但只有人与人之间的联结才能给你一个在乎的理由。
这就是我们今天都在这个房间里的原因。十年后,工作会大不相同。很多东西都会改变。但有一件事是不变的——你今天和身边的人建立的关系将会持续下去。那才是你将来会回顾的。那才是今天真正有价值的东西。所以我鼓励大家和身边的人建立那些关系,享受在这次 AI Ascent 上共处的时光,真正投入到让我们最具人性的那些东西中去。
**Constantine Buhayer:** Thank you, Sonia. All rig
ht. Thank you so much Sonia Pat for the brilliant overview analysis. In this section we're going to talk a little bit about what's next. Uh so the goa
l here is we all know we're in the AI age. What's it going to look like? What's it going to feel like? How is it characterized? Earlier in the present
ation, Pat bifurcated technological revolutions between comput and communication. We're going to do another bifurcation here for types of work. There
is physical work. This is a package on the Pony Express. This is a satellite on a Falcon 9. Work equals force times distance. Physical movement. And t
hen there's cognitive work. This is Pythagoras coming up with this theorem. This is deep mind solving the pro protein folding problem, conscious think
ing. These are very different types of work, but we believe that they're going to follow a very similar pattern in revolution. So let's talk about phy
sical work because we've been through this revolution with the industrial revolution. For the vast majority of human history, all the work or virtuall
y all the work for serving humans was done by some sort of muscle. People or animals. People moving something or an animal pulling the human along. Th
is starts at 1700, but it goes back millennia. Then things started to change. Water and wind, steam engines, and then things accelerated. Steam engine
s, combustion, electric motors. Today 2026, you could estimate that 99 plus% of all the physical work done on planet Earth for humans is done by a mac
hine. The plane that brought you here, the manufacturing of all the goods in this room, all the transportation that sets up for the pinnacle of the hu
man experience you're having right now. Well, we think a similar pattern is going to happen in cognition. We're just a little earlier on. So, for most
of human history, all the thinking on planet Earth for humans was done primarily by humans, maybe a little bit for animals, the the sheep dog chasing
the sheep, right? And there was this sliver on top of mechanical work, the astrolabe or the clock. Now over the past couple hundred years there was n
ot a lot of progress until electronic computation and in the past hundred years think about all the trillions of calculations that are happening at an
y given moment to serve you the human. All of that work all of that cognitive work that's happening to serve us at any given moment. Trillions of calc
ulations. We believe that the neural network is the next big wave and that in the near future 99.9% of cognition on planet Earth will be done by machi
nes. Well, the parallel is pretty stark. And the good news is we've been through a revolution like this. The cognitive revolution is going to be a lot
like the industrial revolution, just much, much bigger and much faster. So, what's it going to be like living in this future? I'd like to share some
motivations for this future in the form of four short stories. The first story. In the mid 1800s, America wanted to build a grand monument to our firs
t president and our greatest war hero, George Washington. So, we designed the tallest building in the world at the time, the Washington National Monum
ent. And we wanted to cap it with the most precious metal in the world, 100 ounces of the most precious metal in the world. So precious in fact that w
e put it on display at Tiffany's in Manhattan. That metal was aluminum. Within decades of the completion of the Washington National Monument, a young
inventor came up with electrolysis, the process of separating aluminum from dirt. And within decades, aluminum was used to wrap our candies and our sa
ndwiches. and then tossed into the trash. Aluminum is intelligence. Electrolysis is artificial intelligence. We're about to enter a world where some o
f the most precious skills that took decades to earn, PhD level skills, are so instantly invoked that right after using them, you can crumple them up
and throw them right in the trash. Story number two, we are entering a world of alien design. The world as we see it today is all about design for hum
ans. You know, it's it's been optimized in a way that makes sense to our brains because we are doing almost all the cognition in the world. Well, when
machines do the cognition, it's going to be a little different. In 2006, NASA was optimizing an antenna for a large space mission, satellite space mi
ssion. And traditionally, their antennas looked like this. It was a beautiful geometric symmetrical pattern that optimized surface area for some power
constraints. This time around they said we're going to hand it over to computer and we're going to have an evolutionary algorithm a lot like reinforc
ement learning. The result this antenna right here dramatically more productive not intuitive to the human mind. In this AI era, when we hand over cog
nition to machines, we're going to get results that are not intuitive to us. When AI is designing chips, cars, buildings, they might look dramatically
different. The world that we enter into, we have to be open-minded because the AI is not going to think like us. It's going to have alien design. The
third motivation story is on emerging sciences. Not emerging science. We all know there's emerging science. I'm talking about emerging sciences. In t
he early industrial revolution, you had great engineers like Newman and Watt and they perfected combustion engines. Basically put a petrochemical into
a piston, ignited on fire, millions, billions of particles explode. move the piston work. For almost a hundred years, all of that was tinkering. It w
as an engineer saying, "Ah, that works a little bit better." Maybe something you could see see like a scaling law, but it was engineers playing with t
he product and seeing how they can improve it a little bit. over 120 years after Sadi Caro came around and formalized this in a new science thermodyna
mics. He said, "Wait a second. There are millions or billions of particles. We can actually formalize what that all looks like." In this case, there a
re billions of neurons, trillions of tokens. Right now, we're in the tinkering phase of AI. Even if we think it's an understood science, it's not. In
the future, we will have a science as fundamental as thermodynamics introduced in the next couple decades. Someone in this room might come up with tha
t science. And that science will be taught in high schools. It will be that fundamental. And it will help us master AI. It will even help us master co
nsciousness. Fourth story, the art of unreason. So for the vast majority of human history, tens of thousands of years, art has been a progression towa
rds realism, this is uh cave painting from about 25,000 years ago, Egyptian hieroglyphs, Greek pottery, uh Renaissance paintings, uh a grand transform
ation toward realistic art. Just look at the difference. Over tens of thousands of years, the triumph of humanity, and then engineering came along, th
e Dogger type, early photography, and all of a sudden what was spent decades of life to perfect the skill of getting every brush stroke perfect gone.
So, how did the world react? They thought that painting was over. Oh, that's it. The machine can do it better than any human. Art is art is ended. Wel
l, what happened? How did humans respond? Humans responded by saying, was the purpose of this art to capture the moment in the way the eye sees it or
was it to capture the moment in the way the heart and the soul sees it? Impressionism, expressionism, cubism, neoexpressionism. All these new forms of
art are how humanity responded to this dramatic change in science. 2500 years ago, Greek philosopher Pagoras wrote, "Man is the measure of all things
." What he meant is that nothing in a vacuum has value to humans. Not aluminum, not art, not intelligence. It only has value because of the experience
. AI can do the work. AI will do the work. But only the human connection can give you a reason to care. That's why we're all in this room today. A dec
ade from now, work is going to be dramatically different. Things are going to change so much. But the one thing that will be constant is the relations
hips that you form today with the person right next to you will endure. That's what you're going to look back on. That's what's going to be valuable f
rom today. So, I encourage you to form those relationships with the people next to you, enjoy your time together at this AI ascent, and really lean in
to what makes us most human.