← 首页/AI 资讯

大语言模型2026-05-18 08:15·VentureBeat

RecursiveMAS 新架构：多 Agent 推理提速 2.4 倍，Token 消耗降低 75%

VentureBeat 报道了 RecursiveMAS 多 Agent 推理加速架构，该方案将推理速度提升 2.4 倍的同时，Token 消耗降低 75%，为大规模 Agent 部署提供了新的优化路径

RecursiveMAS：多 Agent 推理加速革命

2026 年 5 月，VentureBeat 报道了 RecursiveMAS 多 Agent 推理加速架构。

性能数据

速度提升：推理速度提高 2.4 倍
Token 节省：Token 消耗降低 75%
架构创新：递归式多 Agent 协调机制

技术原理

通过递归式的 Agent 协调减少重复推理
智能缓存中间推理结果避免重复计算
动态分配 Agent 资源根据任务复杂度

行业意义

多 Agent 系统的 Token 成本一直是大规模部署的瓶颈
75% 的 Token 节省可能改变企业 AI 的经济模型
为 Agent 编排框架提供了新的优化参考

来源: VentureBeat
链接: https://venturebeat.com/orchestration/how-recursivemas-speeds-up-multi-agent-inference-by-2-4x-and-reduces-token-usage-by-75

📰 原始来源

https://venturebeat.com/orchestration/how-recursivemas-speeds-up-multi-agent-inference-by-2-4x-and-reduces-token-usage-by-75

← 上一篇

Intercom 更名为 Fin 并发布 AI Agent 管理 AI Agent 平台，多层 Agent 架构兴起

下一篇 →

Claude 企业战略新战场：Agent 控制平面成 Anthropic 下一步核心方向

📰 更多动态

行业2026-05-18 00:00

Eclipse 获 25 亿美元 Cerebras 订单，验证物理世界 AI 基础设施投资逻辑

行业2026-05-18 00:00

TechCrunch 深度分析：AI 淘金热中的赢家与输家，贫富差距正在扩大

行业2026-05-18 00:00

Cisco 创收与裁员同日宣布：AI 转型下的科技巨头两难