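★ Original link: https://zhuanlan.zhihu.com/p/1998418717743289472 Author: Wang Yunhe (王云鹤). When I sat down to write this, the first thing that came to mind was that, many years ago, a certain manager asked me: what is the next hop after the transformer?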
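Tencent's WeChat AI team proposes WeDLM (WeChat Diffusion Language Model), which performs diffusion-style decoding under standard causal attention. On tasks such as mathematical reasoning it runs more than 3x faster than AR models served with vLLM, and more than 10x faster in low-entropy scenarios, while maintaining or even improving generation quality. Introduction: Autoregressive (AR) generation is what current large language ...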
ELYZA, an AI development company established by the Matsuo Laboratory at the University of Tokyo, released a Japanese-specific diffusion language model, 'ELYZA-LLM-Diffusion', on January 16, 2026.
On Wednesday, Stability AI released a new family of open source AI language models called StableLM. Stability hopes to repeat the catalyzing effects of its Stable Diffusion open source image synthesis ...
The development of large language models (LLMs) is entering a pivotal phase with the emergence of diffusion-based architectures. These models, spearheaded by Inception Labs through its new Mercury ...
Stability AI says its open-source StableLM language model is the AI for the everyman, though it apparently fails at making a peanut butter and jelly sandwich. It seems like ...