不久前,NVIDIA研究员Jim Fan提出一个很有代表性的判断:AI正在经历第二次预训练范式转移。第一次是next word prediction,第二次则是world modeling,也就是“预测下一个物理状态”。在这个视角下,模型学习的不再只是token与token的接续关系,而是世界在给定条件和动作之后,会如何继续演化。
Last month, along with a comprehensive suite of new AI tools and innovations, Google DeepMind unveiled Gemini Diffusion. This experimental research model uses a diffusion-based approach to generate ...
Inception, the company behind the first commercial diffusion large language models (dLLMs), today announced the launch of ...
Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Just yesterday, I asked if Google would ...
Google at Google Cloud Next 24 unveiled three open source projects for building and running generative AI models. The company also introduced new large language models to its MaxText project of ...
On Thursday, Inception Labs released Mercury Coder, a new AI language model that uses diffusion techniques to generate text faster than conventional models. Unlike traditional models that create text ...