Multi-Block Diffusion Language Models

A training recipe, paradigm definition, and runnable inference path for practical Multi-Block Diffusion Language Models.

Yijie Jin1, Jiajun Xu2, Yuxuan Liu1, Chenkai Xu1, Yi Tu3, Jiajun Li3, Dandan Tu3, Xiaohui Yan3, Kai Yu1, Pengfei Liu1, Zhijie Deng1,†

1Shanghai Jiao Tong University   |   2Xi'an Jiao Tong University   |   3Huawei

Corresponding Author

Explore the Project

MBD-LMs spans three parts: demonstrated decoding results, the model-side paradigm, and the inference engine that makes it runnable.