Dion2: A new simple method to shrink matrix in Muon

Published March 3, 2026, 17:59
Dion2 reduces the cost of Muon’s orthonormalization step by orthonormalizing only a small, selected submatrix at each iteration. This lightweight approach preserves Muon’s strong performance while significantly improving the optimizer’s scalability.
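
The idea of orthonormalizing only a selected submatrix can be illustrated with a minimal NumPy sketch. This is not the Dion2 implementation from the repository: the function names, the `fraction` parameter, and the selection rule (keep the rows with the largest norm) are all assumptions made for illustration, and an exact SVD stands in for the approximate Newton-Schulz iteration that Muon uses.

```python
import numpy as np


def orthonormalize(m):
    # Exact orthonormalization via SVD: replaces all singular values with 1.
    # Stand-in for Muon's approximate Newton-Schulz iteration.
    u, _, vt = np.linalg.svd(m, full_matrices=False)
    return u @ vt


def submatrix_orthonormal_update(momentum, fraction=0.25):
    # Hypothetical selection rule (an assumption, not taken from the source):
    # keep the rows of the momentum matrix with the largest L2 norm.
    n_rows = momentum.shape[0]
    k = max(1, int(round(fraction * n_rows)))
    row_norms = np.linalg.norm(momentum, axis=1)
    selected = np.sort(np.argsort(row_norms)[-k:])

    # Orthonormalize only the selected k x n submatrix; other rows get no update.
    update = np.zeros_like(momentum)
    update[selected] = orthonormalize(momentum[selected])
    return update, selected


rng = np.random.default_rng(0)
momentum = rng.standard_normal((16, 8))
update, selected = submatrix_orthonormal_update(momentum, fraction=0.25)
print(update.shape, len(selected))
```

In this sketch the expensive step runs on a 4 x 8 block instead of the full 16 x 8 matrix, which is the kind of saving the description refers to. How Dion2 actually selects the submatrix and applies the update is covered in the session and the repository.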

GitHub Repo: github.com/microsoft/dion

This session aired on March 3, 2026, at Microsoft Research Forum, Season 2 Episode 3.

Register for the series to learn about future episodes: events.microsoft.com/flow/ms/r...
Explore all previous episodes: aka.ms/researchforumYTplaylist