Microsoft Research355 тыс
Опубликовано 3 марта 2026, 17:59
Dion2 reduces the cost of Muon’s orthonormalization step by orthonormalizing only a small, selected submatrix at each iteration. This lightweight approach preserves Muon’s strong performance while significantly improving scalability of optimizer at scale.
GitHub Repo: github.com/microsoft/dion
This session aired on March 3, 2026, at Microsoft Research Forum, Season 2 Episode 3.
Register for the series to learn about future episodes: events.microsoft.com/flow/ms/r...
Explore all previous episodes: aka.ms/researchforumYTplaylist
GitHub Repo: github.com/microsoft/dion
This session aired on March 3, 2026, at Microsoft Research Forum, Season 2 Episode 3.
Register for the series to learn about future episodes: events.microsoft.com/flow/ms/r...
Explore all previous episodes: aka.ms/researchforumYTplaylist
Свежие видео






















