Abstracts: July 29, 2024

28
Published on 3 Jun 2025, 18:32
A lack of appropriate data, decreased model performance, and other obstacles have made it difficult to expand the input language models can receive. Li Lyna Zhang introduces LongRoPE, a method capable of extending content windows to more than 2 million tokens.

Show notes: microsoft.com/en-us/research/p...

Listen to the Abstracts series: microsoft.com/en-us/research/p...
autotechmusickids