Research Talk: Enhancing the robustness of massive language models via invariant risk minimization

626
9.5
Опубликовано 8 февраля 2022, 17:19
Speaker: Robert West, Tenure-Track Assistant Professor, EPFL

Despite the dramatic recent progress in natural language processing (NLP) afforded by large pretrained language models, important limitations remain. A growing body of work demonstrates that such models are easily fooled by adversarial attacks and have poor out-of-distribution generalization, as they tend to learn spurious, non-causal correlations. This talk explores how to reduce the impact of spurious correlations in large language models based on the so-called invariance principle, which states that only relationships invariant across training environments should be learned. It includes data showing that language models trained via invariant risk minimization (IRM), rather than the traditional expected risk minimization, achieve better out-of-distribution generalization.

Learn more about the 2021 Microsoft Research Summit: Aka.ms/researchsummit
автотехномузыкадетское