Bayesian Inference of Grammars

537

22.4

Microsoft Research330 тыс

Следующее

07.09.16 – 1 0451:40:35

Developing Physically-Based, Dynamic Vocal Tract Models Using ArtiSynth

Популярные

30.05.23 – 3 5834:48

Microsoft’s Holoportation™ Communications Technology: Facilitating 3D Telemedicine

05.05.23 – 4 3181:32:25

Human-Centered AI: Ensuring Human Control While Increasing Automation

Опубликовано 7 сентября 2016, 16:51

Mark Johnson (Joint work with Sharon Goldwater and Tom Griffiths) Even though Maximum Likelihood Estimation (MLE) of Probabilistic Context-Free Grammars (PCFGs) is well-understood (the Inside-Outside algorithm can do this efficiently from the terminal strings alone) the inferred grammars are usually linguistically inaccurate. In order to better understand why maximum likelihood finds poor grammars, this talk examines two simple natural language induction problems: morphological segmentation and word segmentation. We identify several problems with the MLE PCFG models of these problems and propose Hierarchical Dirichlet Process (HDP) models to overcome them. In order to test these HDP models we develop MCMC algorithms for Bayesian inference of these models from strings alone. Finally, we discuss to what extent the lessons learnt from these examples can be put into a unified framework and applied to the general problem of grammar induction.

Свежие видео