Bayesian Inference of Grammars

537
22.4
Опубликовано 7 сентября 2016, 16:51
Mark Johnson (Joint work with Sharon Goldwater and Tom Griffiths) Even though Maximum Likelihood Estimation (MLE) of Probabilistic Context-Free Grammars (PCFGs) is well-understood (the Inside-Outside algorithm can do this efficiently from the terminal strings alone) the inferred grammars are usually linguistically inaccurate. In order to better understand why maximum likelihood finds poor grammars, this talk examines two simple natural language induction problems: morphological segmentation and word segmentation. We identify several problems with the MLE PCFG models of these problems and propose Hierarchical Dirichlet Process (HDP) models to overcome them. In order to test these HDP models we develop MCMC algorithms for Bayesian inference of these models from strings alone. Finally, we discuss to what extent the lessons learnt from these examples can be put into a unified framework and applied to the general problem of grammar induction.
Случайные видео
317 дней – 1 6960:25
AI adoption best practices
15.12.22 – 44 4716:39
Is THIS The Future Of Foldables?
25.04.22 – 1 2297:30
Customer success Q1 2022
автотехномузыкадетское