
Stochastic grammar

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license.

A stochastic grammar (statistical grammar) is a grammar framework with a probabilistic notion of grammaticality.

The grammar is realized as a language model. Allowed sentences are stored in a database together with a frequency that records how common each sentence is.

Statistical natural language processing uses stochastic, probabilistic and statistical methods, especially to resolve difficulties that arise because longer sentences are highly ambiguous when processed with realistic grammars, yielding thousands or millions of possible analyses. Methods for disambiguation often involve the use of corpora and Markov models.

"A probabilistic model consists of a non-probabilistic model plus some numerical quantities; it is not true that probabilistic models are inherently simpler or less structural than non-probabilistic models."
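
The two ideas above, a database of sentences with frequencies and a Markov model estimated from a corpus, can be illustrated with a minimal Python sketch. The toy corpus, the function names and the `<s>`/`</s>` boundary markers are assumptions made for illustration and do not come from the cited sources; a realistic model would also need smoothing for unseen transitions.

```python
from collections import Counter

# Toy corpus, invented for illustration only.
CORPUS = [
    "the dog barks",
    "the dog barks",
    "the cat sleeps",
    "a dog sleeps",
]

def sentence_model(corpus):
    """Map each stored sentence to its relative frequency in the corpus."""
    counts = Counter(corpus)
    total = sum(counts.values())
    return {sentence: n / total for sentence, n in counts.items()}

def bigram_model(corpus):
    """Estimate first-order Markov transition probabilities P(next | prev)."""
    pair_counts = Counter()
    prev_counts = Counter()
    for sentence in corpus:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, nxt in zip(tokens, tokens[1:]):
            pair_counts[(prev, nxt)] += 1
            prev_counts[prev] += 1
    return {pair: n / prev_counts[pair[0]] for pair, n in pair_counts.items()}

def bigram_probability(model, sentence):
    """Probability of a sentence under the bigram model.

    Returns 0.0 as soon as one transition was never observed (no smoothing).
    """
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    prob = 1.0
    for prev, nxt in zip(tokens, tokens[1:]):
        prob *= model.get((prev, nxt), 0.0)
    return prob

stored = sentence_model(CORPUS)
markov = bigram_model(CORPUS)

# Candidate word sequences ranked by probability; the likeliest comes first.
candidates = ["a dog barks", "the dog barks", "barks the dog"]
ranked = sorted(candidates, key=lambda s: bigram_probability(markov, s), reverse=True)
```

In this sketch the sentence database assigns no probability to "a dog barks" because that exact sentence was never stored, while the bigram model still gives it a non-zero probability, lower than that of "the dog barks". This graded ranking of candidates is the kind of signal that can be used for disambiguation.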

Examples

Hirjee & Brown (2013) implemented a probabilistic method for rhyme detection to find internal and imperfect rhyme pairs in rap lyrics. The approach adapts a sequence alignment technique that uses BLOSUM (BLOcks SUbstitution Matrix) scoring. With it they detected rhymes that non-probabilistic models could not.
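
The general idea of scoring an alignment with a substitution matrix can be sketched as follows. This is not the authors' implementation: it uses a standard Smith-Waterman local alignment as a stand-in, and the phoneme symbols, the score values in `SCORES`, and the `MISMATCH` and `GAP` penalties are all invented for illustration. In a BLOSUM-style setup the scores would instead be derived from how often symbol pairs co-occur in known matches.

```python
# Hypothetical similarity scores between phoneme-like symbols (invented values).
# Higher scores mean the pair is more likely to take part in a rhyme.
SCORES = {
    ("AY", "AY"): 3,
    ("T", "T"): 2,
    ("D", "D"): 2,
    ("M", "M"): 2,
    ("N", "N"): 2,
    ("T", "D"): 1,  # similar but not identical consonants
    ("M", "N"): 1,
}
MISMATCH = -2  # assumed penalty for pairs missing from the table
GAP = -2       # assumed penalty for skipping a symbol

def score(a, b):
    """Look up the symmetric substitution score for a pair of symbols."""
    return SCORES.get((a, b), SCORES.get((b, a), MISMATCH))

def local_alignment_score(xs, ys):
    """Best Smith-Waterman local alignment score between two sequences."""
    prev = [0] * (len(ys) + 1)
    best = 0
    for x in xs:
        curr = [0]
        for j, y in enumerate(ys, start=1):
            cell = max(
                0,                          # start a new local alignment
                prev[j - 1] + score(x, y),  # align x with y
                prev[j] + GAP,              # skip x
                curr[j - 1] + GAP,          # skip y
            )
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best

# "time" and "dine" written as phoneme-like symbols: an imperfect rhyme.
imperfect = local_alignment_score(["T", "AY", "M"], ["D", "AY", "N"])
perfect = local_alignment_score(["T", "AY", "M"], ["T", "AY", "M"])
unrelated = local_alignment_score(["S", "IY"], ["T", "AY", "M"])
```

An exact-match test would reject the "time"/"dine" pair outright, whereas the graded score places it between an identical pair and an unrelated one, so a threshold on the score can admit imperfect rhymes.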


References

  1. Carrasco, Rafael C.; Oncina, Jose (1994). Carrasco, Rafael C.; Oncina, Jose (eds.). "Learning stochastic regular grammars by means of a state merging method". Grammatical Inference and Applications. Berlin, Heidelberg: Springer: 139–152. doi:10.1007/3-540-58473-0_144. ISBN 978-3-540-48985-6.
  2. Steve Young; Gerrit Bloothooft (14 March 2013). Corpus-Based Methods in Language and Speech Processing. Springer Science & Business Media. pp. 140–. ISBN 978-94-017-1183-8.
  3. John Goldsmith. 2002. "Probabilistic Models of Grammar: Phonology as Information Minimization." Phonological Studies #5: 21–46.
  4. Hirjee, Hussein; Brown, Daniel (2013). "Using Automated Rhyme Detection to Characterize Rhyming Style in Rap Music" (PDF). Empirical Musicology Review.

Further reading

  • Christopher D. Manning, Hinrich Schütze: Foundations of Statistical Natural Language Processing, MIT Press (1999), ISBN 978-0-262-13360-9.
  • Stefan Wermter, Ellen Riloff, Gabriele Scheler (eds.): Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing, Springer (1996), ISBN 978-3-540-60925-4.
  • Pirani, Giancarlo, ed. Advanced algorithms and architectures for speech understanding. Vol. 1. Springer Science & Business Media, 2013.