1
Jurafsky D, Martin JH. Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. 2nd ed. Upper Saddle River, N.J.: Pearson Prentice Hall 2009.
2
Smith NA. Linguistic structure prediction. San Rafael, Calif: Morgan & Claypool 2011.
3
Collins M. Discriminative training methods for hidden Markov models. Proceedings of the ACL-02 conference on Empirical methods in natural language processing  - EMNLP ’02. Association for Computational Linguistics 2002:1–8.
4
Stat NLP Book. https://github.com/uclmr/stat-nlp-book