1.
Jurafsky, D. & Martin, J. H. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. vol. Prentice Hall series in artificial intelligence (Pearson Prentice Hall, Upper Saddle River, N.J., 2009).
2.
Smith, N. A. Linguistic Structure Prediction. vol. Synthesis lectures on human language technologies (Morgan & Claypool, San Rafael, Calif, 2011).
3.
Collins, M. Discriminative training methods for hidden Markov models. in Proceedings of the ACL-02 conference on Empirical methods in natural language processing  - EMNLP ’02 1–8 (Association for Computational Linguistics, 2002). doi:10.3115/1118693.1118694.
4.
Stat NLP Book. https://github.com/uclmr/stat-nlp-book.