1.
Jurafsky, D. & Martin, J. H. Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. vol. Prentice Hall series in artificial intelligence (Pearson Prentice Hall, 2009).
2.
Smith, N. A. Linguistic structure prediction. vol. Synthesis lectures on human language technologies (Morgan & Claypool, 2011).
3.
Collins, M. Discriminative training methods for hidden Markov models. in Proceedings of the ACL-02 conference on Empirical methods in natural language processing  - EMNLP ’02 1–8 (Association for Computational Linguistics, 2002). doi:10.3115/1118693.1118694.
4.
Stat NLP Book. https://github.com/uclmr/stat-nlp-book.