[1]
Collins, M. 2002. Discriminative training methods for hidden Markov models. Proceedings of the ACL-02 conference on Empirical methods in natural language processing  - EMNLP ’02 (2002), 1–8.
[2]
Jurafsky, D. and Martin, J.H. 2009. Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. Pearson Prentice Hall.
[3]
Smith, N.A. 2011. Linguistic structure prediction. Morgan & Claypool.
[4]
Stat NLP Book: https://github.com/uclmr/stat-nlp-book.