Collins, M. (2002). Discriminative training methods for hidden Markov models. Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing  - EMNLP ’02, 1–8. https://doi.org/10.3115/1118693.1118694
Jurafsky, D., & Martin, J. H. (2009). Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition: Vol. Prentice Hall series in artificial intelligence (2nd ed). Pearson Prentice Hall.
Smith, N. A. (2011). Linguistic structure prediction: Vol. Synthesis lectures on human language technologies. Morgan & Claypool. http://dx.doi.org/10.2200/S00361ED1V01Y201105HLT013
Stat NLP Book. (n.d.). https://github.com/uclmr/stat-nlp-book