Collins, Michael. 2002. ‘Discriminative Training Methods for Hidden Markov Models’. Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing  - EMNLP ’02, 1–8. https://doi.org/10.3115/1118693.1118694.
Jurafsky, Dan, and James H. Martin. 2009. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. 2nd ed. Prentice Hall series in artificial intelligence. Pearson Prentice Hall.
Smith, Noah Ashton. 2011. Linguistic Structure Prediction. Synthesis lectures on human language technologies. Morgan & Claypool. http://dx.doi.org/10.2200/S00361ED1V01Y201105HLT013.
‘Stat NLP Book’. n.d. https://github.com/uclmr/stat-nlp-book.