Collins, Michael, ‘Discriminative Training Methods for Hidden Markov Models’, in Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - EMNLP ’02 (Association for Computational Linguistics, 2002), pp. 1–8 <https://doi.org/10.3115/1118693.1118694>
Jurafsky, Dan, and James H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd ed (Upper Saddle River, N.J.: Pearson Prentice Hall, 2009), Prentice Hall series in artificial intelligence
Smith, Noah Ashton, Linguistic Structure Prediction (San Rafael, Calif: Morgan & Claypool, 2011), Synthesis lectures on human language technologies <http://dx.doi.org/10.2200/S00361ED1V01Y201105HLT013>
‘Stat NLP Book’ <https://github.com/uclmr/stat-nlp-book>