Collins M, ‘Discriminative Training Methods for Hidden Markov Models’, Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - EMNLP ’02 (Association for Computational Linguistics 2002) <http://portal.acm.org/citation.cfm?doid=1118693.1118694>
Jurafsky D and Martin JH, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition (Prentice Hall Series in Artificial Intelligence, 2nd ed, Pearson Prentice Hall 2009)
Smith NA, Linguistic Structure Prediction (Synthesis Lectures on Human Language Technologies, Morgan & Claypool 2011) <http://dx.doi.org/10.2200/S00361ED1V01Y201105HLT013>
‘Stat NLP Book’ <https://github.com/uclmr/stat-nlp-book>