[1]
D. Jurafsky and J. H. Martin, Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd ed. Prentice Hall series in artificial intelligence. Upper Saddle River, N.J.: Pearson Prentice Hall, 2009.
[2]
N. A. Smith, Linguistic structure prediction. Synthesis lectures on human language technologies. San Rafael, Calif.: Morgan & Claypool, 2011 [Online]. Available: http://dx.doi.org/10.2200/S00361ED1V01Y201105HLT013
[3]
M. Collins, ‘Discriminative training methods for hidden Markov models’, in Proceedings of the ACL-02 conference on Empirical methods in natural language processing (EMNLP ’02), 2002, pp. 1–8, doi: 10.3115/1118693.1118694 [Online]. Available: http://portal.acm.org/citation.cfm?doid=1118693.1118694
[4]
‘Stat NLP Book’. [Online]. Available: https://github.com/uclmr/stat-nlp-book