Bishop CM, Pattern Recognition and Machine Learning (Information Science and Statistics, Springer 2006) <https://www.microsoft.com/en-us/research/people/cmbishop/prml-book/>
Manning CD, Raghavan P and Schütze H, Introduction to Information Retrieval (Cambridge University Press 2008)
Tan P-N, Steinbach M and Kumar V, Introduction to Data Mining (1st ed, Addison-Wesley)
Witten IH, Moffat A and Bell TC, Managing Gigabytes: Compressing and Indexing Documents and Images (2nd ed, Morgan Kaufmann 1999)