Reading club: The Elements of Statistical Learning - neuronstar/elements-of-statistical-learning

The Elements of Statistical Learning: Data Mining, Inference, and Prediction, by Trevor Hastie, Robert Tibshirani, and Jerome Friedman, is a must-read for everyone involved in the data mining field. During the past decade there has been an explosion in computation and information technology. The challenge of understanding the resulting data has led to the development of new tools in the field of statistics, and spawned new areas such as data mining, machine learning, and bioinformatics.

The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. While the approach is statistical, the emphasis is on concepts rather than mathematics. It is a valuable resource for statisticians, mathematics students, and anyone interested in data mining in science or industry.

Trevor Hastie, Robert Tibshirani, and Jerome Friedman are professors of statistics at Stanford University. Tibshirani proposed the lasso and is co-author of the very successful An Introduction to the Bootstrap. Professors Hastie and Tibshirani published The Elements of Statistical Learning with Jerome Friedman (Springer, 2001; second edition 2009).

An Introduction to Statistical Learning covers many of the same topics, but at a level accessible to a much broader audience. Both books are available as free PDFs. Some unsupervised learning methods are discussed: principal components and clustering (k-means and hierarchical). Computing is done in R.
There is also a chapter on methods for "wide" data (p bigger than n), including multiple testing and false discovery rates.
