Description
Discovering knowledge from big multivariate data, recorded every days, requires specialized machine learning techniques.
This book presents an easy to use practical guide in R to compute the most popular machine learning methods for exploring real word data sets, as well as, for building predictive models.
The main parts of the book include:
A) Unsupervised learning methods, to explore and discover knowledge from a large multivariate data set using clustering and principal component methods. You will learn hierarchical clustering, k-means, principal component analysis and correspondence analysis methods.
B) Regression analysis, to predict a quantitative outcome value using linear regression and non-linear regression strategies.
C) Classification techniques, to predict a qualitative outcome value using logistic regression, discriminant analysis, naive bayes classifier and support vector machines.
D) Advanced machine learning methods, to build robust regression and classification models using k-nearest neighbors methods, decision tree models, ensemble methods (bagging, random forest and boosting).
E) Model selection methods, to select automatically the best combination of predictor variables for building an optimal predictive model. These include, best subsets selection methods, stepwise regression and penalized regression (ridge, lasso and elastic net regression models). We also present principal component-based regression methods, which are useful when the data contain multiple correlated predictor variables.
F) Model validation and evaluation techniques for measuring the performance of a predictive model.
G) Model diagnostics for detecting and fixing a potential problems in a predictive model.
The book presents the basic principles of these tasks and provide many examples in R. This book offers solid guidance in data mining for students and researchers.
Key features:
- Covers machine learning algorithm and implementation
- Key mathematical concepts are presented
- Short, self-contained chapters with practical examples.
Recommended for you
This section contains best data science and self-development resources to help you on your path.
Coursera - Online Courses and Specialization
Data science
- Course: Machine Learning: Master the Fundamentals by Stanford
- Specialization: Data Science by Johns Hopkins University
- Specialization: Python for Everybody by University of Michigan
- Courses: Build Skills for a Top Job in any Industry by Coursera
- Specialization: Master Machine Learning Fundamentals by University of Washington
- Specialization: Statistics with R by Duke University
- Specialization: Software Development in R by Johns Hopkins University
- Specialization: Genomic Data Science by Johns Hopkins University
Popular Courses Launched in 2020
- Google IT Automation with Python by Google
- AI for Medicine by deeplearning.ai
- Epidemiology in Public Health Practice by Johns Hopkins University
- AWS Fundamentals by Amazon Web Services
Trending Courses
- The Science of Well-Being by Yale University
- Google IT Support Professional by Google
- Python for Everybody by University of Michigan
- IBM Data Science Professional Certificate by IBM
- Business Foundations by University of Pennsylvania
- Introduction to Psychology by Yale University
- Excel Skills for Business by Macquarie University
- Psychological First Aid by Johns Hopkins University
- Graphic Design by Cal Arts
Amazon FBA
Amazing Selling Machine
Books - Data Science
Our Books
- Practical Guide to Cluster Analysis in R by A. Kassambara (Datanovia)
- Practical Guide To Principal Component Methods in R by A. Kassambara (Datanovia)
- Machine Learning Essentials: Practical Guide in R by A. Kassambara (Datanovia)
- R Graphics Essentials for Great Data Visualization by A. Kassambara (Datanovia)
- GGPlot2 Essentials for Great Data Visualization in R by A. Kassambara (Datanovia)
- Network Analysis and Visualization in R by A. Kassambara (Datanovia)
- Practical Statistics in R for Comparing Groups: Numerical Variables by A. Kassambara (Datanovia)
- Inter-Rater Reliability Essentials: Practical Guide in R by A. Kassambara (Datanovia)
Others
- R for Data Science: Import, Tidy, Transform, Visualize, and Model Data by Hadley Wickham & Garrett Grolemund
- Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems by Aurelien Géron
- Practical Statistics for Data Scientists: 50 Essential Concepts by Peter Bruce & Andrew Bruce
- Hands-On Programming with R: Write Your Own Functions And Simulations by Garrett Grolemund & Hadley Wickham
- An Introduction to Statistical Learning: with Applications in R by Gareth James et al.
- Deep Learning with R by François Chollet & J.J. Allaire
- Deep Learning with Python by François Chollet
Version: Français
Laurent D. (verified owner) –
The explanations are clear and concise. The code directly usable. Perfect for quickly finding a solution to a particular problem.
However, there are no links to navigate within the pdf document (for example from the contents or from the index).
david maupin (verified owner) –
Great book with practical examples!
Eko Subagyo (verified owner) –
Yuming (verified owner) –
The explanations are clear and concise. a good book!
However, cant copy text from the PDF into Rstudio or into Word I only get unreadable gibberish. thank author sent me a code~
Hamidou Sy (verified owner) –
Andrzej Ptasznik (verified owner) –
Federico (verified owner) –
Didier Ouedraogo (verified owner) –
I really appreciate this book. The author explain clearly and put the learner to the up level in the mastery of machine learning with R.
Anonymous (verified owner) –
good text. I get a skill to make best models.
Oscar Salas (verified owner) –
José de França Bueno (verified owner) –
Anonymous (verified owner) –
na
Troy (verified owner) –
This is a nice introduction to machine learning using modern R syntax.
Bitrus (verified owner) –
Well arranged and logical; I learned the essentials within a short space of time. Definitely recommend
Pavel (verified owner) –
Very professional level, very helpfull book
Anonymous (verified owner) –
Great examples on how to use the caret R package. However, tidymodels is usurping caret, so this book will need an update soon. Also a comparison between tidymodels and more mlr3 would be nice.
Ognjen (verified owner) –
So far the best manual of Datanovia!
kassambara (store manager) –
Thank you for the positive feedback, highly appreciated
Ioannis M. (verified owner) –
Great reference, the only issue is the PDF does not have a table of contents to redirect as the other documents do.
kassambara (store manager) –
Thank you for the feedback, we’ll check and fix this issue
Anonymous (verified owner) –
excellent quick reference
Tsheten (verified owner) –
I purchased the right book for my research. All codes required for my analysis are well-captured in the book with real-life examples. One thing missing in the book is data management steps (like categorizing, generating new variables, etc) before doing real analysis. Otherwise, I am really satisfied for having the book
Andre S. (verified owner) –
Etoma Egot (verified owner) –
Easy purchase for me. I highly recommend it
Joaquín Dutour (verified owner) –
Very good book. Ease and Speed in the purchase and acquisition of the product
Anonymous (verified owner) –
Sarah Manu (verified owner) –
Emmanuel (verified owner) –
Great book. Very practical. Clear examples. I would just add the scripts.
Etienne Ntumba (verified owner) –
Kenneth Yakubu (verified owner) –
I liked that I had access to this book quickly.
Stefan K. (verified owner) –
Unfortunately, the download link did not work, so I did not receive the book I bought.
Gernot (verified owner) –
Veerasak P. (verified owner) –
A wonderful guide to the practical side of machine learning. The explanation of machine learning is presented in a clear and concise manner. Examples using R code are well demonstrated.