Data Science and Predictive Analytics
![]()  | |
| Author | Ivo D. Dinov | 
|---|---|
| Language | English | 
| Series | The Springer Series in Applied Machine Learning | 
| Subject | Computer science, Data science, artificial intelligence | 
| Publisher | Springer | 
Publication date  | 2018 (1st ed.), 2023 (2nd edition) | 
| Publication place | Switzerland | 
| Media type | Print (hardcover and softcover), electronic (PDF and EPub) | 
| ISBN | 978-3-031-17483-4 978-3-319-72346-4, 978-3-031-17485-8, 978-3-031-17482-7 | 
The first edition of the textbook Data Science and Predictive Analytics: Biomedical and Health Applications using R, authored by Ivo D. Dinov, was published in August 2018 by Springer.[1] The second edition of the book was printed in 2023.[2]
This textbook covers some of the core mathematical foundations, computational techniques, and artificial intelligence approaches used in data science research and applications.[3]
By using the statistical computing platform R and a broad range of biomedical case-studies, the 23 chapters of the book first edition provide explicit examples of importing, exporting, processing, modeling, visualizing, and interpreting large, multivariate, incomplete, heterogeneous, longitudinal, and incomplete datasets (big data).[4]
Structure
First edition table of contents
The first edition of the Data Science and Predictive Analytics (DSPA) textbook[1] is divided into the following 23 chapters, each progressively building on the previous content.
- Motivation
 - Foundations of R
 - Managing Data in R
 - Data Visualization
 - Linear Algebra & Matrix Computing
 - Dimensionality Reduction
 - Lazy Learning: Classification Using Nearest Neighbors
 - Probabilistic Learning: Classification Using Naive Bayes
 - Decision Tree Divide and Conquer Classification
 - Forecasting Numeric Data Using Regression Models
 - Black Box Machine-Learning Methods: Neural Networks and Support Vector Machines
 - Apriori Association Rules Learning
 - k-Means Clustering
 - Model Performance Assessment
 - Improving Model Performance
 - Specialized Machine Learning Topics
 - Variable/Feature Selection
 - Regularized Linear Modeling and Controlled Variable Selection
 - Big Longitudinal Data Analysis
 - Natural Language Processing/Text Mining
 - Prediction and Internal Statistical Cross Validation
 - Function Optimization
 - Deep Learning, Neural Networks
 
Second edition table of contents
The significantly reorganized revised edition of the book (2023)[2] expands and modernizes the presented mathematical principles, computational methods, data science techniques, model-based machine learning and model-free artificial intelligence algorithms. The 14 chapters of the new edition start with an introduction and progressively build foundational skills to naturally reach biomedical applications of deep learning.
- Introduction
 - Basic Visualization and Exploratory Data Analytics
 - Linear Algebra, Matrix Computing, and Regression Modeling
 - Linear and Nonlinear Dimensionality Reduction
 - Supervised Classification
 - Black Box Machine Learning Methods
 - Qualitative Learning Methods—Text Mining, Natural Language Processing, and Apriori Association Rules Learning
 - Unsupervised Clustering
 - Model Performance Assessment, Validation, and Improvement
 - Specialized Machine Learning Topics
 - Variable Importance and Feature Selection
 - Big Longitudinal Data Analysis
 - Function Optimization
 - Deep Learning, Neural Networks
 
Reception
The materials in the Data Science and Predictive Analytics (DSPA) textbook have been peer-reviewed in the Journal of the American Statistical Association,[5] International Statistical Institute’s ISI Review Journal,[3] and the Journal of the American Library Association.[4] Many scholarly publications reference the DSPA textbook.[6][7]
As of January 17, 2021, the electronic version of the book first edition (ISBN 978-3-319-72347-1) is freely available on SpringerLink[8] and has been downloaded over 6 million times. The textbook is globally available in print (hardcover and softcover) and electronic formats (PDF and EPub) in many college and university libraries[9] and has been used for data science, computational statistics, and analytics classes at various institutions.[10]
References
- ^ a b Dinov, Ivo (2018). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. Springer.
 - ^ a b Dinov, Ivo (2023). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. The Springer Series in Applied Machine Learning. Springer. doi:10.1007/978-3-031-17483-4. ISBN 978-3-031-17482-7. S2CID 256875731.
 - ^ a b Capaldi, Mindy (April 2019) [5 April 2019]. "(Review) Data Science and Predictive Analytics: Biomedical and Health Applications Using R". International Statistical Review. 87 (1): 181–182. doi:10.1111/insr.12317. S2CID 132379032.
 - ^ a b Saracco, Benjamin (2020) [April 2020]. "Review of Data Science and Predictive Analytics: Biomedical and Health Applications Using R". Journal of the Medical Library Association. 108 (2): 344. doi:10.5195/jmla.2020.901. PMC 7069824. S2CID 214729817.
 - ^ Qiu, Xing (2024). "Book Review: Data Science and Predictive Analytics, 2nd ed". Journal of the American Statistical Association. 119 (546): 1692–1693. doi:10.1080/01621459.2024.2303323.
 - ^ "Altmetric – Data Science and Predictive Analytics".
 - ^ "Google Scholar".
 - ^ Dinov, Ivo D. (2018). Data Science and Predictive Analytics. doi:10.1007/978-3-319-72347-1. ISBN 978-3-319-72346-4. S2CID 52098523.
 - ^ Textbook library availability
 - ^ Courses using the DSPA textbook
 
.png)