Data Science (2nd Ed.)
Concepts and Practice

Authors:

Language: English
Cover of the book Data Science

Keywords

ARIMA; Accuracy; Anomaly; Application; Apriori; Artificial neural networks; Association; Attribute weighting; Backward elimination; Bagging; Boosting; CRISP; Cartesian space; Centroid; Chi-square test; Classification; Classification performance; Clustering; Collaborative filtering; Confidence; Content-based recommender; Correlation; Cross-sectional data; DBSCAN; Data exploration; Data mining process; Data preparation; Data science; Data-driven forecasting; Decision trees; Density clustering; Density-based; Descriptive analytics; Dimension reduction; Distance-based; Distribution; Ensemble; Evaluation; Exponential smoothing; FP Growth; Feature selection; Filter type; Forward selection; Fraud; Frequent items; Function fitting; Histogram; Information gain; Inverse document frequency; Item profile; Item-based collaborative filtering; K-means clustering; K-nearest neighbors; KDD; Keyword clustering; Kohonen networks; LOF; Latent factors; Lift charts; Linear regression; Local outlier factor; Logistic regression; Logit; Machine learning-based forecasting; Market basket analysis; Matrix factorization; Mean; Model evaluation; Model-based forecasting; Modeling; N-grams; Naïve Bayesian; Neighborhood-based methods; Optimization; Outlier; PCA; Precision; Prior knowledge; ROC curve; Random forests; RapidMiner; RapidMiner GUI; Ratings matrix; Recall; Recommendation; Recommender; Regression; Regression model validity; Roadmap; Rule induction; Scatter chart; Self-organizing maps; Sensitivity; Specificity; Standard deviation; Statistics; Stemming; Stop word filtering; Supervised model–based approach; Support

69.52 €

In Print (Delivery period: 14 days).

Add to cartAdd to cart
Publication date:
568 p. · 19x23.3 cm · Paperback

Learn the basics of Data Science through an easy to understand conceptual framework and immediately practice using RapidMiner platform. Whether you are brand new to data science or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions.

Data Science has become an essential tool to extract value from data for any organization that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, engineers, and analytics professionals and for anyone who works with data.

You?ll be able to:

  1. Gain the necessary knowledge of different data science techniques to extract value from data.
  2. Master the concepts and inner workings of 30 commonly used powerful data science algorithms.
  3. Implement step-by-step data science process using using RapidMiner, an open source GUI based data science platform

Data Science techniques covered: Exploratory data analysis, Visualization, Decision trees, Rule induction, k-nearest neighbors, Naïve Bayesian classifiers, Artificial neural networks, Deep learning, Support vector machines, Ensemble models, Random forests, Regression, Recommendation engines, Association analysis, K-Means and Density based clustering, Self organizing maps, Text mining, Time series forecasting, Anomaly detection, Feature selection and more...

1. Introduction
2. Data Science Process
3. Data Exploration
4. Classification
5. Deep Learning
6. Regression Methods
7. Association Analysis
8. Recommendation Engines
9. Clustering
10. Text Mining (renamed to: Natural Language Processing)
11. Time Series Forecasting
12. Anomaly Detection
13. Feature Selection
14. Model Evaluation
15. Efficient Model Execution
16. Getting Started with RapidMiner
Business and analytics professionals who use data in everyday work settings: Data analysts, business intelligence, data warehousing, business analysts, IT leadership teams, finance and sales operations; students and instructors of data science courses; users of RapidMiner
Vijay Kotu is Vice President of Analytics at ServiceNow. He leads the implementation of large-scale data platforms and services to support the company's enterprise business. He has led analytics organizations for over a decade with focus on data strategy, business intelligence, machine learning, experimentation, engineering, enterprise adoption, and building analytics talent. Prior to joining ServiceNow, he was Vice President of Analytics at Yahoo. He worked at Life Technologies and Adteractive where he led marketing analytics, created algorithms to optimize online purchasing behavior, and developed data platforms to manage marketing campaigns. He is a member of the Association of Computing Machinery and a member of the Advisory Board at RapidMiner.
Dr. Deshpande has extensive experience in working with companies ranging from startups to Fortune 5 in fields ranging from automotive, aerospace, retail, food, and manufacturing verticals delivering business analysis; designing and developing custom data products for implementing business intelligence, data science, and predictive analytics solutions. He was the Founder of SimaFore, a predictive analytics consulting company which was acquired by Soliton Inc., a provider of testing solutions for the semiconductor industry. He was also the Founding Co-chair of the annual Predictive Analytics World-Manufacturing conference. In his professional career he has worked with Ford Motor Company on their product development, with IBM at their IBM Watson Center of Competence, and with Domino’s Pizza at their data science and artificial intelligence groups. He has a Ph.D. from Carnegie Mellon and an MBA from Ross School of Business, Michigan.
  • Contains fully updated content on data science, including tactics on how to mine business data for information
  • Presents simple explanations for over twenty powerful data science techniques
  • Enables the practical use of data science algorithms without the need for programming
  • Demonstrates processes with practical use cases
  • Introduces each algorithm or technique and explains the workings of a data science algorithm in plain language
  • Describes the commonly used setup options for the open source tool RapidMiner