万本电子书0元读

万本电子书0元读

顶部广告

Mastering Machine Learning with R - Second Edition电子书

售       价:¥

3人正在读 | 0人评论 9.8

作       者:Cory Lesmeister

出  版  社:Packt Publishing

出版时间:2017-04-24

字       数:48.6万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
This book will teach you advanced techniques in machine learning with the latest code in R 3.3.2. You will delve into statistical learning theory and supervised learning; design efficient algorithms; learn about creating Recommendation Engines; use multi-class classification and deep learning; and more. You will explore, in depth, topics such as data mining, classification, clustering, regression, predictive modeling, anomaly detection, boosted trees with XGBOOST, and more. More than just knowing the outcome, you'll understand how these concepts work and what they do. With a slow learning curve on topics such as neural networks, you will explore deep learning, and more. By the end of this book, you will be able to perform machine learning with R in the cloud using AWS in various scenarios with different datasets. What you will learn ?Gain deep insights into the application of machine learning tools in the industry ?Manipulate data in R efficiently to prepare it for analysis ?Master the skill of recognizing techniques for effective visualization of data ?Understand why and how to create test and training data sets for analysis ?Master fundamental learning methods such as linear and logistic regression ?Comprehend advanced learning methods such as support vector
目录展开

Title Page

Copyright

Credits

About the Author

About the Reviewers

Packt Upsell

Customer Feedback

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Downloading the color images of this book

Errata

Piracy

Questions

A Process for Success

The process

Business understanding

Identifying the business objective

Assessing the situation

Determining the analytical goals

Producing a project plan

Data understanding

Data preparation

Modeling

Evaluation

Deployment

Algorithm flowchart

Summary

Linear Regression - The Blocking and Tackling of Machine Learning

Univariate linear regression

Business understanding

Multivariate linear regression

Business understanding

Data understanding and preparation

Modeling and evaluation

Other linear model considerations

Qualitative features

Interaction terms

Summary

Logistic Regression and Discriminant Analysis

Classification methods and linear regression

Logistic regression

Business understanding

Data understanding and preparation

Modeling and evaluation

The logistic regression model

Logistic regression with cross-validation

Discriminant analysis overview

Discriminant analysis application

Multivariate Adaptive Regression Splines (MARS)

Model selection

Summary

Advanced Feature Selection in Linear Models

Regularization in a nutshell

Ridge regression

LASSO

Elastic net

Business case

Business understanding

Data understanding and preparation

Modeling and evaluation

Best subsets

Ridge regression

LASSO

Elastic net

Cross-validation with glmnet

Model selection

Regularization and classification

Logistic regression example

Summary

More Classification Techniques - K-Nearest Neighbors and Support Vector Machines

K-nearest neighbors

Support vector machines

Business case

Business understanding

Data understanding and preparation

Modeling and evaluation

KNN modeling

SVM modeling

Model selection

Feature selection for SVMs

Summary

Classification and Regression Trees

An overview of the techniques

Understanding the regression trees

Classification trees

Random forest

Gradient boosting

Business case

Modeling and evaluation

Regression tree

Classification tree

Random forest regression

Random forest classification

Extreme gradient boosting - classification

Model selection

Feature Selection with random forests

Summary

Neural Networks and Deep Learning

Introduction to neural networks

Deep learning, a not-so-deep overview

Deep learning resources and advanced methods

Business understanding

Data understanding and preparation

Modeling and evaluation

An example of deep learning

H2O background

Data upload to H2O

Create train and test datasets

Modeling

Summary

Cluster Analysis

Hierarchical clustering

Distance calculations

K-means clustering

Gower and partitioning around medoids

Gower

PAM

Random forest

Business understanding

Data understanding and preparation

Modeling and evaluation

Hierarchical clustering

K-means clustering

Gower and PAM

Random Forest and PAM

Summary

Principal Components Analysis

An overview of the principal components

Rotation

Business understanding

Data understanding and preparation

Modeling and evaluation

Component extraction

Orthogonal rotation and interpretation

Creating factor scores from the components

Regression analysis

Summary

Market Basket Analysis, Recommendation Engines, and Sequential Analysis

An overview of a market basket analysis

Business understanding

Data understanding and preparation

Modeling and evaluation

An overview of a recommendation engine

User-based collaborative filtering

Item-based collaborative filtering

Singular value decomposition and principal components analysis

Business understanding and recommendations

Data understanding, preparation, and recommendations

Modeling, evaluation, and recommendations

Sequential data analysis

Sequential analysis applied

Summary

Creating Ensembles and Multiclass Classification

Ensembles

Business and data understanding

Modeling evaluation and selection

Multiclass classification

Business and data understanding

Model evaluation and selection

Random forest

Ridge regression

MLR's ensemble

Summary

Time Series and Causality

Univariate time series analysis

Understanding Granger causality

Business understanding

Data understanding and preparation

Modeling and evaluation

Univariate time series forecasting

Examining the causality

Linear regression

Vector autoregression

Summary

Text Mining

Text mining framework and methods

Topic models

Other quantitative analyses

Business understanding

Data understanding and preparation

Modeling and evaluation

Word frequency and topic models

Additional quantitative analysis

Summary

R on the Cloud

Creating an Amazon Web Services account

Launch a virtual machine

Start RStudio

Summary

R Fundamentals

Getting R up-and-running

Using R

Data frames and matrices

Creating summary statistics

Installing and loading R packages

Data manipulation with dplyr

Summary

Sources

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部