万本电子书0元读

万本电子书0元读

顶部广告

R Data Science Essentials电子书

售       价:¥

1人正在读 | 0人评论 9.8

作       者:Raja B. Koushik

出  版  社:Packt Publishing

出版时间:2016-01-13

字       数:29.7万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Learn the essence of data science and visualization using R in no time at allAbout This BookBecome a pro at making stunning visualizations and dashboards quickly and without hassleFor better decision making in business, apply the R programming language with the help of useful statistical techniques.From seasoned authors comes a book that offers you a plethora of fast-paced techniques to detect and analyze data patternsWho This Book Is ForIf you are an aspiring data scientist or analyst who has a basic understanding of data science and has basic hands-on experience in R or any other analytics tool, then R Data Science Essentials is the book for you.What You Will LearnPerform data preprocessing and basic operations on dataImplement visual and non-visual implementation data exploration techniquesMine patterns from data using affinity and sequential analysisUse different clustering algorithms and visualize themImplement logistic and linear regression and find out how to evaluate and improve the performance of an algorithmExtract patterns through visualization and build a forecasting algorithmBuild a recommendation engine using different collaborative filtering algorithmsMake a stunning visualization and dashboard using ggplot and R shinyIn DetailWith organizations increasingly embedding data science across their enterprise and with management becoming more data-driven it is an urgent requirement for analysts and managers to understand the key concept of data science. The data science concepts discussed in this book will help you make key decisions and solve the complex problems you will inevitably face in this new world.R Data Science Essentials will introduce you to various important concepts in the field of data science using R. We start by reading data from multiple sources, then move on to processing the data, extracting hidden patterns, building predictive and forecasting models, building a recommendation engine, and communicating to the user through stunning visualizations and dashboards.By the end of this book, you will have an understanding of some very important techniques in data science, be able to implement them using R, understand and interpret the outcomes, and know how they helps businesses make a decision.Style and approachThis easy-to-follow guide contains hands-on examples of the concepts of data science using R.
目录展开

R Data Science Essentials

Table of Contents

R Data Science Essentials

Credits

About the Authors

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Getting Started with R

Reading data from different sources

Reading data from a database

Data types in R

Variable data types

Data preprocessing techniques

Performing data operations

Arithmetic operations on the data

String operations on the data

Aggregation operations on the data

Mean

Median

Sum

Maximum and minimum

Standard deviation

Control structures in R

Control structures – if and else

Control structures – for

Control structures – while

Control structures – repeat and break

Control structures – next and return

Bringing data to a usable format

Summary

2. Exploratory Data Analysis

The Titanic dataset

Descriptive statistics

Box plot

Exercise

Inferential statistics

Univariate analysis

Bivariate analysis

Multivariate analysis

Cross-tabulation analysis

Graphical analysis

Summary

3. Pattern Discovery

Transactional datasets

Using the built-in dataset

Building the dataset

Apriori analysis

Support, confidence, and lift

Support

Confidence

Lift

Generating filtering rules

Plotting

Dataset

Rules

Sequential dataset

Apriori sequence analysis

Understanding the results

Reference

Business cases

Summary

4. Segmentation Using Clustering

Datasets

Reading and formatting the dataset in R

Centroid-based clustering and an ideal number of clusters

Implementation using K-means

Visualizing the clusters

Connectivity-based clustering

Visualizing the connectivity

Business use cases

Summary

5. Developing Regression Models

Datasets

Sampling the dataset

Logistic regression

Evaluating logistic regression

Linear regression

Evaluating linear regression

Methods to improve the accuracy

Ensemble models

Replacing NA with mean or median

Removing the highly correlated values

Removing outliers

Summary

6. Time Series Forecasting

Datasets

Extracting patterns

Forecasting using ARIMA

Forecasting using Holt-Winters

Methods to improve accuracy

Summary

7. Recommendation Engine

Dataset and transformation

Recommendations using user-based CF

Recommendations using item-based CF

Challenges and enhancements

Summary

8. Communicating Data Analysis

Dataset

Plotting using the googleVis package

Creating an interactive dashboard using Shiny

Summary

Index

累计评论(0条) 2个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部