万本电子书0元读

万本电子书0元读

顶部广告

R for Data Science Cookbook电子书

售       价:¥

3人正在读 | 0人评论 9.8

作       者:Yu-Wei, Chiu (David Chiu)

出  版  社:Packt Publishing

出版时间:2016-07-01

字       数:264.1万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Over 100 hands-on recipes to effectively solve real-world data problems using the most popular R packages and techniques About This Book Gain insight into how data scientists collect, process, analyze, and visualize data using some of the most popular R packages Understand how to apply useful data analysis techniques in R for real-world applications An easy-to-follow guide to make the life of data scientist easier with the problems faced while performing data analysis Who This Book Is For This book is for those who are already familiar with the basic operation of R, but want to learn how to efficiently and effectively analyze real-world data problems using practical R packages. What You Will Learn Get to know the functional characteristics of R language Extract, transform, and load data from heterogeneous sources Understand how easily R can confront probability and statistics problems Get simple R instructions to quickly organize and manipulate large datasets Create professional data visualizations and interactive reports Predict user purchase behavior by adopting a classification approach Implement data mining techniques to discover items that are frequently purchased together Group similar text documents by using various clustering methods In Detail This cookbook offers a range of data analysis samples in simple and straightforward R code, providing step-by-step resources and time-saving methods to help you solve data problems efficiently. The first section deals with how to create R functions to avoid the unnecessary duplication of code. You will learn how to prepare, process, and perform sophisticated ETL for heterogeneous data sources with R packages. An example of data manipulation is provided, illustrating how to use the “dplyr” and “data.table” packages to efficiently process larger data structures. We also focus on “ggplot2” and show you how to create advanced figures for data exploration. In addition, you will learn how to build an interactive report using the “ggvis” package. Later chapters offer insight into time series analysis on financial data, while there is detailed information on the hot topic of machine learning, including data classification, regression, clustering, association rule mining, and dimension reduction. By the end of this book, you will understand how to resolve issues and will be able to comfortably offer solutions to problems encountered while performing data analysis. Style and approach This easy-to-follow guide is full of hands-on examples of data analysis with R. Each topic is fully explained beginning with the core concept, followed by step-by-step practical examples, and concluding with detailed explanations of each concept used.
目录展开

R for Data Science Cookbook

Table of Contents

R for Data Science Cookbook

Credits

About the Author

About the Reviewer

www.PacktPub.com

eBooks, discount offers, and more

Why subscribe?

Preface

What this book covers

What you need for this book

Who this book is for

Sections

Getting ready

How to do it…

How it works…

There's more…

See also

Conventions

Reader feedback

Customer support

Downloading the example code

Downloading the color images of this book

Errata

Piracy

Questions

1. Functions in R

Introduction

Creating R functions

Getting ready

How to do it...

How it works...

There's more...

Matching arguments

Getting ready

How to do it...

How it works...

There's more...

Understanding environments

Getting ready

How to do it...

How it works...

There's more...

Working with lexical scoping

Getting ready

How to do it...

How it works...

There's more...

Understanding closure

Getting ready

How to do it...

How it works...

There's more...

Performing lazy evaluation

Getting ready

How to do it...

How it works...

There's more...

Creating infix operators

Getting ready

How to do it...

How it works...

There's more...

Using the replacement function

Getting ready

How to do it...

How it works...

There's more...

Handling errors in a function

Getting ready

How to do it...

How it works...

There's more...

The debugging function

Getting ready

How to do it...

How it works...

There's more...

2. Data Extracting, Transforming, and Loading

Introduction

Downloading open data

Getting ready

How to do it…

How it works…

There's more…

Reading and writing CSV files

Getting ready

How to do it…

How it works…

There's more…

Scanning text files

Getting ready

How to do it…

How it works…

There's more…

Working with Excel files

Getting ready

How to do it…

How it works…

Reading data from databases

Getting ready

How to do it…

How it works…

There's more…

Scraping web data

Getting ready

How to do it…

How it works…

There's more…

Accessing Facebook data

Getting ready

How to do it…

How it works…

There's more…

Working with twitteR

Getting ready

How to do it…

How it works…

There's more…

3. Data Preprocessing and Preparation

Introduction

Renaming the data variable

Getting ready

How to do it…

How it works…

There's more…

Converting data types

Getting ready

How to do it…

How it works…

There's more…

Working with the date format

Getting ready

How to do it…

How it works…

There's more…

Adding new records

Getting ready

How to do it…

How it works…

There's more…

Filtering data

Getting ready

How to do it…

How it works…

There's more…

Dropping data

Getting ready

How to do it…

How it works…

There's more…

Merging data

Getting ready

How to do it…

How it works…

There's more…

Sorting data

Getting ready

How to do it…

How it works…

There's more…

Reshaping data

Getting ready

How to do it…

How it works…

There's more…

Detecting missing data

Getting ready

How to do it…

How it works…

There's more…

Imputing missing data

Getting ready

How to do it…

How it works…

There's more…

4. Data Manipulation

Introduction

Enhancing a data.frame with a data.table

Getting ready

How to do it…

How it works…

There's more…

Managing data with a data.table

Getting ready

How to do it…

How it works…

There's more…

Performing fast aggregation with a data.table

Getting ready

How to do it…

How it works…

There's more…

Merging large datasets with a data.table

Getting ready

How to do it…

How it works…

There's more…

Subsetting and slicing data with dplyr

Getting ready

How to do it…

How it works…

There's more…

Sampling data with dplyr

Getting ready

How to do it…

How it works…

There's more…

Selecting columns with dplyr

Getting ready

How to do it…

How it works…

There's more…

Chaining operations in dplyr

Getting ready

How to do it…

How it works…

There's more…

Arranging rows with dplyr

Getting ready

How to do it…

How it works…

There's more…

Eliminating duplicated rows with dplyr

Getting ready

How to do it…

How it works…

There's more…

Adding new columns with dplyr

Getting ready

How to do it…

How it works…

There's more…

Summarizing data with dplyr

Getting ready

How to do it…

How it works…

There's more…

Merging data with dplyr

Getting ready

How to do it…

How it works…

There's more…

5. Visualizing Data with ggplot2

Introduction

Creating basic plots with ggplot2

Getting ready

How to do it…

How it works…

There's more…

Changing aesthetics mapping

Getting ready

How to do it…

How it works…

There's more…

Introducing geometric objects

Getting ready

How to do it…

How it works…

There's more…

Performing transformations

Getting ready

How to do it…

How it works…

There's more…

Adjusting scales

Getting ready

How to do it…

How it works…

See also

Faceting

Getting ready

How to do it…

How it works…

There's more…

Adjusting themes

Getting ready

How to do it…

How it works…

There's more…

Combining plots

Getting ready

How to do it…

How it works…

There's more…

Creating maps

Getting ready

How to do it…

How it works…

There's more…

6. Making Interactive Reports

Introduction

Creating R Markdown reports

Getting ready

How to do it…

How it works…

There's more…

Learning the markdown syntax

Getting ready

How to do it…

How it works…

There's more…

Embedding R code chunks

Getting ready

How to do it…

How it works…

There's more…

Creating interactive graphics with ggvis

Getting ready

How to do it…

How it works…

There's more…

Understanding basic syntax and grammar

Getting ready

How to do it…

How it works…

There's more…

Controlling axes and legends

Getting ready

How to do it…

How it works…

There's more…

Using scales

Getting ready

How to do it …

How it works…

There's more …

Adding interactivity to a ggvis plot

Getting ready

How to do it…

How it works…

There's more…

Creating an R Shiny document

Getting ready

How to do it…

How it works…

There's more…

Publishing an R Shiny report

Getting ready

How to do it…

How it works…

There's more…

7. Simulation from Probability Distributions

Introduction

Generating random samples

Getting ready

How to do it…

How it works…

There's more…

Understanding uniform distributions

Getting ready

How to do it…

How it works…

Generating binomial random variates

Getting ready

How to do it…

How it works…

There's more…

Generating Poisson random variates

Getting ready

How to do it…

How it works…

There's more…

Sampling from a normal distribution

Getting ready

How to do it…

How it works…

There's more…

Sampling from a chi-squared distribution

Getting ready

How to do it…

How it works…

There's more…

Understanding Student's t-distribution

Getting ready

How to do it…

How it works…

There's more…

Sampling from a dataset

Getting ready

How to do it…

How it works…

There's more…

Simulating the stochastic process

Getting ready

How to do it…

How it works…

There's more…

8. Statistical Inference in R

Introduction

Getting confidence intervals

Getting ready

How to do it…

How it works…

There's more…

Performing Z-tests

Getting ready

How to do it…

How it works…

There's more…

Performing student's T-tests

Getting ready

How to do it…

How it works…

There's more…

Conducting exact binomial tests

Getting ready

How to do it…

How it works…

There's more…

Performing Kolmogorov-Smirnov tests

Getting ready

How to do it…

How it works…

There's more…

Working with the Pearson's chi-squared tests

Getting ready

How to do it…

How it works…

There's more…

Understanding the Wilcoxon Rank Sum and Signed Rank tests

Getting ready

How to do it…

How it works…

There's more…

Conducting one-way ANOVA

Getting ready

How to do it…

How it works…

There's more…

Performing two-way ANOVA

Getting ready

How to do it…

How it works…

There's more…

9. Rule and Pattern Mining with R

Introduction

Transforming data into transactions

Getting ready

How to do it…

How it works…

There's more…

Displaying transactions and associations

Getting ready

How to do it…

How it works…

There's more…

Mining associations with the Apriori rule

Getting ready

How to do it…

How it works…

There's more…

Pruning redundant rules

Getting ready

How to do it…

How it works…

There's more…

Visualizing association rules

Getting ready

How to do it…

How it works…

See also

Mining frequent itemsets with Eclat

Getting ready

How to do it…

How it works…

There's more…

Creating transactions with temporal information

Getting ready

How to do it…

How it works…

There's more…

Mining frequent sequential patterns with cSPADE

Getting ready

How to do it…

How it works…

See also

10. Time Series Mining with R

Introduction

Creating time series data

Getting ready

How to do it…

How it works…

There's more…

Plotting a time series object

Getting ready

How to do it…

How it works…

There's more…

Decomposing time series

Getting ready

How to do it…

How it works…

There's more…

Smoothing time series

Getting ready

How to do it…

How it works…

There's more…

Forecasting time series

Getting ready

How to do it…

How it works…

There's more…

Selecting an ARIMA model

Getting ready

How to do it…

How it works…

There's more…

Creating an ARIMA model

Getting ready

How to do it…

How it works…

There's more…

Forecasting with an ARIMA model

Getting ready

How to do it…

How it works…

There's more…

Predicting stock prices with an ARIMA model

Getting ready

How to do it…

How it works…

There's more…

11. Supervised Machine Learning

Introduction

Fitting a linear regression model with lm

Getting ready

How to do it…

How it works…

There's more…

Summarizing linear model fits

Getting ready

How to do it…

How it works…

There's more…

Using linear regression to predict unknown values

Getting ready

How to do it…

How it works…

There's more…

Measuring the performance of the regression model

Getting ready

How to do it…

How it works…

There's more…

Performing a multiple regression analysis

Getting ready

How to do it…

How it works…

There's more…

Selecting the best-fitted regression model with stepwise regression

Getting ready

How to do it…

How it works…

There's more…

Applying the Gaussian model for generalized linear regression

Getting ready

How to do it…

How it works…

See also

Performing a logistic regression analysis

Getting ready

How to do it…

How it works…

See also

Building a classification model with recursive partitioning trees

Getting ready

How to do it…

How it works…

See also

Visualizing a recursive partitioning tree

Getting ready

How to do it…

How it works…

See also

Measuring model performance with a confusion matrix

Getting ready

How to do it…

How it works…

Measuring prediction performance using ROCR

Getting ready

How to do it…

How it works…

See also…

12. Unsupervised Machine Learning

Introduction

Clustering data with hierarchical clustering

Getting ready

How to do it…

How it works…

There's more…

Cutting tree into clusters

Getting ready

How to do it…

How it works…

There's more…

Clustering data with the k-means method

Getting ready

How to do it…

How it works…

There's more…

Clustering data with the density-based method

Getting ready

How to do it…

How it works…

See also

Extracting silhouette information from clustering

Getting ready

How to do it…

How it works…

See also

Comparing clustering methods

Getting ready

How to do it…

How it works…

There's more…

Recognizing digits using the density-based clustering method

Getting ready

How to do it…

How it works…

See also

Grouping similar text documents with k-means clustering methods

Getting ready

How to do it…

How it works…

See also

Performing dimension reduction with Principal Component Analysis (PCA)

Getting ready

How to do it…

How it works…

There's more…

Determining the number of principal components using a scree plot

Getting ready

How to do it…

How it works…

There's more…

Determining the number of principal components using the Kaiser method

Getting ready

How to do it…

How it works…

See also

Visualizing multivariate data using a biplot

Getting ready

How to do it…

How it works…

See also

Index

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部