万本电子书0元读

万本电子书0元读

顶部广告

Hands-On Exploratory Data Analysis with R电子书

售       价:¥

0人正在读 | 0人评论 9.8

作       者:Radhika Datar

出  版  社:Packt Publishing

出版时间:2019-05-31

字       数:21.9万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Learn exploratory data analysis concepts using powerful R packages to enhance your R data analysis skills Key Features * Speed up your data analysis projects using powerful R packages and techniques * Create multiple hands-on data analysis projects using real-world data * Discover and practice graphical exploratory analysis techniques across domains Book Description Hands-On Exploratory Data Analysis with R will help you build not just a foundation but also expertise in the elementary ways to analyze data. You will learn how to understand your data and summarize its main characteristics. You'll also uncover the structure of your data, and you'll learn graphical and numerical techniques using the R language. This book covers the entire exploratory data analysis (EDA) process—data collection, generating statistics, distribution, and invalidating the hypothesis. As you progress through the book, you will learn how to set up a data analysis environment with tools such as ggplot2, knitr, and R Markdown, using tools such as DOE Scatter Plot and SML2010 for multifactor, optimization, and regression data problems. By the end of this book, you will be able to successfully carry out a preliminary investigation on any dataset, identify hidden insights, and present your results in a business context. What you will learn * Learn powerful R techniques to speed up your data analysis projects * Import, clean, and explore data using powerful R packages * Practice graphical exploratory analysis techniques * Create informative data analysis reports using ggplot2 * Identify and clean missing and erroneous data * Explore data analysis techniques to analyze multi-factor datasets Who this book is for Hands-On Exploratory Data Analysis with R is for data enthusiasts who want to build a strong foundation for data analysis. If you are a data analyst, data engineer, software engineer, or product manager, this book will sharpen your skills in the complete workflow of exploratory data analysis.
目录展开

Dedication

About Packt

Why subscribe?

Packt.com

Contributors

About the authors

About the reviewer

Packt is searching for authors like you

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Code in Action

Conventions used

Get in touch

Reviews

Section 1: Setting Up Data Analysis Environment

Setting Up Our Data Analysis Environment

Technical requirements

The benefits of EDA across vertical markets

Manipulating data

Examining, cleaning, and filtering data

Visualizing data

Creating data reports

Installing the required R packages and tools

Installing R packages from the Terminal

Installing R packages from inside RStudio

Summary

Importing Diverse Datasets

Technical requirements

Converting rectangular data into R with the readr R package

readr read functions

read_tsv method

read_delim method

read_fwf method

read_table method

read_log method

Reading in Excel data with the readxl R package

Reading in JSON data with the jsonlite R package

Loading the jsonlite package

Getting data into R from web APIs using the httr R package

Getting data into R by scraping the web using the rvest package

Importing data into R from relational databases using the DBI R package

Summary

Examining, Cleaning, and Filtering

Technical requirements

About the dataset

Reshaping and tidying up erroneous data

The gather() function

The unite() function

The separate() function

The spread() function

Manipulating and mutating data

The mutate() function

The group_by() function

The summarize() function

The arrange() function

The glimpse() function

Selecting and filtering data

The select() function

The filter() function

Cleaning and manipulating time series data

Summary

Visualizing Data Graphically with ggplot2

Technical requirements

Advanced graphics grammar of ggplot2

Data

Layers

Scales

The coordinate system

Faceting

Theme

Installing ggplot2

Scatter plots

Histogram plots

Density plots

Probability plots

dnorm()

pnorm()

rnorm()

Box plots

Residual plots

Summary

Creating Aesthetically Pleasing Reports with knitr and R Markdown

Technical requirements

Installing R Markdown

Working with R Markdown

Reproducible data analysis reports with knitr

Exporting and customizing reports

Summary

Section 2: Univariate, Time Series, and Multivariate Data

Univariate and Control Datasets

Technical requirements

Reading the dataset

Cleaning and tidying up the data

Understanding the structure of the data

Hypothesis tests

Statistical hypothesis in R

The t-test in R

Directional hypothesis in R

Correlation in R

Tietjen-Moore test

Parsimonious models

Probability plots

The Shapiro-Wilk test

Summary

Time Series Datasets

Technical requirements

Introducing and reading the dataset

Cleaning the dataset

Mapping and understanding structure

Hypothesis test

t-test in R

Directional hypothesis in R

Grubbs' test and checking outliers

Parsimonious models

Bartlett's test

Data visualization

Autocorrelation plots

Spectrum plots

Phase plots

Summary

Multivariate Datasets

Technical requirements

Introducing and reading a dataset

Cleaning the data

Mapping and understanding the structure

Hypothesis test

t-test in R

Directional hypothesis in R

Parsimonious model

Levene's test

Data visualization

Principal Component Regression

Partial Least Squares Regression

Summary

Section 3: Multifactor, Optimization, and Regression Data Problems

Multi-Factor Datasets

Technical requirements

Introducing and reading the dataset

Cleaning the dataset

Mapping and understanding data structure

Hypothesis test

t-test in R

Directional hypothesis in R

Grubbs test and checking outliers

Parsimonious model

Multi-factor variance analysis

Exploring graphically the dataset

Summary

Handling Optimization and Regression Data Problems

Technical requirements

Introducing and reading a dataset

Cleaning the dataset

Mapping and understanding the data structure

Hypothesis test

t-test in R

Directional hypothesis in R

Grubbs' test and checking outliers

Parsimonious model

Exploration using graphics

Summary

Section 4: Conclusions

Next Steps

Technical requirements

What to learn next

Why R?

Environmental setup

R syntax

R packages

Understanding the help system

The data analysis workflow

Data import

Manipulating data

Visualizing data

Reporting results

Standout as R wizard

Building a data science portfolio

Datasets in R

Getting help with exploratory data analysis

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部