万本电子书0元读

万本电子书0元读

顶部广告

Getting Started with Python Data Analysis电子书

售       价:¥

3人正在读 | 0人评论 6.2

作       者:Phuong Vo.T.H

出  版  社:Packt Publishing

出版时间:2015-11-04

字       数:50.5万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Learn to use powerful Python libraries for effective data processing and analysisAbout This BookLearn the basic processing steps in data analysis and how to use Python in this area through supported packages, especially Numpy, Pandas, and MatplotlibCreate, manipulate, and analyze your data to extract useful information to optimize your systemA hands-on guide to help you learn data analysis using Python Who This Book Is For If you are a Python developer who wants to get started with data analysis and you need a quick introductory guide to the python data analysis libraries, then this book is for you.What You Will LearnUnderstand the importance of data analysis and get familiar with its processing stepsGet acquainted with Numpy to use with arrays and array-oriented computing in data analysisCreate effective visualizations to present your data using MatplotlibProcess and analyze data using the time series capabilities of PandasInteract with different kind of database systems, such as file, disk format, Mongo, and RedisApply the supported Python package to data analysis applications through examplesExplore predictive analytics and machine learning algorithms using Scikit-learn, a Python library In Detail Data analysis is the process of applying logical and analytical reasoning to study each component of data. Python is a multi-domain, high-level, programming language. It’s often used as a *ing language because of its forgiving syntax and operability with a wide variety of different eco-systems. Python has powerful standard libraries or toolkits such as Pylearn2 and Hebel, which offers a fast, reliable, cross-platform environment for data analysis. With this book, we will get you started with Python data analysis and show you what its advantages are. The book starts by introducing the principles of data analysis and supported libraries, along with NumPy basics for statistic and data processing. Next it provides an overview of the Pandas package and uses its powerful features to solve data processing problems. Moving on, the book takes you through a brief overview of the Matplotlib API and some common plotting functions for DataFrame such as plot. Next, it will teach you to manipulate the time and data structure, and load and store data in a file or database using Python packages. The book will also teach you how to apply powerful packages in Python to process raw data into pure and helpful data using examples. Finally, the book gives you a brief overview of machine learning algorithms, that is, applying data analysis results to make decisions or build helpful products, such as recommendations and predictions using scikit-learn.Style and approach This is an easy-to-follow, step-by-step guide to get you familiar with data analysis and the libraries supported by Python. Topics are explained with real-world examples wherever required.
目录展开

Getting Started with Python Data Analysis

Table of Contents

Getting Started with Python Data Analysis

Credits

About the Authors

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Introducing Data Analysis and Libraries

Data analysis and processing

An overview of the libraries in data analysis

Python libraries in data analysis

NumPy

Pandas

Matplotlib

PyMongo

The scikit-learn library

Summary

2. NumPy Arrays and Vectorized Computation

NumPy arrays

Data types

Array creation

Indexing and slicing

Fancy indexing

Numerical operations on arrays

Array functions

Data processing using arrays

Loading and saving data

Saving an array

Loading an array

Linear algebra with NumPy

NumPy random numbers

Summary

3. Data Analysis with Pandas

An overview of the Pandas package

The Pandas data structure

Series

The DataFrame

The essential basic functionality

Reindexing and altering labels

Head and tail

Binary operations

Functional statistics

Function application

Sorting

Indexing and selecting data

Computational tools

Working with missing data

Advanced uses of Pandas for data analysis

Hierarchical indexing

The Panel data

Summary

4. Data Visualization

The matplotlib API primer

Line properties

Figures and subplots

Exploring plot types

Scatter plots

Bar plots

Contour plots

Histogram plots

Legends and annotations

Plotting functions with Pandas

Additional Python data visualization tools

Bokeh

MayaVi

Summary

5. Time Series

Time series primer

Working with date and time objects

Resampling time series

Downsampling time series data

Upsampling time series data

Time zone handling

Timedeltas

Time series plotting

Summary

6. Interacting with Databases

Interacting with data in text format

Reading data from text format

Writing data to text format

Interacting with data in binary format

HDF5

Interacting with data in MongoDB

Interacting with data in Redis

The simple value

List

Set

Ordered set

Summary

7. Data Analysis Application Examples

Data munging

Cleaning data

Filtering

Merging data

Reshaping data

Data aggregation

Grouping data

Summary

8. Machine Learning Models with scikit-learn

An overview of machine learning models

The scikit-learn modules for different models

Data representation in scikit-learn

Supervised learning – classification and regression

Unsupervised learning – clustering and dimensionality reduction

Measuring prediction performance

Summary

Index

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部