万本电子书0元读

万本电子书0元读

顶部广告

Beginning Data Science with Python and Jupyter电子书

售       价:¥

59人正在读 | 0人评论 6.2

作       者:Alex Galea

出  版  社:Packt Publishing

出版时间:2018-06-05

字       数:251.3万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
Getting started with data science doesn't have to be an uphill battle. This step-by-step guide is ideal for beginners who know a little Python and are looking for a quick, fast-paced introduction. About This Book ? Get up and running with the Jupyter ecosystem and some example datasets ? Learn about key machine learning concepts like SVM, KNN classifiers and Random Forests ? Discover how you can use web scraping to gather and parse your own bespoke datasets Who This Book Is For This book is ideal for professionals with a variety of job descriptions across large range of industries, given the rising popularity and accessibility of data science. You'll need some prior experience with Python, with any prior work with libraries like Pandas, Matplotlib and Pandas providing you a useful head start. What You Will Learn ? Identify potential areas of investigation and perform exploratory data analysis ? Plan a machine learning classification strategy and train classification models ? Use validation curves and dimensionality reduction to tune and enhance your models ? Scrape tabular data from web pages and transform it into Pandas DataFrames ? Create interactive, web-friendly visualizations to clearly communicate your findings In Detail Get to grips with the skills you need for entry-level data science in this hands-on Python and Jupyter course. You'll learn about some of the most commonly used libraries that are part of the Anaconda distribution, and then explore machine learning models with real datasets to give you the skills and exposure you need for the real world. We'll finish up by showing you how easy it can be to scrape and gather your own data from the open web, so that you can apply your new skills in an actionable context. Style and approach This book covers every aspect of the standard data-workflow process within a day, along with theory, practical hands-on coding, and relatable illustrations.
目录展开

Table of Contents

Beginning Data Science with Python and Jupyter

Why Subscribe?

PacktPub.com

Contributors

About the author

About the reviewer

Packt is searching for authors like you

Preface

What This Book Covers

What You Need for This Book

Installation and Setup

Installing Anaconda

Updating Jupyter and Installing Dependencies

Who This Book is for

Conventions

Note

Tip

Reader Feedback

Customer Support

Downloading the Example Code

Errata

Piracy

Questions

Chapter 1. Jupyter Fundamentals

Lesson Objectives

Note

Basic Functionality and Features

Subtopic A: What is a Jupyter Notebook and Why is it Useful?

Subtopic B: Navigating the Platform

Introducing Jupyter Notebooks

Note

Note

Note

Subtopic C: Jupyter Features

Note

Explore some of Jupyter's most useful features

Converting a Jupyter Notebook to a Python Script

Subtopic D: Python Libraries

Note

Import the external libraries and set up the plotting environment

Our First Analysis - The Boston Housing Dataset

Subtopic A: Loading the Data into Jupyter Using a Pandas DataFrame

Note

Load the Boston housing dataset

Note

Note

Subtopic B: Data Exploration

Explore the Boston housing dataset

Note

Subtopic C: Introduction to Predictive Analytics with Jupyter Notebooks

Linear models with Seaborn and scikit-learn

Note

Note

Activity B: Building a Third-Order Polynomial Model

Note

Subtopic D: Using Categorical Features for Segmentation Analysis

Create categorical fields from continuous variables and make segmented visualizations

Note

Note

Summary

Chapter 2. Data Cleaning and Advanced Machine Learning

Preparing to Train a Predictive Model

Note

Subtopic A: Determining a Plan for Predictive Analytics

Subtopic B: Preprocessing Data for Machine Learning

Note

Explore data preprocessing tools and methods

Activity A: Preparing to Train a Predictive Model for the Employee-Retention Problem

Note

Note

Training Classification Models

Subtopic A: Introduction to Classification Algorithms

Note

Training two-feature classification models with scikit-learn

Note

The plot_decision_regions Function

Training k-nearest neighbors for our model

Note

Training a Random Forest

Note

Subtopic B: Assessing Models with k-Fold Cross-Validation and Validation Curves

Note

Note

Using k-fold cross validation and validation curves in Python with scikit-learn

Note

Note

Subtopic C: Dimensionality Reduction Techniques

Training a predictive model for the employee retention problem

Note

Note

Note

Note

Summary

Chapter 3. Web Scraping and Interactive Visualizations

Lesson Objectives

Scraping Web Page Data

Subtopic A: Introduction to HTTP Requests

Subtopic B: Making HTTP Requests in the Jupyter Notebook

Note

Handling HTTP requests with Python in a Jupyter Notebook

Subtopic C: Parsing HTML in the Jupyter Notebook

Parsing HTML with Python in a Jupyter Notebook

Note

Activity A: Web Scraping with Jupyter Notebooks

Note

Interactive Visualizations

Subtopic A: Building a DataFrame to Store and Organize Data

Building and merging Pandas DataFrames

Note

Note

Subtopic B: Introduction to Bokeh

Note

Introduction to interactive visualizations with Bokeh

Note

Activity B: Exploring Data with Interactive Visualizations

Note

Summary

Index

B

C

D

G

H

I

J

K

L

M

N

P

R

S

T

U

V

W

X

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部