万本电子书0元读

万本电子书0元读

顶部广告

Hands-On Data Science with the Command Line电子书

售       价:¥

0人正在读 | 0人评论 9.8

作       者:Jason Morris

出  版  社:Packt Publishing

出版时间:2019-01-31

字       数:11.7万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Big data processing and analytics at speed and scale using command line tools. Key Features * Perform string processing, numerical computations, and more using CLI tools * Understand the essential components of data science development workflow * Automate data pipeline scripts and visualization with the command line Book Description The Command Line has been in existence on UNIX-based OSes in the form of Bash shell for over 3 decades. However, very little is known to developers as to how command-line tools can be OSEMN (pronounced as awesome and standing for Obtaining, Scrubbing, Exploring, Modeling, and iNterpreting data) for carrying out simple-to-advanced data science tasks at speed. This book will start with the requisite concepts and installation steps for carrying out data science tasks using the command line. You will learn to create a data pipeline to solve the problem of working with small-to medium-sized files on a single machine. You will understand the power of the command line, learn how to edit files using a text-based and an. You will not only learn how to automate jobs and scripts, but also learn how to visualize data using the command line. By the end of this book, you will learn how to speed up the process and perform automated tasks using command-line tools. What you will learn * Understand how to set up the command line for data science * Use AWK programming language commands to search quickly in large datasets. * Work with files and APIs using the command line * Share and collect data with CLI tools * Perform visualization with commands and functions * Uncover machine-level programming practices with a modern approach to data science Who this book is for This book is for data scientists and data analysts with little to no knowledge of the command line but has an understanding of data science. Perform everyday data science tasks using the power of command line tools.
目录展开

Title Page

Copyright and Credits

Hands-On Data Science with the Command Line

About Packt

Why subscribe?

Packt.com

Contributors

About the authors

About the reviewers

Packt is searching for authors like you

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Reviews

Data Science at the Command Line and Setting It Up

History of the command line

We don't want to BaSH other shells, but...

Language-focused shells

So, why the command line?

Getting set up with Windows 10

Getting set up on OS X

Getting set up on Ubuntu Linux

Getting set up with Docker

Summary

Essential Commands

Essential commands

Navigating the command line

Getting help

Customizing the shell

Summary

Shell Workflows, and Data Acquisition and Massaging

Download the data

Using the file command

Performing a word count

Introduction to cut

Detached processing

How to background a process

Disregarding SIGHUP

Terminal multiplexers

Introduction to screen

Sharing a screen session between multiple users

Introduction to tmux

Summary

Bash Functions and Data Visualization

My first shell script

She bangs, she bangs!

Function arguments, positional parameters, and IFS

Prompt me baby one more time

Feed the function input!

Down the rabbit hole of IFS and bash arrays

Advanced shell scripting magic

Here be dragons, ye be warned

Text injection of text files

Bash networks for fun and profit!

From dumb Terminal to glam Terminal

Who, what, where, why, how?

Enter the mind's eye

Summary

Loops, Functions, and String Processing

Once, twice, three times a lady loops

It's the end of the world as we know it while and until

The simple case

Pay no heed to the magician redirecting your attention

Regular expressions and grep

Exact matches

Character sets

Dot the i (or anything else)

Capture groups

Either or, neither nor

Repetition

Other operators

Putting it all together

awk, sed, and tr

awk

sed

tr

sort and uniq

sort

uniq

Summary

SQL, Math, and Wrapping it up

cut and viewing data as columnar

WHERE clauses

Join, for joining data

Group by and ordering

Simulating selects

Keys to the kingdom

Using SQLite

Math in bash itself

Using let

Basic arithmetic

Double-parentheses

bc, the unix basic calculator

Math in (g)awk

Python (pandas, numpy, scikit-learn)

Analyzing weather data in bash

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部