万本电子书0元读

万本电子书0元读

顶部广告

Learning AWK Programming电子书

售       价:¥

34人正在读 | 0人评论 6.2

作       者:Shiwang Kalkhanda

出  版  社:Packt Publishing

出版时间:2018-03-26

字       数:44.6万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
Text processing and pattern matching simplified About This Book ? Master the fastest and most elegant big data munging language ? Implement text processing and pattern matching using the advanced features of AWK and GAWK ? Implement debugging and inter-process communication using GAWK Who This Book Is For This book is for developers or analysts who are inclined to learn how to do text processing and data extraction in a Unix-like environment. Basic understanding of Linux operating system and shell scripting will help you to get the most out of the book. What You Will Learn ? Create and use different expressions and control flow statements in AWK ? Use Regular Expressions with AWK for effective text-processing ? Use built-in and user-defined variables to write AWK programs ? Use redirections in AWK programs and create structured reports ? Handle non-decimal input, 2-way inter-process communication with Gawk ? Create small scripts to reformat data to match patterns and process texts In Detail AWK is one of the most primitive and powerful utilities which exists in all Unix and Unix-like distributions. It is used as a command-line utility when performing a basic text-processing operation, and as programming language when dealing with complex text-processing and mining tasks. With this book, you will have the required expertise to practice advanced AWK programming in real-life examples. The book starts off with an introduction to AWK essentials. You will then be introduced to regular expressions, AWK variables and constants, arrays and AWK functions and more. The book then delves deeper into more complex tasks, such as printing formatted output in AWK, control flow statements, GNU's implementation of AWK covering the advanced features of GNU AWK, such as network communication, debugging, and inter-process communication in the GAWK programming language which is not easily possible with AWK. By the end of this book, the reader will have worked on the practical implementation of text processing and pattern matching using AWK to perform routine tasks. Style and approach An easy-to-follow, step by step guide which will help you get to grips with real-world applications of AWK programming.
目录展开

Title Page

Copyright and Credits

Learning AWK Programming

Dedication

Packt Upsell

Why subscribe?

PacktPub.com

Contributors

About the author

About the reviewers

Packt is searching for authors like you

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Reviews

Getting Started with AWK Programming

AWK programming language overview

What is AWK?

Types of AWK

When and where to use AWK

Getting started with AWK

Installation on Linux

Using the package manager

Compiling from the source code

Workflow of AWK

Action and pattern structure of AWK

Example data file

Pattern-only statements

Action-only statements

Printing each input line/record

Using the BEGIN and END blocks construct

The BEGIN block

The body block

The END block

Patterns

Actions

Running AWK programs

AWK as a Unix command line

AWK as a filter (reading input from the Terminal)

Running AWK programs from the source file

AWK programs as executable script files

Extending the AWK command line on multiple lines

Comments in AWK

Shell quotes with AWK

Data files used as examples in this book

Some simple examples with default usage

Multiple rules with AWK

Using standard input with names in AWK

AWK standard options

Standard command-line options

The -F option – field separator

The -f option (read source file)

The -v option (assigning variables)

GAWK-only options

The --dump-variables option (AWK global variables)

The --profile option (profiling)

The --sandbox option

The -i option (including other files in your program)

Include other files in the GAWK program (using @include)

The -V option

Summary

Working with Regular Expressions

Introduction to regular expressions

What is a regular expression?

Why use regular expressions?

Using regular expressions with AWK

Regular expressions as string-matching patterns with AWK

Basic regular expression construct

Understanding regular expression metacharacters

Quoted metacharacter

Anchors

Matching at the beginning of a string

Matching at the end of a string

Dot

Brackets expressions

Character classes

Named character classes (POSIX standard)

Complemented bracket expressions

Complemented character classes

Complemented named character classes

Alternation operator

Unary operator for repetition

Closure

Positive closure

Zero or one

Repetition ranges with interval expressions

A single number in brackets

A single number followed by a comma in brackets

Two numbers in brackets

Grouping using parentheses

Concatenation using alternation operator within parentheses

Backreferencing in regular expressions – sed and grep

Precedence in regular expressions

GAWK-specific regular expression operators

Matching whitespaces

Matching not whitespaces

Matching words (\w)

Matching non-words

Matching word boundaries

Matching at the beginning of a word

Matching at the end of a word

Matching not as a sub-string using

Matching a string as sub-string only using

Case-sensitive matching

Escape sequences

Summary

AWK Variables and Constants

Built-in variables in AWK

Field separator

Using a single character or simple string as a value of the FS

Using regular expressions as values of the FS

Using each character as a separate field

Using the command line to set the FS as -F

Output field separator

Record separator

Outputting the record separator

NR and NF

FILENAME

Environment variables in AWK

ARGC and ARGV

CONVFMT and OFMT

RLENGTH and RSTART

FNR

ENVIRON and SUBSET

FIELD (POSITIONAL) VARIABLE ($0 and $n)

Environment variables in GAWK

ARGIND

ERRNO

FIELDWIDTHS

IGNORECASE

PROCINFO

String constants

Numeric constants

Conversion between strings and numbers

Summary

Working with Arrays in AWK

One-dimensional arrays

Assignment in arrays

Accessing elements in arrays

Referring to members in arrays

Processing arrays using loops

Using the split() function to create arrays

Delete operation in arrays

Multidimensional arrays

Summary

Printing Output in AWK

The print statement

Role of output separator in print statement

Pretty printing with the printf statement

Escape sequences for special character printing

Different format control characters in the format specifier

Format specification modifiers

Printing with fixed column width

Using the minus modifier (-) for left justification

Printing with fixed width – right justified

Using hash modifier (#)

Using plus modifier (+) for prefixing with sign/symbol

Printing with prefix sign/symbol

Dot precision as modifier

Positional modifier using integer constant followed by $ (N$):

Redirecting output to file

Redirecting output to a file (>)

Appending output to a file (>>)

Sending output on other commands using pipe (|)

Special file for redirecting output (/dev/null, stderr)

Closing files and pipes

Summary

AWK Expressions

AWK variables and constants

Arithmetic expressions using binary operators

Assignment expressions

Increment and decrement expressions

Relational expressions

Logical or Boolean expressions

Ternary expressions

Unary expressions

Exponential expressions

String concatenation

Regular expression operators

Operators' Precedence

Summary

AWK Control Flow Statements

Conditional statements

The if statement

if

If...else

The if...else...if statement

The switch statement (a GAWK-specific feature)

Looping statement

The while loop

do...while loop statement

The for loop statement

For each loop statement

Statements affecting flow control

Break usage

Usage of continue

Exit usage

Next usage

Summary

AWK Functions

Built-in functions

Arithmetic functions

The sin (expr) function

The cos (expr) function

The atan2 (x, y) function

The int (expr) function

The exp (expr) function

The log (expr) function

The sqrt (expr) function

The rand() function

The srand ([expr]) function

Summary table of built-in arithmetic functions

String functions

The index (str, sub) function

The length ( string ) function

The split (str, arr, regex) function

The substr (str, start, [ length ]) function

The sub (regex, replacement, string) function

The gsub (regex, replacement, string) function

The gensub (regex, replacement, occurrence, [ string ]) function

The match (string, regex) function

The tolower (string) function

The toupper (string) function

The sprintf (format, expression) function

The strtonum (string) function

Summary table of built-in string functions

Input/output functions

The close (filename [to/from]) function

The fflush ([ filename ]) function

The system (command) function

The getline command

Simple getline

Getline into a variable

Getline from a file

Using getline to get a variable from a file

Using getline to output into a pipe

Using getline to change the output into a variable from a pipe

Using getline to change the output into a variable from a coprocess

The nextfile() function

The time function

The systime() function

The mktime (datespec) function

The strftime (format, timestamp) function

Bit-manipulating functions

The and (num1, num2) function

The or (num1, num2) function

The xor (num1, num2) function

The lshift (val, count) function

The rshift (val, count) function

The compl (num) function

User-defined functions

Function definition and syntax

Calling user-defined functions

Controlling variable scope

Return statement

Making indirect function calls

Summary

GNU's Implementation of AWK – GAWK (GNU AWK)

Things you don't know about GAWK

Reading non-decimal input

GAWK's built-in command line debugger

What is debugging?

Debugger concepts

Using GAWK as a debugger

Starting the debugger

Set breakpoint

Removing the breakpoint

Running the program

Looking inside the program

Displaying some variables and data

Setting watch and unwatch

Controlling the execution

Viewing environment information

Saving the commands in file

Exiting the debugger

Array sorting

Sort array by values using asort( )

Sort array indexes using asorti()

Two-way inter-process communication

Using GAWK for network programming

TCP client and server (/inet/tcp)

UDP client and server ( /inet/udp )

Reading a web page using HttpService

Profiling

Summary

Practical Implementation of AWK

Working with one-liners for text processing and pattern matching with AWK

Selective printing of lines with AWK

Modifying line spacing in a file with AWK

Numbering and calculations with AWK

Selective deletion of certain lines in a file with AWK

String operation on selected lines with AWK

Array creation with AWK one-liner

Text conversion and substitution in files with AWK

One-liners for system administrators

Use case examples of pattern matching using AWK

Parsing web server (Apache/Nginx) log files

Understanding the Apache combined log format

Using AWK for processing different log fields

Identifying problems with the running website

Printing the top 10 request IP addresses with their GeoIP information

Counting and printing unique visits to a website

Real-time IP address lookup for requests

Converting text to HTML table

Converting decimal to binary

Renaming files in a directory with AWK

Printing a generated sequence of numbers in a specified columnate format

Transposing a matrix

Processing multiple files using AWK

Summary

Further reading

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部