万本电子书0元读

万本电子书0元读

顶部广告

Using OpenRefine电子书

售       价:¥

3人正在读 | 0人评论 9.8

作       者:Ruben Verborgh

出  版  社:Packt Publishing

出版时间:2013-09-10

字       数:35.5万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:数字商品不支持退换货,不提供源文件,不支持导出打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
The book is styled on a Cookbook, containing recipes - combined with free datasets - which will turn readers into proficient OpenRefine users in the fastest possible way.This book is targeted at anyone who works on or handles a large amount of data. No prior knowledge of OpenRefine is required, as we start from the very beginning and gradually reveal more advanced features. You don't even need your own dataset, as we provide example data to try out the book's recipes.
目录展开

Using OpenRefine

Table of Contents

Using OpenRefine

Credits

Foreword

About the Authors

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers and more

Why Subscribe?

Free Access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example files

Errata

Piracy

Questions

1. Diving Into OpenRefine

Introducing OpenRefine

Recipe 1 – installing OpenRefine

Windows

Mac

Linux

Recipe 2 – creating a new project

File formats supported by OpenRefine

Recipe 3 – exploring your data

Recipe 4 – manipulating columns

Collapsing and expanding columns

Moving columns around

Renaming and removing columns

Recipe 5 – using the project history

Recipe 6 – exporting a project

Recipe 7 – going for more memory

Windows

Mac

Linux

Summary

2. Analyzing and Fixing Data

Recipe 1 – sorting data

Reordering rows

Recipe 2 – faceting data

Text facets

Numeric facets

Customized facets

Faceting by star or flag

Recipe 3 – detecting duplicates

Recipe 4 – applying a text filter

Recipe 5 – using simple cell transformations

Recipe 6 – removing matching rows

Summary

3. Advanced Data Operations

Recipe 1 – handling multi-valued cells

Recipe 2 – alternating between rows and records mode

Recipe 3 – clustering similar cells

Recipe 4 – transforming cell values

Recipe 5 – adding derived columns

Recipe 6 – splitting data across columns

Recipe 7 – transposing rows and columns

Summary

4. Linking Datasets

Recipe 1 – reconciling values with Freebase

Recipe 2 – installing extensions

Recipe 3 – adding a reconciliation service

Recipe 4 – reconciling with Linked Data

Recipe 5 – extracting named entities

Summary

A. Regular Expressions and GREL

Regular expressions for text patterns

Character classes

Quantifiers

Anchors

Choices

Groups

Overview

General Refine Expression Language (GREL)

Transforming data

Creating custom facets

Solving problems with GREL

Index

累计评论(0条) 0个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部