Hadoop Cluster Deployment (e-book)

Author: Danil Zburivsky

Publisher: Packt Publishing

Publication date: 2013-11-25

Word count: 1.62 million

This book is a step-by-step tutorial filled with practical examples that show you how to build and manage a Hadoop cluster, along with its intricacies. It is ideal for database administrators, data engineers, and system administrators, and it serves as a reference if you are planning to use the Hadoop platform in your organization. You are expected to have basic Linux skills, since all the examples in this book use this operating system. Access to test hardware or virtual machines is also useful for following the examples.
Hadoop Cluster Deployment

Table of Contents

Credits

About the Author

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers and more

Why Subscribe?

Free Access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Errata

Piracy

Questions

1. Setting Up Hadoop Cluster – from Hardware to Distribution

Choosing Hadoop cluster hardware

Choosing the DataNode hardware

Low storage density cluster

High storage density cluster

NameNode and JobTracker hardware configuration

The NameNode hardware

The JobTracker hardware

Gateway and other auxiliary services

Network considerations

Hadoop hardware summary

Hadoop distributions

Hadoop versions

Choosing Hadoop distribution

Cloudera Hadoop distribution

Hortonworks Hadoop distribution

MapR

Choosing OS for the Hadoop cluster

Summary

2. Installing and Configuring Hadoop

Configuring OS for Hadoop cluster

Choosing and setting up the filesystem

Setting up Java Development Kit

Other OS settings

Setting up the CDH repositories

Setting up NameNode

JournalNode, ZooKeeper, and Failover Controller

Hadoop configuration files

NameNode HA configuration

JobTracker configuration

Configuring the job scheduler

JobQueueTaskScheduler

FairScheduler

CapacityTaskScheduler

DataNode configuration

TaskTracker configuration

Advanced Hadoop tuning

hdfs-site.xml

mapred-site.xml

core-site.xml

Summary

3. Configuring the Hadoop Ecosystem

Hosting the Hadoop ecosystem

Sqoop

Installing and configuring Sqoop

Sqoop import example

Sqoop export example

Hive

Hive architecture

Installing Hive Metastore

Installing the Hive client

Installing Hive Server

Impala

Impala architecture

Installing Impala state store

Installing the Impala server

Summary

4. Securing Hadoop Installation

Hadoop security overview

HDFS security

MapReduce security

Hadoop Service Level Authorization

Hadoop and Kerberos

Kerberos overview

Kerberos in Hadoop

Configuring Kerberos clients

Generating Kerberos principals

Enabling Kerberos for HDFS

Enabling Kerberos for MapReduce

Summary

5. Monitoring Hadoop Cluster

Monitoring strategy overview

Hadoop Metrics

JMX Metrics

Monitoring Hadoop with Nagios

Monitoring HDFS

NameNode checks

JournalNode checks

ZooKeeper checks

Monitoring MapReduce

JobTracker checks

Monitoring Hadoop with Ganglia

Summary

6. Deploying Hadoop to the Cloud

Amazon Elastic MapReduce

Installing the EMR command-line interface

Choosing the Hadoop version

Launching the EMR cluster

Temporary EMR clusters

Preparing input and output locations

Using Whirr

Installing and configuring Whirr

Summary

Index
