OpenStack Sahara Essentials (e-book)

Author: Omar Khedher

Publisher: Packt Publishing

Publication date: 2016-04-01

Word count: 1.477 million

Category: Imported books > Foreign-language originals > Computers/Internet

Integrate, deploy, rapidly configure, and successfully manage your own big data-intensive clusters in the cloud using OpenStack Sahara

About This Book

  • A fast-paced guide to help you utilize the benefits of Sahara in OpenStack to meet the Big Data world of Hadoop.
  • A step-by-step approach to simplify the complexity of Hadoop configuration, deployment, and maintenance.

Who This Book Is For

This book targets data scientists, cloud developers, and DevOps engineers who would like to become proficient with OpenStack Sahara. It is well suited to readers who are familiar with databases, Hadoop, and Spark solutions. Basic prior knowledge of OpenStack is expected. Readers should also be familiar with different Linux boxes, distributions, and virtualization technology.

What You Will Learn

  • Integrate and install Sahara in an OpenStack environment
  • Learn the Sahara architecture under the hood
  • Rapidly configure and scale Hadoop clusters on top of OpenStack
  • Explore the Sahara REST API to create, deploy, and manage a Hadoop cluster
  • Learn the Elastic Data Processing (EDP) facility to execute jobs in clusters from Sahara
  • Cover the other stable Hadoop plugins supported by Sahara
  • Discover the different features provided by Sahara for Hadoop provisioning and deployment
  • Learn how to troubleshoot OpenStack Sahara issues

In Detail

The Sahara project is a module that aims to simplify the building of data processing capabilities on OpenStack. The goal of this book is to provide a focused, fast-paced guide to installing, configuring, and getting started with integrating Hadoop with OpenStack using Sahara. The book explains how to deploy data-intensive Hadoop and Spark clusters on top of OpenStack. It also covers how to use the Sahara REST API, how to develop applications for Elastic Data Processing on OpenStack, and how to set up Hadoop or Spark clusters on OpenStack.

Style and approach

This book takes a step-by-step approach to teaching how to integrate, deploy, and manage data using OpenStack Sahara. It shows how OpenStack Sahara helps by simplifying the complexity of Hadoop configuration, deployment, and maintenance.
OpenStack Sahara Essentials

Table of Contents

Credits

About the Author

About the Reviewer

www.PacktPub.com

eBooks, discount offers, and more

Why subscribe?

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Errata

Piracy

Questions

1. The Essence of Big Data in the Cloud

It is all about data

The dimensions of big data

The big challenge of big data

The revolution of big data

A key of big data success

Use case: Elastic MapReduce

OpenStack crossing big data

Sahara: bringing big data to the cloud

Sahara in OpenStack

The Sahara OpenStack mission

The Sahara's architecture

Summary

2. Integrating OpenStack Sahara

Preparing the test infrastructure environment

OpenStack test topology environment

OpenStack test networking layout

OpenStack test environment design

Installing OpenStack

Network requirements

System requirements

Running the RDO installation

Integrating Sahara

Installing and configuring OpenStack Sahara

Installing the Sahara user interface

Summary

3. Using OpenStack Sahara

Planning a Hadoop deployment

Assigning Hadoop nodes

Sahara provisioning plugins

Creating a Hadoop cluster

Preparing the image from Horizon

Preparing the image using CLI

Creating the Node Group Template

Creating the Node Group Template in Horizon

Creating a Node Group Template using CLI

Creating the Node Cluster Template

Creating the Node Cluster Template with Horizon

Creating the Node Cluster Template using CLI

Launching the Hadoop cluster

Launching the Hadoop cluster with Horizon

Launching the Hadoop cluster using the CLI

Summary

4. Executing Jobs with Sahara

Job glossary in Sahara

Job binaries in Sahara

Jobs in Sahara

Running jobs in Sahara

Executing jobs via Horizon

Executing jobs using the Sahara RESTful API

API authentication

Launching an EDP job

Registering a Spark image using REST API

Creating Spark node group templates

Creating a Spark cluster template

Launching the Spark cluster

Creating a job binary

Creating a Spark job template

Executing the Spark job

Extending the Spark job

Summary

5. Discovering Advanced Features with Sahara

Sahara plugins

Vanilla Apache Hadoop

Building an image for the Apache Vanilla plugin

Vanilla Apache requirements and limitations

Hortonworks Data Platform plugin

Building an image for the HDP plugin

HDP requirements and limitations

Cloudera Distribution Hadoop plugin

Building an image for the CDH plugin

CDH requirements and limitations

Apache Spark plugin

Building an image for the Spark plugin

Spark requirements and limitations

Affinity and anti-affinity

Anti-affinity in action

Boosting Elastic Data Processing performance

Defining the network

Increasing data reliability

Summary

6. Hadoop High Availability Using Sahara

HDP high-availability support

Minimum requirements for the HA Hadoop cluster in Sahara

HA Hadoop cluster templates

CDH high-availability support

Summary

7. Troubleshooting

Troubleshooting OpenStack

OpenStack debug tool

Troubleshooting SELinux

Troubleshooting identity

Troubleshooting networking

Troubleshooting data processing

Debugging Sahara

Logging Sahara

Troubleshooting missing services

Troubleshooting cluster creation

Troubleshooting user quota

Troubleshooting cluster scaling

Troubleshooting cluster access

Summary

Index
