Aucun appareil Kindle n'est requis. Téléchargez l'une des applis Kindle gratuites et commencez à lire les livres Kindle sur votre smartphone, tablette ou ordinateur.

  • Apple
  • Android
  • Windows Phone
  • Android

Pour obtenir l'appli gratuite, saisissez votre numéro de téléphone mobile.

Prix Kindle : EUR 18,45

EUR 24,78 (57%)

TVA incluse

Ces promotions seront appliquées à cet article :

Certaines promotions sont cumulables avec d'autres offres promotionnelles, d'autres non. Pour en savoir plus, veuillez vous référer aux conditions générales de ces promotions.

Envoyer sur votre Kindle ou un autre appareil

Envoyer sur votre Kindle ou un autre appareil

Cloudera Administration Handbook par [Menon, Rohit]
Publicité sur l'appli Kindle

Cloudera Administration Handbook Format Kindle

Voir les formats et éditions Masquer les autres formats et éditions
Prix Amazon
Neuf à partir de Occasion à partir de
Format Kindle
"Veuillez réessayer"
EUR 18,45

Descriptions du produit

Présentation de l'éditeur

A complete, hands-on guide to building and maintaining large Apache Hadoop clusters using Cloudera Manager and CDH5

About This Book

  • Understand the CDH architecture and its components and successfully set up a Hadoop cluster

  • Maintain, troubleshoot, and secure your cluster using Cloudera Manager

  • Easy-to-follow administrator's guide with step-by-step explanations to help you master Apache Hadoop

Who This Book Is For

This book is great for administrators interested in setting up and managing a large Hadoop cluster. If you are an administrator, or want to be an administrator, and you are ready to build and maintain a production-level cluster running CDH5, then this book is for you.

What You Will Learn

  • Understand the Apache Hadoop architecture and the future of distributed processing frameworks

  • Use HDFS and MapReduce for all file-related operations

  • Install and configure CDH to bring up an Apache Hadoop cluster

  • Configure HDFS High Availability and HDFS Federation to prevent single points of failure

  • Install and configure Cloudera Manager to perform administrator operations

  • Implement security by installing and configuring Kerberos for all services in the cluster

  • Add, remove, and rebalance nodes in a cluster using cluster management tools

  • Understand and configure the different backup options to back up your HDFS

In Detail

Apache Hadoop is an open source distributed computing technology that assists users in processing large volumes of data with relative ease, helping them to generate tremendous insights into their data. Cloudera, with their open source distribution of Hadoop, has made data analytics on big data possible and accessible to anyone interested.

This book fully prepares you to be a Hadoop administrator, with special emphasis on Cloudera's CDH. It provides step-by-step instructions on setting up and managing a robust Hadoop cluster running CDH5. This book will also equip you with an understanding of tools such as Cloudera Manager, which is currently being used by many companies to manage Hadoop clusters with hundreds of nodes. You will learn how to set up security using Kerberos. You will also use Cloudera Manager to set up alerts and events that will help you monitor and troubleshoot cluster issues.

Biographie de l'auteur

Rohit Menon

Rohit Menon is a senior system analyst living in Denver, Colorado. He has over 7 years of experience in the field of Information Technology, which started with the role of a real-time applications developer back in 2006. He now works for a product-based company specializing in software for large telecom operators. He graduated with a master's degree in Computer Applications from University of Pune, where he built an autonomous maze-solving robot as his final year project. He later joined a software consulting company in India where he worked on C#, SQL Server, C++, and RTOS to provide software solutions to reputable organizations in USA and Japan. After this, he started working for a product-based company where most of his time was dedicated to programming the finer details of products using C++, Oracle, Linux, and Java. He is a person who always likes to learn new technologies and this got him interested in web application development. He picked up Ruby, Ruby on Rails, HTML, JavaScript, CSS, and built, a Netflix search engine that makes searching for titles on Netflix much easier. On the Hadoop front, he is a Cloudera Certified Apache Hadoop Developer. He blogs at, mainly on topics related to Apache Hadoop and its components. To share his learning, he has also started, a website that teaches Apache Hadoop using simple, short, and easy-to-follow screencasts. He is well versed with wide variety of tools and techniques such as MapReduce, Hive, Pig, Sqoop, Oozie, and Talend Open Studio.

Détails sur le produit

  • Format : Format Kindle
  • Taille du fichier : 34067 KB
  • Nombre de pages de l'édition imprimée : 254 pages
  • Editeur : Packt Publishing (18 juillet 2014)
  • Vendu par : Amazon Media EU S.à r.l.
  • Langue : Anglais
  • ASIN: B00LZU0RI4
  • Synthèse vocale : Activée
  • X-Ray :
  • Word Wise: Non activé
  • Composition améliorée: Activé
  • Moyenne des commentaires client : Soyez la première personne à écrire un commentaire sur cet article
  • Classement des meilleures ventes d'Amazon: n°284.153 dans la Boutique Kindle (Voir le Top 100 dans la Boutique Kindle)
  •  Voulez-vous faire un commentaire sur des images ou nous signaler un prix inférieur ?

click to open popover

Commentaires en ligne

Il n'y a pas encore de commentaires clients sur
5 étoiles
4 étoiles
3 étoiles
2 étoiles
1 étoile

Commentaires client les plus utiles sur (beta) 3.8 étoiles sur 5 9 commentaires
3 internautes sur 3 ont trouvé ce commentaire utile 
5.0 étoiles sur 5 Fantastic resource for anybody wanting to understand and manage a Cloudera platform. 22 septembre 2014
Par william El Kaim - Publié sur
Format: Broché
The Cloudera Administration Handbook written by Rohit Menon is a fantastic resource for anybody wanting to understand and manage a Cloudera platform.
I have to admit that I’m a rookie and that this book was exactly what I was dreaming of. Having all information in the same place, and code example both for Linux and Windows.
The book is mainly targeted at bid data expert and system administrator. The first three chapters are giving the minimum background to understand MapReduce, Hadoop and Yarn and the Cloudera's Distribution Including Apache Hadoop (all services are listed and explained).
Then, you enter into the “hard part”. Chapter 4 discussing in details HDFS Federation and Its High Availability and chapter 7 describing “Managing an Apache Hadoop Cluster” were for me particularly valuable. The chapter 5 presenting Cloudera Manager, a web-browser-based administration tool to manage Apache Hadoop clusters, will show you how to manage the clusters with point and clicks instead of command lines. Chapter 6 is about configuring access and right using the Kerberos services. It does show you how to implement the security services, but not how to manage user rights, which is a step requiring some planning. Monitoring and backup (using the Hadoop utility DistCp and the Cloudera manager). are also presented in two distinct parts.
What I like in this book is that it goes directly to the point, assuming you already know the basics of system administration and distributed architecture. It then shares many “tips” that only an experienced professional will know, and enables the rookie I was to avoid mistakes. With this book, you will gain time. For example, the author told you when a SPOF (single point of failure) exist and the solutions to avoid them.
The only part of the book that was missing for me was the cloud deployment. I would have liked a chapter explaining how to setup Cloudera in the cloud, and get the code (puppet or chef) to automate the install.
It is clearly a worth buying book for people wanting to setup and deploy correctly a Cloudera platform. I also like the fact that for the same price you can download the PDF, mobi, epub and kindle version.
1 internautes sur 1 ont trouvé ce commentaire utile 
4.0 étoiles sur 5 get Cloudera Manager ? 24 août 2014
Par W Boudville - Publié sur
Format: Broché
This book is a rapid and useful introduction to the use of Apache Hadoop to analyse huge data sets. But the central assumption is that you are not necessarily a data scientist per se, but an administrator of a Hadoop system. So the text really treads lightly if at all on the intricacies of data analysis. Instead, the merits of what Cloudera offers are explained.

Top level material is covered, including building a cluster and installing and if needed upgrading Hadoop on it. Very tightly related to this is running MapReduce, which is the analysis engine optimised for a Hadoop cluster. The demands are specialised enough that ancillary processes [daemons] are needed. Like a job tracker program, giving information about scheduling of jobs, a status of the hardware in the cluster, and what jobs are currently running. If you have ever been system administrator of a computer cluster, especially a unix cluster, you may have seen similar programs, albeit on a smaller scale of disks.

A key advantage of this book is a joint education in Hadoop and MapReduce. The point about running a Hadoop cluster is that often you then run MapReduce on it. Where Cloudera comes into play is in the simplification of the Hadoop administration. But not all the software described in the book is free. Cloudera has a Manager program in two versions - Standard, which is free, and Enterprise, which is not. Standard looks pretty good actually. But the text encourages you to carefully contemplate splurging on Enterprise. Arguing essentially that its extra features are worth the cost. You should read the text slowly to see if you concur. As an inducement, at least when the book was written, the Enterprise version has 60 days free use. This could still be valid when you read the book.
4.0 étoiles sur 5 An excellent resource for learning how to use Cloudera Manager to build and maintain a Hadoop cluster. 12 septembre 2014
Par Robert Rapplean - Publié sur
Format: Broché
This book provides an excellent overview of how to use the Cloudera Manager to build and maintain an industrial Hadoop cluster. I think its best audience is either someone who is already familiar with Hadoop and will need to start managing a Cloudera cluster, or someone who will mostly just be interacting with the Cloudera Manager interface while a primary system administrator handles the more complicated issues that revolve around unpredictable variations. It starts off with a relatively watered-down overview of the concepts behind Hadoop, but around Chapter 5 it really picks up and provides a great description of how to build and manage your cluster. The techniques provided in this book make use of Cloudera Manager wherever possible, as this is the preferred method of setting up and maintaining a Cloudera Hadoop cluster.

One of the strongest points is that it contributes to the amount of published knowledge around Hadoop 2, which has been slow to catch up with the release of the technology.

There are a few shortcomings that prevent me from giving it a full five stars. Since it focuses on the Cloudera Manager, it could leave a fledgeling admin in a bad place if things aren't all lined up just right. The education base of the target audience is a little narrow since its tone is aimed at informing, not teaching, so it excludes those who are not familiar with system administration and general. In the other direction, it doesn't provide the in-depth, "under the hood" details that heavy-weight system administrators enjoy wading through (but which would require a much thicker book).
3.0 étoiles sur 5 Good at the very basics, but leaves gaps. 7 novembre 2015
Par Sketchie - Publié sur
Format: Broché
While it is pretty good at the bulk of the Cloudera management suite, it does miss the mark on some key factors - specifically Kerberos. Granted, official Cloudera documentation is woefully inadequate, this book only brings up other issues with nothing to bridge the two. It gives simple instructions which are accurate (i.e. Hive, etc...), however it gives no detail as to how the given service works - which may not be the point of the book, but it would help.
1 internautes sur 1 ont trouvé ce commentaire utile 
4.0 étoiles sur 5 A very useful book for Cloudera administration. 10 octobre 2014
Par Ganapathy Kokkeshwara - Publié sur
Format: Format Kindle
This is a must read book for anyone who is in the process of learning the administration of Cloudera distribution of hadoop. Other than learning, the book can also be used for reference. In the first 2 chapters , the author talks about Apache Hadoop, its various components, HDFS and Mapreduce. These chapters provides a very informative introduction to Apache Hadoop and the ecosystem associated with it. These chapters are useful for anyone interested in learning hadoop technology let alone for Cloudera administrators.

In the 3rd chapter author talks about Cloudera distribution of Apache Hadoop and the other components that are distributed with Cloudera Hadoop Distribution ( CDH) such as Flume ,Sqoop, Pig, Hive, Zookeeper etc. The components are explained in simple terms that can be understood by most of the technical persons. I liked the 'screenshots' of the UI for each of the components that made it little bit easier to understand and comprehend. This chapter also covers the installation of CDH and the various components. I liked the fact that author covers the installation from Cloudera Manager as well as from Operating System’s package manager , thus providing more options for the administrator.

Rest of the chapters cover administrating the high availability , implementing security using Kerberos , managing cluster and monitoring. Chapter 9 talks about backing up the Hadoop cluster. I liked the fact that author took time to explain the various types of backup and storage media for backups before actually getting into technical nitty-gritty of back up of big data.

Throughout the book , author also writes briefly about the impact on administering the CDH when deployed on cloud and provides the relevant web links for further reference.
Ces commentaires ont-ils été utiles ? Dites-le-nous