Learning Hadoop 2

By Garry Turkington and Gabriele Modena

Author : Garry Turkington, Gabriele Modena
Rating : 4.12 (557 Votes)
Asin : 1783285516
Format Type : paperback
Number of Pages : 316 Pages
Publish Date : 2014-05-27
Language : English

DESCRIPTION:

Garry Turkington has over 15 years of industry experience, most of which has been focused on the design and implementation of large-scale distributed systems. In his current role as the CTO at Improve Digital, he is primarily responsible for the realization of systems that store, process, and extract value from the company's large data volumes. He is the author of Hadoop Beginner's Guide, published by Packt Publishing in 201

He has BSc and PhD degrees in Computer Science from Queen's University Belfast in Northern Ireland, and a Master of Engineering degree in Systems Engineering from Stevens Institute of Technology in the USA. Prior to this, he spent a decade in various government positions in both the UK and the USA. Before joining Improve Digital, he spent time at , where he led several software development teams, building systems that process the catalog data for every item worldwide. Gabriele enjoys using statistical and computational methods to look for patterns in large amounts of data. He is the author of Hadoop Beginners G

Thorough and Practical Guide. This book is a thorough guide to Hadoop 2, and has a lot of detail packed into its 382 pages. As part of Packt's "learning" series, I was pleasantly surprised by the amount of depth here: the book covers a lot of essential material, including details of HDFS, a quick description of various cloud-based services that offer Hadoop, sentiment analysis (going significantly beyond the now quite tired canonical example of word counting in documents), YARN, Tez, ZooKeeper, Pig. It is really quite impressive and I think this would be helpful for anyone looking for a clear guide to Hadoop - not just people specificall.

"Highly recommended" according to nyceyes. This book is excellently written. Good grammar, sentence construction and completeness-of-thought were held in high regard while writing this book. And this is important because -- believe it or not -- not having that will lead to misinterpretation of the approaches and technologies being discussed. As for the technical material, the authors nicely walk you through the various Big Data technologies (Hadoop and a few non-Hadoop ones, too), as they might be appropriate to particular use-cases/patterns, such as batch analytics; real-time streaming; data logistics; schema enforcement and evolutio.

Good overview. Alexander Helf. When looking for a book about Hadoop one may find "Learning Hadoop 2". This is the successor of "Hadoop Beginner's Guide" from the same author and focuses on Hadoop version 2. Even without knowledge of the previous Hadoop version you get a quick overview of the history and the core features. The middle of the book contains some technology chapters (streaming, programming, SQL) which use the same example to show the different aspects. With basic Java know-how the code is easy to understand (but I did not execute the code). The main focus of the book is the developing part but with the l

Familiarity with Hadoop would be a plus.

What You Will Learn

- Write distributed applications using the MapReduce framework
- Go beyond MapReduce and process data in real time with Samza and iteratively with Spark
- Familiarize yourself with data mining approaches that work with very large datasets
- Prototype applications on a VM and deploy them to a local cluster or to a cloud infrastructure ( Web Services)
- Conduct batch and real time data analysis using SQL-like tools
- Build data processing flows using Apache Pig and see how it enables the easy incorporation of custom functionality
- Define and orchestrate complex workflows and pipelines with Apache Oozie
- Manage your data lifecycle and changes over time

In Detail

This book introduces you to the world of building data-processing applications with the wide variety of tools supported by Hadoop 2. The last part of this book discusses the likely future direction of major Hadoop components and how to get involved with the Hadoop community. Starting with the core components of the framework, HDFS and YARN, this book will guide you through how to build applications using a variety of approaches. You will learn how YARN completely changes the relationship between MapReduce and Hadoop and allows the latter to support more varied processing approaches and a broader array of applications. These include real
