Average Reviews:
(More customer reviews)The reason why I say this book's still a Good Buy is because Jason Venner has used Hadoop in several scenarios, and this book contains a lot of practical and time-saving tips on what mistakes to avoid or how to troubleshoot problems, making it an especially good book for Hadoop newbies. His materials on Testing and Debugging MapReduce Applications are also a value-add.
Chapter One provides detailed instructions on how to install Hadoop and how to run a test to verify that everything went fine. The author mentions that Hadoop 0.19 works best with Sun's JDK 1.6 and that although Hadoop will work on Windows with Cygwin installed, you have to be careful when specifying file paths.
Chapters Two and Three introduce basic concepts pertaining to MapReduce Jobs and Multimachine Clusters, respectively, and how "master" and "slave" nodes are configured. Chapter Four teaches you how to install, configure, and troubleshoot Hadoop Distributed File System.
Chapters Five and Six provide tutorials on the different types of inputs and outputs that a Hadoop MapReduce job can handle, and how to tune MapReduce jobs.
Chapter Seven is an excellent tutorial on how to unit test and debug MapReduce jobs, while Chapter Eight discusses more advanced MapReduce techniques for addressing more complex application requirements.
Chapter Nine walks you through the evolution of a (somewhat boring) real-world application, discussing rationales behind design changes, etc. Chapter 10 provides a few descriptive paragraphs each for various projects related to Hadoop (e.g., Pig, HBase, Mahout, ZooKeeper,etc). Finally, Appendix A is a detailed discussion of the JobConf API, JobConf being the object that controls information relating to a MapReduce job.
Click Here to see more reviews about: Pro Hadoop (Expert's Voice in Open Source)
It's a very safe bet that cloud computing interest increases and in fact, a near certainty. A recent estimate from Merrill Lynch suggested a $95 billion annual market by 2013. Hadoop is at the center of cloud computing: it is one of the most searched-for, documented, and prevalent form of cloud computing data access, and even in its pre-final form is already rich enough to support a consulting business model. Anyone wishing to investigate an enterprise-level cloud computing solution will need a Hadoop book to at least investigate the possibilities.Hadoop is one of the tools that's driving today's developers to build tomorrow's Software as a Service (SaaS)-based and driven Internet applications, invested in by Yahoo, Microsoft, Google, Amazon and more. With Hadoop, developers can build tomorrow's data centers, or the next 'Microsoft Office" or 'Google Apps" - applications that are fully hosted on the Web, not the desktop - and much more.Pro Hadoop will be the first to market with a professional guide and reference to getting up to speed with using, developing and working with Hadoop, an open source Java-based cloud computing framework and platform backed by Yahoo. This book will likely time with Hadoop 1.0 release in June 2009, also around JavaOne, the world's largest Java conference at around 15,000 attendees on average.
Click here for more information about Pro Hadoop (Expert's Voice in Open Source)
0 comments:
Post a Comment