Optimizing Hadoop for MapReduce. This book is the perfect introduction to sophisticated concepts in MapReduce and will ensure you have the knowledge to optimize job performance. This is not an academic treatise; it’s an example-driven tutorial for - Helion
ebook
Author: Khaled Tannir
Original title: Optimizing Hadoop for MapReduce. This book is the perfect introduction to sophisticated concepts in MapReduce and will ensure you have the knowledge to optimize job performance. This is not an academic treatise; it’s an example-driven tutorial for
ISBN: 9781783285662
Pages: 120, Format: ebook
Publication date: 2014-02-21
Bookstore: Helion
Book price: 109,00 zł
Table of contents
Optimizing Hadoop for MapReduce. This book is the perfect introduction to sophisticated concepts in MapReduce and will ensure you have the knowledge to optimize job performance. This is not an academic treatise; it’s an example-driven tutorial for the real world (eBook): table of contents
- Optimizing Hadoop for MapReduce
- Table of Contents
- Optimizing Hadoop for MapReduce
- Credits
- About the Author
- Acknowledgments
- About the Reviewers
- www.PacktPub.com
- Support files, eBooks, discount offers and more
- Why Subscribe?
- Free Access for Packt account holders
- Preface
- What this book covers
- What you need for this book
- Who this book is for
- Conventions
- Reader feedback
- Customer support
- Errata
- Piracy
- Questions
- 1. Understanding Hadoop MapReduce
- The MapReduce model
- An overview of Hadoop MapReduce
- Hadoop MapReduce internals
- Factors affecting the performance of MapReduce
- Summary
- 2. An Overview of the Hadoop Parameters
- Investigating the Hadoop parameters
- The mapred-site.xml configuration file
- The CPU-related parameters
- The disk I/O related parameters
- The memory-related parameters
- The network-related parameters
- The hdfs-site.xml configuration file
- The core-site.xml configuration file
- Hadoop MapReduce metrics
- Performance monitoring tools
- Using Chukwa to monitor Hadoop
- Using Ganglia to monitor Hadoop
- Using Nagios to monitor Hadoop
- Using Apache Ambari to monitor Hadoop
- Summary
- 3. Detecting System Bottlenecks
- Performance tuning
- Creating a performance baseline
- Identifying resource bottlenecks
- Identifying RAM bottlenecks
- Identifying CPU bottlenecks
- Identifying storage bottlenecks
- Identifying network bandwidth bottlenecks
- Summary
- 4. Identifying Resource Weaknesses
- Identifying cluster weakness
- Checking the Hadoop cluster nodes health
- Checking the input data size
- Checking massive I/O and network traffic
- Checking for insufficient concurrent tasks
- Checking for CPU contention
- Sizing your Hadoop cluster
- Configuring your cluster correctly
- Summary
- 5. Enhancing Map and Reduce Tasks
- Enhancing map tasks
- Input data and block size impact
- Dealing with small and unsplittable files
- Reducing spilled records during the Map phase
- Calculating map tasks' throughput
- Enhancing reduce tasks
- Calculating reduce tasks' throughput
- Improving Reduce execution phase
- Tuning map and reduce parameters
- Summary
- 6. Optimizing MapReduce Tasks
- Using Combiners
- Using compression
- Using appropriate Writable types
- Reusing types smartly
- Optimizing mappers and reducers code
- Summary
- 7. Best Practices and Recommendations
- Hardware tuning and OS recommendations
- The Hadoop cluster checklist
- The Bios tuning checklist
- OS configuration recommendations
- Hadoop best practices and recommendations
- Deploying Hadoop
- Hadoop tuning recommendations
- Using a MapReduce template class code
- Summary
- Index