reklama - zainteresowany?

Getting Started with Kudu. Perform Fast Analytics on Fast Data - Helion

Getting Started with Kudu. Perform Fast Analytics on Fast Data
ebook
Autor: Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland
ISBN: 978-14-919-8020-0
stron: 156, Format: ebook
Data wydania: 2018-07-09
Księgarnia: Helion

Cena książki: 152,15 zł (poprzednio: 176,92 zł)
Oszczędzasz: 14% (-24,77 zł)

Dodaj do koszyka Getting Started with Kudu. Perform Fast Analytics on Fast Data

Tagi: Analiza danych

Fast data ingestion, serving, and analytics in the Hadoop ecosystem have forced developers and architects to choose solutions using the least common denominator—either fast analytics at the cost of slow data ingestion or fast data ingestion at the cost of slow analytics. There is an answer to this problem. With the Apache Kudu column-oriented data store, you can easily perform fast analytics on fast data. This practical guide shows you how.

Begun as an internal project at Cloudera, Kudu is an open source solution compatible with many data processing frameworks in the Hadoop environment. In this book, current and former solutions professionals from Cloudera provide use cases, examples, best practices, and sample code to help you get up to speed with Kudu.

  • Explore Kudu’s high-level design, including how it spreads data across servers
  • Fully administer a Kudu cluster, enable security, and add or remove nodes
  • Learn Kudu’s client-side APIs, including how to integrate Apache Impala, Spark, and other frameworks for data manipulation
  • Examine Kudu’s schema design, including basic concepts and primitives necessary to make your project successful
  • Explore case studies for using Kudu for real-time IoT analytics, predictive modeling, and in combination with another storage engine

Dodaj do koszyka Getting Started with Kudu. Perform Fast Analytics on Fast Data

 

Osoby które kupowały "Getting Started with Kudu. Perform Fast Analytics on Fast Data", wybierały także:

  • NLP. Kurs video. Analiza danych tekstowych w j
  • Web scraping. Kurs video. Zautomatyzowane pozyskiwanie danych z sieci
  • Data Science w Pythonie. Kurs video. Algorytmy uczenia maszynowego
  • Microsoft Excel. Kurs video. Wykresy i wizualizacja danych
  • Data Science w Pythonie. Kurs video. Przetwarzanie i analiza danych

Dodaj do koszyka Getting Started with Kudu. Perform Fast Analytics on Fast Data

Spis treści

Getting Started with Kudu. Perform Fast Analytics on Fast Data eBook -- spis treści

  • Preface
    • Conventions Used in This Book
    • Using Code Examples
    • OReilly Safari
    • How to Contact Us
    • Acknowledgments
  • 1. Why Kudu?
    • Why Does Kudu Matter?
    • Simplicity Drives Adoption
    • New Use Cases
      • IoT
      • Current Approaches to Real-Time Analytics
        • Iteration 1: Hadoop Distributed File System
        • Iteration 2: HDFS + Compactions
        • Iteration 3: HBase + HDFS
      • Real-Time Processing
    • Hardware Landscape
    • Kudus Unique Place in the Big Data Ecosystem
      • Comparing Kudu with Other Ecosystem Components
      • Big DataHDFS, HBase, Cassandra
    • Conclusion
  • 2. About Kudu
    • Kudu High-Level Design
      • Kudu Roles
      • Master Server
      • Tablet Server
        • Storage
          • Columnar format
          • File layout and compactions
    • Kudu Concepts and Mechanisms
      • Hotspotting
      • Partitioning
        • Range partitioning
        • Hash partitioning
  • 3. Getting Up and Running
    • Installation
      • Apache Kudu Quickstart VM
      • Using Cloudera Manager
      • Building from Source
      • Packages
      • Cloudera Quickstart VM
    • Quick Install: Three Minutes or Less
    • Conclusion
  • 4. Kudu Administration
    • Planning for Kudu
      • Master and Tablet Servers
      • Write-Ahead Log
      • Data Servers and Storage
      • Replication Strategies
    • Deployment Considerations: New or Existing Clusters?
      • New Kudu-Only Cluster
      • New Hadoop Cluster with Kudu
      • Add Kudu to Existing Hadoop Cluster
    • Web UI of Tablet and Master Servers
      • Master Server UI and Tablet Server UI
      • Master Server UI
      • Tablet Server UI
    • The Kudu Command-Line Interface
      • Cluster
      • Filesystem
        • check
        • format
        • dump
      • Tablet Replica
        • Copy a remote replica to a local server
        • Deleting a replica
      • Consensus Metadata
    • Adding and Removing Tablet Servers
      • Adding Tablet Servers
      • Removing a Tablet Server
    • Security
      • A Simple Analogy
      • Kudu Security Features
        • Encryption over the wire
        • Data-at-rest encryption
        • Kerberos authentication
        • User authorization
        • Log redaction
        • Web UI security
    • Basic Performance Tuning
      • Kudu Memory Limits
      • Maintenance Manager Threads
      • Monitoring Performance
    • Getting Ahead and Staying Out of Trouble
      • Avoid Running Out of Disk Space
      • Disk Failures Tolerance
      • Backup
    • Conclusion
  • 5. Common Developer Tasks for Kudu
    • Client API
      • Kudu Client
      • Kudu Table
      • Kudu DDL
      • Kudu Scanner Read Modes
    • C++ API
    • Python API
      • Preparing the Python Development Environment
      • Python Kudu Application
    • Java
      • Java Application
    • Spark
    • Impala with Kudu
  • 6. Table and Schema Design
    • Schema Design Basics
    • Schema for Hybrid Transactional/Analytical Processing
      • Lambda Architecture
      • OLTP/OLAP Split
    • Primary Key and Column Design
      • Other Column Schema Considerations
    • Partitioning Basics
      • Range Partitioning
      • Hash Partitioning
    • Schema Alteration
    • Best Practices and Tips
      • Partitioning
      • Large Objects
      • decimal
      • Unique Strings
      • Compression
      • Object Names
      • Number of Columns
      • Binary Types
    • Network Packet Example
    • Conclusion
  • 7. Kudu Use Cases
    • Real-Time Internet of Things Analytics
    • Predictive Modeling
    • Mixed Platforms Solution
  • Index

Dodaj do koszyka Getting Started with Kudu. Perform Fast Analytics on Fast Data

Code, Publish & WebDesing by CATALIST.com.pl



(c) 2005-2024 CATALIST agencja interaktywna, znaki firmowe należą do wydawnictwa Helion S.A.