Mastering Apache Pulsar - Helion
ISBN: 9781492084853
stron: 242, Format: ebook
Data wydania: 2021-12-06
Księgarnia: Helion
Cena książki: 29,90 zł (poprzednio: 249,17 zł)
Oszczędzasz: 88% (-219,27 zł)
Every enterprise application creates data, including log messages, metrics, user activity, and outgoing messages. Learning how to move these items is almost as important as the data itself. If you're an application architect, developer, or production engineer new to Apache Pulsar, this practical guide shows you how to use this open source event streaming platform to handle real-time data feeds.
Jowanza Joseph, staff software engineer at Finicity, explains how to deploy production Pulsar clusters, write reliable event streaming applications, and build scalable real-time data pipelines with this platform. Through detailed examples, you'll learn Pulsar's design principles, reliability guarantees, key APIs, and architecture details, including the replication protocol, the load manager, and the storage layer.
This book helps you:
Osoby które kupowały "Mastering Apache Pulsar", wybierały także:
- Cisco CCNA 200-301. Kurs video. Administrowanie bezpieczeństwem sieci. Część 3 665,00 zł, (39,90 zł -94%)
- Cisco CCNA 200-301. Kurs video. Administrowanie urządzeniami Cisco. Część 2 665,00 zł, (39,90 zł -94%)
- Cisco CCNA 200-301. Kurs video. Podstawy sieci komputerowych i konfiguracji. Część 1 665,00 zł, (39,90 zł -94%)
- Impact of P2P and Free Distribution on Book Sales 427,14 zł, (29,90 zł -93%)
- Cisco CCNP Enterprise 350-401 ENCOR. Kurs video. Programowanie i automatyzacja sieci 443,33 zł, (39,90 zł -91%)
Spis treści
Mastering Apache Pulsar eBook -- spis treści
- Preface
- Why I Wrote This Book
- Who This Book Is For
- How I Organized This Book
- Conventions Used in This Book
- Using Code Examples
- OReilly Online Learning
- How to Contact Us
- Acknowledgments
- 1. The Value of Real-Time Messaging
- Data in Motion
- Resource Efficiency
- Interesting Applications
- Banking
- Medical
- Security
- Internet of Things
- Summary
- 2. Event Streams and Event Brokers
- Publish/Subscribe
- Queues
- Failure Modes
- Push Versus Poll
- The Need for Pulsar
- Unification
- Modularity
- Performance
- Summary
- 3. Pulsar
- Origins of Pulsar
- Pulsar Design Principles
- Multitenancy
- Geo-Replication
- Performance
- Modularity
- Pulsar Ecosystem
- Pulsar Functions
- Pulsar IO
- Pulsar SQL
- Pulsar Success Stories
- Yahoo! JAPAN
- Splunk
- Iterable
- Summary
- 4. Pulsar Internals
- Brokers
- Message Cache
- BookKeeper and ZooKeeper Communication
- Schema Validation
- Inter-Broker Communication
- Pulsar Functions and Pulsar IO
- Apache BookKeeper
- Write-Ahead Logging
- Message Storing
- Object/Blob Storage
- Pravega
- Majordodo
- Apache ZooKeeper
- Naming Service
- Configuration Management
- Leader Election
- Notification System
- Apache Kafka
- Apache Druid
- Pulsar Proxy
- Java Virtual Machine (JVM)
- Netty
- Apache Spark
- Apache Lucene
- Summary
- Brokers
- 5. Consumers
- What Does It Mean to Be a Consumer?
- Subscriptions
- Exclusive
- Shared
- Key_Shared
- Failover
- Acknowledgments
- Individual Ack
- Cumulative Ack
- Schemas
- Consumer Schema Management
- Consumption Modes
- Batching
- Chunking
- Advanced Configuration
- Delayed Messages
- Retention Policy
- Backlog Quota
- Configuring a Consumer
- Replay
- Dead Letter Topics
- Retry Letter Topics
- Summary
- 6. Producers
- Synchronous Producers
- Asynchronous Producers
- Producer Routing
- Round-Robin Routing
- Single Partition Routing
- Custom Partition RoutingÂ
- Producer Configuration
- topicName
- producerName
- sendTimeoutMs
- blockIfQueueFull
- maxPendingMessages
- maxPendingMessagesAcrossPartitions
- messageRoutingMode
- hashingScheme
- cryptoFailureAction
- batchingMaxPublishDelayMicros
- batchingMaxMessages
- batchingEnabled
- compressionType
- Schema on Write
- Using the Schema Registry
- Nonpersistent Topics
- Use Cases
- Using Nonpersistent Topics
- Transactions
- Summary
- 7. Pulsar IO
- Pulsar IO Architecture
- Runtime
- Performance Considerations
- Use Cases
- Simple Event Processing Pipelines
- Change Data Capture
- Considerations
- Message Serialization
- Pipeline Stability
- Failure Handling
- Examples
- Elasticsearch
- Netty
- Writing Your Connector
- TimescaleDB
- Summary
- Pulsar IO Architecture
- 8. Pulsar Functions
- Stream Processing
- Pulsar Functions Architecture
- Runtime
- Isolation
- Isolation with Kubernetes Function Deployments
- Use Cases
- Creating Pulsar Functions
- Simple Event Processing
- Topic Hygiene
- Sensitivity scrubbing
- Language translation
- Schema normalization
- Topic Accounting
- Incrementing counts
- Getting and setting state
- Summary
- 9. Tiered Storage
- Storing Data in the Cloud
- Object Storage
- Use Cases
- Replication
- CQRS
- Disaster Recovery
- Offloading Data
- Pulsar Offloaders
- Amazon S3
- Google Cloud Storage
- Pulsar Offloaders
- Retrieving Offloaded Data
- Interacting with Object Store Data
- Repopulating Topics
- Utilizing Pulsar Client
- Summary
- Storing Data in the Cloud
- 10. Pulsar SQL
- Streams as Tables
- SQL-on-Anything Engines
- Apache Flink: An Alternative Perspective
- Presto/Trino
- How Pulsar SQL Works
- Configuring Pulsar SQL
- Performance Considerations
- Summary
- 11. Deploying Pulsar
- Docker
- Bare Metal
- Minimum Requirements
- Getting Started
- Deploying ZooKeeper
- Starting BookKeeper
- Starting Pulsar
- Public Cloud Providers
- AWS
- Azure
- Google Cloud Platform
- Kubernetes
- Summary
- 12. Operating Pulsar
- Apache BookKeeper Metrics
- Server Metrics
- Journal Metrics
- Storage Metrics
- Apache ZooKeeper Metrics
- Server Metrics
- Request Metrics
- Topic Metrics
- Consumer Metrics
- Pulsar Transaction Metrics
- Pulsar Function Metrics
- Advanced Operating Techniques
- Interceptors and Tracing
- Pulsar SQL Metrics
- Metrics Forwarding
- Dashboards
- Summary
- Apache BookKeeper Metrics
- 13. The Future
- Programming Language Support
- Extension Interface
- Enhancements to Pulsar Functions
- Watermarks
- Windowing
- Complex event processing
- WebAssembly (WASM)
- Architectural Simplification/Expansion
- Metadata management
- Log expansion
- Messaging Platform Bridges
- NATS
- AMQP 1.0
- Summary
- Programming Language Support
- A. Pulsar Admin API
- Use Cases
- Examples
- Creating a Partitioned Topic
- Deleting a Partitioned Topic
- Creating a Namespace with Specific Policies
- Deleting a Namespace
- Summary
- B. Pulsar Admin CLI
- CLI API
- Examples
- Creating a Partitioned Topic
- Creating a Pulsar IO Source
- Creating a Pulsar IO Sink
- Uploading a Schema
- Deleting a Schema
- Creating a Namespace
- Deleting a Namespace
- Summary
- C. Geo-Replication
- Synchronous Replication
- Asynchronous Replication
- Replication Patterns
- Mesh
- Aggregation
- Standby
- Admin- and Producer-Level Control
- Summary
- D. Security, Authentication, and Authorization in Pulsar
- Encryption in Transit
- Encryption at Rest
- Authentication
- Authorization
- Summary
- Index