reklama - zainteresowany?

Principles of Data Wrangling. Practical Techniques for Data Preparation - Helion

Principles of Data Wrangling. Practical Techniques for Data Preparation
ebook
Autor: Tye Rattenbury, Joseph M. Hellerstein, Jeffrey Heer
ISBN: 978-14-919-3887-4
stron: 94, Format: ebook
Data wydania: 2017-06-29
Księgarnia: Helion

Cena książki: 109,65 zł (poprzednio: 127,50 zł)
Oszczędzasz: 14% (-17,85 zł)

Dodaj do koszyka Principles of Data Wrangling. Practical Techniques for Data Preparation

Tagi: Bazy danych

A key task that any aspiring data-driven organization needs to learn is data wrangling, the process of converting raw data into something truly useful. This practical guide provides business analysts with an overview of various data wrangling techniques and tools, and puts the practice of data wrangling into context by asking, "What are you trying to do and why?"

Wrangling data consumes roughly 50-80% of an analyst’s time before any kind of analysis is possible. Written by key executives at Trifacta, this book walks you through the wrangling process by exploring several factors—time, granularity, scope, and structure—that you need to consider as you begin to work with data. You’ll learn a shared language and a comprehensive understanding of data wrangling, with an emphasis on recent agile analytic processes used by many of today’s data-driven organizations.

Appreciate the importance—and the satisfaction—of wrangling data the right way.

  • Understand what kind of data is available
  • Choose which data to use and at what level of detail
  • Meaningfully combine multiple sources of data
  • Decide how to distill the results to a size and shape that can drive downstream analysis

Dodaj do koszyka Principles of Data Wrangling. Practical Techniques for Data Preparation

 

Osoby które kupowały "Principles of Data Wrangling. Practical Techniques for Data Preparation", wybierały także:

  • Oracle Database 12c. Programowanie w jÄ™zyku PL/SQL
  • Bazy danych. Podstawy projektowania i jÄ™zyka SQL
  • Head First PHP & MySQL. Edycja polska
  • MySQL. Mechanizmy wewnÄ™trzne bazy danych
  • Metody i techniki odkrywania wiedzy. NarzÄ™dzia CAQDAS w procesie analizy danych jakoÅ›ciowych

Dodaj do koszyka Principles of Data Wrangling. Practical Techniques for Data Preparation

Spis treści

Principles of Data Wrangling. Practical Techniques for Data Preparation eBook -- spis treści

  • Foreword
  • 1. Introduction
    • Magic Thresholds, PYMK, and User Growth at Facebook
  • 2. A Data Workflow Framework
    • How Data Flows During and Across Projects
    • Connecting Analytic Actions to Data Movement: A Holistic Workflow Framework for Data Projects
    • Raw Data Stage Actions: Ingest Data and Create Metadata
      • Ingesting Known and Unknown Data
      • Creating Metadata
        • Structure
        • Granularity
        • Accuracy
        • Temporality
        • Scope
    • Refined Data Stage Actions: Create Canonical Data and Conduct Ad Hoc Analyses
      • Designing Refined Data
        • Addressing structural issues
        • Addressing granularity issues
        • Addressing accuracy issues
        • Addressing scope issues
      • Refined Stage Analytical Actions
    • Production Data Stage Actions: Create Production Data and Build Automated Systems
      • Creating Optimized Data
      • Designing Regular Reports and Automated Products/Services
    • Data Wrangling within the Workflow Framework
  • 3. The Dynamics of Data Wrangling
    • Data Wrangling Dynamics
      • Additional Aspects: Subsetting and Sampling
    • Core Transformation and Profiling Actions
    • Data Wrangling in the Workflow Framework
      • Ingesting Data
      • Describing Data
      • Assessing Data Utility
      • Designing and Building Refined Data
      • Ad Hoc Reporting
      • Exploratory Modeling and Forecasting
      • Building an Optimized Dataset
      • Regular Reporting and Building Data-Driven Products and Services
  • 4. Profiling
    • Overview of Profiling
    • Individual Value Profiling: Syntactic Profiling
    • Individual Value Profiling: Semantic Profiling
    • Set-Based Profiling
    • Profiling Individual Values in the Candidate Master File
      • Syntactic Profiling in the Candidate Master File
      • Set-Based Profiling in the Candidate Master File
  • 5. Transformation: Structuring
    • Overview of Structuring
    • Intrarecord Structuring: Extracting Values
      • Positional Extraction
      • Pattern Extraction
      • Complex Structure Extraction
    • Intrarecord Structuring: Combining Multiple Record Fields
    • Interrecord Structuring: Filtering Records and Fields
    • Interrecord Structuring: Aggregations and Pivots
      • Simple Aggregations
      • Column-to-Row Pivots
      • Row-to-Column Pivots
  • 6. Transformation: Enriching
    • Unions
    • Joins
    • Inserting Metadata
    • Derivation of Values
      • Generic
      • Proprietary
  • 7. Using Transformation to Clean Data
    • Addressing Missing/NULL Values
    • Addressing Invalid Values
  • 8. Roles and Responsibilities
    • Skills and Responsibilities
      • Data Engineer
      • Data Architect
      • Data Scientist
      • Analyst
    • Roles Across the Data Workflow Framework
    • Organizational Best Practices
  • 9. Data Wrangling Tools
    • Data Size and Infrastructure
    • Data Structures
      • Excel
      • SQL
      • Trifacta Wrangler
    • Transformation Paradigms
      • Excel
      • SQL
      • Trifacta Wrangler
    • Choosing a Data Wrangling Tool

Dodaj do koszyka Principles of Data Wrangling. Practical Techniques for Data Preparation

Code, Publish & WebDesing by CATALIST.com.pl



(c) 2005-2024 CATALIST agencja interaktywna, znaki firmowe należą do wydawnictwa Helion S.A.