Ferret - Helion

ebook

Author: David Balmain
ISBN: 978-05-965-5471-2
Pages: 112, Format: ebook
Publication date: 2008-03-21
Bookstore: Helion

Price: 79,90 zł

With the introduction of Ferret, Ruby users now have one of the fastest and most flexible search libraries available. And it's surprisingly easy to use.

This book will show you how to quickly get up and running with Ferret. You'll learn how to index different document types such as PDF, Microsoft Word, and HTML, as well as how to deal with foreign languages and different character encodings. The book describes the Ferret Query Language in detail, along with the object-oriented approach to building queries.

You will also be introduced to sorting, filtering, and highlighting your search results, with an explanation of exactly how you need to set up your index to perform these tasks. Finally, you will learn how to optimize a Ferret index for lightning-fast indexing and split-second query results.

Table of Contents

Ferret
- SPECIAL OFFER: Upgrade this ebook with O'Reilly
- Preface
  - Conventions Used in This Book
  - Using Code Examples
  - Safari Enabled
  - How to Contact Us
- 1. Getting Started
  - Installing Ferret
  - A Quick Example: Indexing the Filesystem
  - Summary
- 2. Indexing
  - Index Storage
  - Documents, Fields, and Boosts
    - Documents
    - Fields
    - Boosts
  - Setting Up the Index
    - FieldInfo
      - :store
      - :index
      - :term_vector
    - FieldInfos
  - Basic Indexing Operations
    - Add
    - Get
    - Delete
    - Update
  - Indexing Non-String Datatypes
    - Number Fields
    - Date Fields
    - Sort Fields
  - Summary
- 3. Advanced Indexing
  - How the Indexing Process Works
  - Tuning Indexing Performance
    - In-Memory Indexing
    - Indexing Parameters
      - :max_buffer_memory and :chunk_size
      - :merge_factor
      - :max_buffered_docs
      - :max_merged_docs
      - :max_field_length
      - :use_compound_file
      - :index_skip_interval
      - :doc_skip_interval
      - Indexing parameter testing
    - Parallel Indexing
  - Optimizing the Index
  - Index Locking and Concurrency Issues
    - Multithreaded Environment
    - Multiprocess Environment
  - Summary
- 4. Search
  - Overview of Searching Classes
    - IndexSearcher
    - Query
    - QueryParser
    - Filter
    - Sort
  - Building Queries
    - TermQuery
    - BooleanQuery
    - PhraseQuery
    - RangeQuery
    - MultiTermQuery
    - PrefixQuery
    - WildcardQuery
    - FuzzyQuery
    - MatchAllQuery
    - ConstantScoreQuery
    - FilteredQuery
    - Span Queries
      - SpanTermQuery
      - SpanFirstQuery
      - SpanOrQuery
      - SpanNotQuery
      - SpanNearQuery
    - Boosting Queries
  - QueryParser
    - Setting Up the QueryParser
    - Ferret Query Language
      - TermQuery
      - BooleanQuery
      - PhraseQuery
      - RangeQuery
      - WildcardQuery
      - FuzzyQuery
      - Boosting a query in FQL
  - Filtering Search Results
    - Using the RangeFilter
    - Using the QueryFilter
    - Writing Your Own Filter
    - :filter_proc, the New Filter
  - Sorting Search Results
    - SortField
    - Sort
    - Sorting by Date
  - Highlighting Query Results
  - Summary
- 5. Analysis
  - Token
  - TokenStream
    - Tokenizer
      - WhiteSpaceTokenizer
      - LetterTokenizer
      - StandardTokenizer
      - RegExpTokenizer
    - TokenFilter
      - LowerCaseFilter
      - StopFilter
      - StemFilter
      - HyphenFilter
  - Analyzer
    - StandardAnalyzer
    - PerFieldAnalyzer
  - Custom Analysis
- 6. Ferret in Practice
  - Indexing Multiple Document Types
    - TextReader
    - HtmlReader
    - OOoReader (OpenOffice.org Reader)
    - JpegReader
    - Mp3Reader
    - PdfReader
  - Other Indexing Improvements
  - Search Improvements
  - Putting It All Together
  - Summary
- Index
- About the Author
- Colophon
- SPECIAL OFFER: Upgrade this ebook with O'Reilly