Ferret - Helion
ISBN: 978-05-965-5471-2
Pages: 112, Format: ebook
Publication date: 2008-03-21
Bookstore: Helion
Price: 89.90 zł
With the introduction of Ferret, Ruby users now have one of the fastest and most flexible search libraries available. And it's surprisingly easy to use.
This book will show you how to quickly get up and running with Ferret. You'll learn how to index different document types such as PDF, Microsoft Word, and HTML, as well as how to deal with foreign languages and different character encodings. The book describes the Ferret Query Language in detail, along with the object-oriented approach to building queries.
The book also covers sorting, filtering, and highlighting search results, and explains exactly how to set up your index to support these tasks. Finally, you will learn how to optimize a Ferret index for lightning-fast indexing and split-second query results.
Table of Contents
- Ferret
- SPECIAL OFFER: Upgrade this ebook with O'Reilly
- Preface
- Conventions Used in This Book
- Using Code Examples
- Safari Enabled
- How to Contact Us
- 1. Getting Started
- Installing Ferret
- A Quick Example: Indexing the Filesystem
- Summary
- 2. Indexing
- Index Storage
- Documents, Fields, and Boosts
- Documents
- Fields
- Boosts
- Setting Up the Index
- FieldInfo
- :store
- :index
- :term_vector
- FieldInfos
- Basic Indexing Operations
- Add
- Get
- Delete
- Update
- Indexing Non-String Datatypes
- Number Fields
- Date Fields
- Sort Fields
- Summary
- 3. Advanced Indexing
- How the Indexing Process Works
- Tuning Indexing Performance
- In-Memory Indexing
- Indexing Parameters
- :max_buffer_memory and :chunk_size
- :merge_factor
- :max_buffered_docs
- :max_merged_docs
- :max_field_length
- :use_compound_file
- :index_skip_interval
- :doc_skip_interval
- Indexing parameter testing
- Parallel Indexing
- Optimizing the Index
- Index Locking and Concurrency Issues
- Multithreaded Environment
- Multiprocess Environment
- Summary
- 4. Search
- Overview of Searching Classes
- IndexSearcher
- Query
- QueryParser
- Filter
- Sort
- Building Queries
- TermQuery
- BooleanQuery
- PhraseQuery
- RangeQuery
- MultiTermQuery
- PrefixQuery
- WildcardQuery
- FuzzyQuery
- MatchAllQuery
- ConstantScoreQuery
- FilteredQuery
- Span Queries
- SpanTermQuery
- SpanFirstQuery
- SpanOrQuery
- SpanNotQuery
- SpanNearQuery
- Boosting Queries
- QueryParser
- Setting Up the QueryParser
- Ferret Query Language
- TermQuery
- BooleanQuery
- PhraseQuery
- RangeQuery
- WildcardQuery
- FuzzyQuery
- Boosting a query in FQL
- Filtering Search Results
- Using the RangeFilter
- Using the QueryFilter
- Writing Your Own Filter
- :filter_proc, the New Filter
- Sorting Search Results
- SortField
- Sort
- Sorting by Date
- Highlighting Query Results
- Summary
- 5. Analysis
- Token
- TokenStream
- Tokenizer
- WhiteSpaceTokenizer
- LetterTokenizer
- StandardTokenizer
- RegExpTokenizer
- TokenFilter
- LowerCaseFilter
- StopFilter
- StemFilter
- HyphenFilter
- Analyzer
- StandardAnalyzer
- PerFieldAnalyzer
- Custom Analysis
- 6. Ferret in Practice
- Indexing Multiple Document Types
- TextReader
- HtmlReader
- OOoReader (OpenOffice.org Reader)
- JpegReader
- Mp3Reader
- PdfReader
- Other Indexing Improvements
- Search Improvements
- Putting It All Together
- Summary
- Index
- About the Author
- Colophon
- SPECIAL OFFER: Upgrade this ebook with O'Reilly