Apache Solr Search Patterns (e-book)

Author: Jayant Kumar

Publisher: Packt Publishing

Publication date: 2015-04-24

Word count: 1.058 million

Category: Imported books > Foreign-language originals > Computers/Internet

This book is for developers who already know how to use Solr and are looking for advanced strategies to improve their search using Solr. It is also for people who work with analytics and use Solr to generate graphs and reports. Moreover, if you are a search architect looking to scale your search using Solr, this is a must-have book for you. Familiarity with the Java programming language is helpful.

Table of Contents

Credits

About the Author

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Solr Indexing Internals

The job site problem statement – Solr indexing fundamentals

Working of analyzers, tokenizers, and filters

Handling a multilingual search

Measuring the quality of search results

The e-commerce problem statement

The job site problem statement

Challenges of large-scale indexing

Using multiple threads for indexing on Solr

Using the Java binary format of data for indexing

Using the ConcurrentUpdateSolrServer class for indexing

Solr configuration changes that can improve indexing performance

Planning your commit strategy

Using better hardware

Distributed indexing

The SolrCloud solution

Summary

2. Customizing the Solr Scoring Algorithm

Relevance calculation

Building a custom scorer

Drawbacks of the TF-IDF model

The information gain model

Implementing the information gain model

Options to TF-IDF similarity

BM25 similarity

DFR similarity

Summary

3. Solr Internals and Custom Queries

Working of a scorer on an inverted index

Working of OR and AND clauses

The eDisMax query parser

Working of the eDisMax query parser

The minimum should match parameter

Working of filters

Using BRS queries instead of DisMax

Building a custom query parser

Proximity search using SWAN queries

Creating a parboiled parser

Building a Solr plugin for SWAN queries

Integrating the SWAN plugin in Solr

Summary

4. Solr for Big Data

Introduction to big data

Getting data points using facets

Field faceting

Query and range faceting

Radius faceting for location-based data

The geofilt filter

The bounding box filter

The rectangle filter

Distance function queries

Radius faceting

Data analysis using pivot faceting

Graphs for analytics

Getting started with Highcharts

Displaying Solr data using Highcharts

Summary

5. Solr in E-commerce

Designing an e-commerce search

Handling unclean data

Handling variations in the product

Sorting

Problems and solutions of flash sale searches

Faceting with the option of multi-select

Faceting with hierarchical taxonomy

Faceting with size

Implementing semantic search

Optimizations

Summary

6. Solr for Spatial Search

Features of spatial search

Java Topology Suite

Well-known Text

The Spatial4j library

Lucene 4 spatial module

SpatialRecursivePrefixTreeFieldType

BBoxField (to be introduced in Solr 4.10)

Indexing for spatial search

Searching and filtering on a spatial index

The bbox query

Distance sort and relevancy boost

Advanced concepts

Quadtree

Indexing data

Searching data

Geohash

Summary

7. Using Solr in an Advertising System

Ad system functionalities

Architecture of an ad distribution system

Requirements of an ad distribution system

Schema for a listing ad

Schema for targeted ads

Performance improvements

fieldCache

fieldValueCache

documentCache

filterCache

queryResultCache

Application cache

Garbage collection

Merging Solr with Redis

Summary

8. AJAX Solr

The purpose of AJAX Solr

The AJAX Solr architecture

The Manager controller

The ParameterStore model

Available parameters

Exposed parameters

Using the ParameterHashStore class

Extending the ParameterStore class

Widgets

Working with AJAX Solr

Talking to AJAX Solr

Displaying the result

Adding facets

Adding pagination

Adding a tag cloud

Performance tuning

Summary

9. SolrCloud

The SolrCloud architecture

Centralized configuration

Setting up SolrCloud

Test setup for SolrCloud

Setting up SolrCloud in production

Setting up the Zookeeper ensemble

Setting up Tomcat with Solr

Distributed indexing and search

Routing documents to a particular shard

Adding more nodes to the SolrCloud

Fault tolerance and high availability in SolrCloud

Advanced sharding with SolrCloud

Shard splitting

Deleting a shard

Moving the existing shard to a new node

Shard splitting based on split key

Asynchronous calls

Migrating documents to another collection

Sizing and monitoring of SolrCloud

Using SolrCloud as a NoSQL database

Summary

10. Text Tagging with Lucene FST

An overview of FST and text tagging

Implementation of FST in Lucene

Text tagging algorithms

Fuzzy string matching algorithm

The Levenshtein distance algorithm

Damerau–Levenshtein distance

Using Solr for text tagging

Implementing a text tagger using Solr

Summary

Index
