万本电子书0元读

万本电子书0元读

顶部广告

Apache Solr 3.1 Cookbook电子书

售       价:¥

30人正在读 | 0人评论 9.8

作       者:Rafal Kuc

出  版  社:Packt Publishing

出版时间:2011-07-22

字       数:214.6万

所属分类: 进口书 > 外文原版书 > 电脑/网络

温馨提示:此类商品不支持退换货,不支持下载打印

为你推荐

  • 读书简介
  • 目录
  • 累计评论(0条)
  • 读书简介
  • 目录
  • 累计评论(0条)
This book is part of Packt's Cookbook series; each chapter looks at a different aspect of working with Apache Solr. The recipes deal with common problems of working with Solr by using easy-to-understand, real-life examples. The book is not in any way a complete Apache Solr reference and you should see it as a helping hand when things get rough on your journey with Apache Solr. Developers who are working with Apache Solr and would like to know how to combat common problems will find this book of great use. Knowledge of Apache Lucene would be a bonus but is not required.
目录展开

Apache Solr 3.1 Cookbook

Table of Contents

Apache Solr 3.1 Cookbook

Credits

About the Author

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why Subscribe?

Free Access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Apache Solr Configuration

Introduction

Running Solr on Jetty

Getting ready

How to do it...

How it works...

There's more...

I want Jetty to run on a different port

Buffer size is too small

Running Solr on Apache Tomcat

Getting ready

How to do it...

How it works...

There's more...

Changing the port on which we see Solr running on Tomcat

Using the Suggester component

How to do it...

How it works...

There's more...

Suggestions from a static dictionary

Rebuilding the suggestion word base after commit

Removing uncommon words from suggestions

See also

Handling multiple languages in a single index

How to do it...

How it works...

See also

Indexing fields in a dynamic way

How to do it...

How it works...

See also

Making multilingual data searchable with multicore deployment

How to do it...

How it works...

There's more...

More information about core admin interface

See also

Solr cache configuration

Getting ready

How to do it...

How it works...

Filter cache

Query result cache

Document cache

Query result window

There's more...

Using filter cache with faceting

When we have no cache hits

When we have more "puts" than "gets"

See also

How to fetch and index web pages

Getting ready

How to do it...

How it works...

There's more...

Multiple thread crawling

See also

Getting the most relevant results with early query termination

Getting ready

How to do it...

How it works...

See also

How to set up Extracting Request Handler

How to do it...

How it works...

See also

2. Indexing your Data

Introduction

Indexing data in CSV format

How to do it...

How it works...

There's more...

Splitting encapsulated data

Indexing data in XML format

How to do it...

How it works...

Indexing data in JSON format

How to do it...

How it works...

Indexing PDF files

Getting ready

How to do it...

How it works...

Indexing Microsoft Office files

Getting ready

How to do it...

How it works...

See also

Extracting metadata from binary files

Getting ready

How to do it...

How it works...

See also

How to properly configure Data Import Handler with JDBC

How to do it...

How it works...

There's more...

How to change the default behavior of deleting index contents at the beginning of a full import

Indexing data from a database using Data Import Handler

Getting ready

How to do it...

How it works...

How to import data using Data Import Handler and delta query

Getting ready

How to do it...

How it works...

How to use Data Import Handler with URL Data Source

Getting ready

How to do it...

How it works...

How to modify data while importing with Data Import Handler

Getting ready

How to do it...

How it works...

There's more...

Using scripts other than JavaScript

3. Analyzing your Text Data

Introduction

Storing additional information using payloads

How to do it...

How it works...

Eliminating XML and HTML tags from the text

How to do it...

How it works...

Copying the contents of one field to another

How to do it...

How it works...

There's more...

Copying the contents of dynamic fields to one field

Limiting the number of characters copied

Changing words to other words

How to do it...

How it works...

There's more...

Equivalent synonyms setup

Splitting text by camel case

How to do it...

How it works...

Splitting text by whitespace only

How to do it...

How it works...

Making plural words singular, but without stemming

How to do it...

How it works...

There's more...

Lowercasing the whole string

How to do it...

How it works...

Storing geographical points in the index

How to do it...

How it works...

There's more...

Stemming your data

How to do it...

How it works...

There's more...

Alternative English stemmer

Stemming other languages

Preparing text to do efficient trailing wildcard search

How to do it...

How it works...

There's more...

See also

Splitting text by numbers and non-white space characters

How to do it...

How it works...

There's more...

See also

4. Solr Administration

Introduction

Monitoring Solr via JMX

How to do it...

How it works...

There's more...

Connecting to an existing JMX agent

Connecting to an existing MBean server

Running a remote JXM server

How to check the cache status

How to do it...

How it works...

See also

How to check how the data type or field behave

How to do it...

How it works...

How to check Solr query handler usage

How to do it...

How it works...

How to check Solr update handler usage

How to do it...

How it works...

How to change Solr instance logging configuration

How to do it...

How it works...

How to check the Java based replication status

Getting ready

How to do it...

How it works...

How to check the script based replication status

Getting ready

How to do it...

How it works...

Setting up a Java based index replication

How to do it...

How it works...

There's more...

Slave and HTTP Basic authorization

Changing the configuration file names when replicating

See also

Setting up script based replication

Getting ready

How to do it...

How it works...

See also

How to manage Java based replication status using HTTP commands

Getting ready

How to do it...

How it works...

There's more...

Aborting index fetching

Disabling replication

Enabling replication

How to analyze your index structure

How to do it...

How it works...

5. Querying Solr

Introduction

Asking for a particular field value

How to do it...

How it works...

There's more...

Querying for a particular value using dismax query parser

Querying for multiple values in the same field

Sorting results by a field value

How to do it...

How it works...

Choosing a different query parser

How to do it...

How it works...

How to search for a phrase, not a single word

How to do it...

How it works...

There's more...

Defining the distance between words in a phrase

Boosting phrases over words

Getting ready

How to do it...

How it works...

There's more...

Boosting phrases with standard query parser

Positioning some documents over others on a query

How to do it...

How it works...

There's more...

Excluding documents with QueryElevationComponent

Positioning documents with words closer to each other first

How to do it...

How it works...

There's more...

Phrase boosting using standard query parser

Sorting results by a distance from a point

How to do it...

How it works...

See also

Getting documents with only a partial match

Getting ready

How to do it...

How it works...

Affecting scoring with function

How to do it...

How it works...

See also

Nesting queries

How to do it...

How it works...

6. Using Faceting Mechanism

Introduction

Getting the number of documents with the same field value

How to do it...

How it works...

There's more...

How to show facets with counts greater than zero

Lexicographical sorting of the faceting results

Getting the number of documents with the same date range

How to do it...

How it works...

See also

Getting the number of documents with the same value range

How to do it...

How it works...

Getting the number of documents matching the query and sub query

How to do it...

How it works...

How to remove filters from faceting results

Getting ready

How to do it...

How it works...

How to name different faceting results

How to do it...

How it works...

How to sort faceting results in an alphabetical order

Getting ready

How to do it...

How it works...

There's more...

Choosing the sort order in Solr earlier than 1.4

How to implement the autosuggest feature using faceting

Getting ready

How to do it...

How it works...

There's more...

Suggesting words, not whole phrases

How to get the number of documents that don't have a value in the field

Getting ready

How to do it...

How it works...

How to get all the faceting results, not just the first hundred ones

Getting ready

How to do it...

How it works...

How to have two different facet limits for two different fields in the same query

Getting ready

How to do it...

How it works...

7. Improving Solr Performance

Introduction

Paging your results quickly

How to do it...

How it works...

Configuring the document cache

How to do it...

How it works...

Configuring the query result cache

How to do it...

How it works...

Configuring the filter cache

How to do it...

How it works...

See also

Improving Solr performance right after the startup or commit operation

How to do it...

How it works...

There's more...

Improving Solr performance after commit operations

Setting up a sharded deployment

How to do it...

How it works...

There's more...

Dealing with queries taking too much time

Caching whole result pages

Getting ready

How to do it...

How it works...

Improving faceting performance

Getting ready

How to do it...

How it works...

What to do when Solr slows down during indexing when using Data Import Handler

How to do it...

How it works...

There's more...

Commit after a set amount of documents

Commit within a set amount of time

Getting the first top documents fast when having millions of them

Getting ready

How to do it...

How it works...

8. Creating Applications that use Solr and Developing your Own Solr Modules

Introduction

Choosing a different response format than the default one

How to do it...

How it works...

Using Solr with PHP

Getting ready

How to do it...

How it works...

See also

Using Solr with Ruby

Getting ready

How to do it...

How it works...

Using SolrJ to query Solr

Getting ready

How to do it...

How it works...

Developing your own request handler

Getting ready

How to do it...

How it works...

Developing your own filter

Getting ready

How to do it...

How it works...

See also

Developing your own search component

Getting ready

How to do it...

How it works...

Developing your own field type

Getting ready

How to do it...

How it works...

9. Using Additional Solr Functionalities

Introduction

Getting more documents similar to those returned in the results list

How to do it...

How it works...

Presenting search results in a fast and easy way

How to do it...

How it works...

There's more...

Running Solritas on Solr 1.4.1 or 1.4

See also

Highlighting matched words

How to do it...

How it works...

There's more...

Specifying the fields for highlighting

Changing the default HTML tags that surround the matched word

How to highlight long text fields and get good performance

How to do it...

How it works...

Sorting results by a function value

How to do it..

How it works...

Searching words by how they sound

How to do it...

How it works...

See also

Ignoring defined words

How to do it...

How it works...

Computing statistics for the search results

How to do it...

How it works...

Checking user's spelling mistakes

How to do it...

How it works...

See also

Using "group by" like functionalities in Solr

Getting ready

How to do it...

How it works...

There's more...

Fetching more than one document in a group

10. Dealing with Problems

Introduction

How to deal with a corrupted index

How to do it...

How it works...

There's more...

Checking the index without the repair procedure

How to reduce the number of files the index is made of

How to do it...

How it works...

How to deal with a locked index

How to do it...

How it works...

How to deal with too many opened files

Getting ready

How to do it...

How it works...

See also

How to deal with out of memory problems

How to do it...

How it works...

There's more...

Seeing heap when out of memory error occurs

How to sort non-English languages properly

How to do it...

How it works...

See also

How to deal with the infinite loop exception when using shards

How to do it...

How it works...

How to deal with garbage collection running too long

How to do it...

How it works...

There's more...

Monitoring the garbage collector

See also

How to update only one field in all documents without the need of full indexation

How to do it...

How it works...

How to make your index smaller

How to do it...

How it works...

See also

Index

累计评论(0条) 2个书友正在讨论这本书 发表评论

发表评论

发表评论,分享你的想法吧!

买过这本书的人还买过

读了这本书的人还在读

回顶部