Index Tuning


You can control, to a degree, what goes into a collection's indexes so that they support the search behavior you want. The following style files affect the index contents.

Style File  Description
style.lex   Controls how word, paragraph, and sentence tokens are interpreted during indexing. For instance, to search for words that contain non-alphanumeric characters, such as OS/2 and AT&T, you must define those non-alphanumeric characters as valid word elements.
style.stp   Determines which words are excluded (stopped out) from the collection's word index.
style.go    Determines which words are included in the collection's word index.
style.prm   Used to build the word index and the specialized indexes that support clustering and the SOUNDEX operator. By default, no Soundex index is built for a collection, and without one the SOUNDEX operator does not work.
style.wld   Determines the indexing policy for word assist generation.
style.ufl   Defines the custom fields to be included in the internal documents table. Optionally, field attribute indexes can be generated for certain field types. These indexes let the Verity engine look up field values quickly without opening every document record. There are two types of field attribute index, Indexed and Minmax; using either type results in faster field searches.
style.fxs   Specifies words to be excluded during feature extraction so that they do not appear in document summaries and clusters.
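
To make the effect of the lexing, stop-word, and go-word settings concrete, here is a minimal Python sketch of the general idea. It is not Verity syntax and does not reproduce the engine's behavior. The function names, the extra_word_chars parameter, the lowercasing step, and the way the stop list and go list are combined are all assumptions made for illustration.

```python
import re

def tokenize(text, extra_word_chars=""):
    """Split text into lowercase word tokens.

    Letters and digits always count as word characters. Any character in
    extra_word_chars (a hypothetical parameter) is treated as one too,
    which is the effect described for style.lex.
    """
    char_class = "A-Za-z0-9" + re.escape(extra_word_chars)
    return [t.lower() for t in re.findall(f"[{char_class}]+", text)]

def index_terms(tokens, stop_words=frozenset(), go_words=None):
    """Filter tokens before they reach the word index.

    stop_words are dropped (the style.stp idea). If go_words is given,
    only those words are kept (the style.go idea). Applying both in this
    order is an assumption, not documented engine behavior.
    """
    kept = [t for t in tokens if t not in stop_words]
    if go_words is not None:
        kept = [t for t in kept if t in go_words]
    return kept

text = "OS/2 and AT&T"
print(tokenize(text))                                    # ['os', '2', 'and', 'at', 't']
print(tokenize(text, "/&"))                              # ['os/2', 'and', 'at&t']
print(index_terms(tokenize(text, "/&"), stop_words={"and"}))  # ['os/2', 'at&t']
```

With the default settings, OS/2 and AT&T are split into fragments that cannot be searched for as single words. Once "/" and "&" are declared word characters, each survives as one index term.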

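The reason the SOUNDEX operator depends on a separate index can be sketched as follows. This example implements the classic American Soundex code and groups words by it. Whether the Verity engine uses exactly this variant is an assumption, and soundex and build_soundex_index are hypothetical names.

```python
from collections import defaultdict

# Letter-to-digit table for classic American Soundex.
CODES = {}
for digit, letters in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for ch in letters:
        CODES[ch] = str(digit)

def soundex(word):
    """Return the four-character Soundex code of word ('' if it has no letters)."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code is kept after it
    return (result + "000")[:4]

def build_soundex_index(words):
    """Map each Soundex code to the set of words that produce it."""
    index = defaultdict(set)
    for w in words:
        index[soundex(w)].add(w)
    return index

index = build_soundex_index(["Robert", "Rupert", "Smith"])
print(soundex("Robert"))        # R163
print(sorted(index["R163"]))    # ['Robert', 'Rupert']
```

A sound-alike query then becomes a single lookup by code. Without a prebuilt mapping of this kind, there is nothing for such a lookup to consult.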

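The table does not describe how the Indexed and Minmax field attribute indexes work internally. As a generic illustration only, the sketch below shows two common techniques with similar names: a value-to-document map, and a per-partition minimum/maximum summary that lets a range search skip partitions. The data layout and all function names are assumptions and are not taken from the Verity engine.

```python
from collections import defaultdict

def build_value_index(records, field):
    """Map each field value to the ids of the documents that carry it."""
    index = defaultdict(list)
    for doc_id, rec in records.items():
        index[rec[field]].append(doc_id)
    return index

def build_minmax(partitions, field):
    """Return one (min, max) pair per partition; each partition must be non-empty."""
    summary = []
    for part in partitions:
        values = [rec[field] for rec in part.values()]
        summary.append((min(values), max(values)))
    return summary

def partitions_to_scan(summary, low, high):
    """Return indexes of partitions whose value range overlaps [low, high]."""
    return [i for i, (lo, hi) in enumerate(summary) if hi >= low and lo <= high]

# Hypothetical data: two partitions of document records with a "year" field.
partitions = [
    {1: {"year": 1998}, 2: {"year": 2000}},
    {3: {"year": 2001}, 4: {"year": 2002}},
]
summary = build_minmax(partitions, "year")
print(summary)                                   # [(1998, 2000), (2001, 2002)]
print(partitions_to_scan(summary, 2001, 2005))   # [1]
print(build_value_index(partitions[0], "year")[2000])  # [2]
```

In both cases the search consults a small summary structure first and opens document records only where a match is possible.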


Copyright © 2002, Verity, Inc. All rights reserved.