Server parameters: Indexing settings

The indexing parameters for an IBM® Content Search Services server concern index servers only. These parameters are viewed and set by using the command-line configuration tool.

The term "text document" refers to the text that is indexed for a Content Engine object.

Important: Do not change server parameter values unless a representative from IBM Customer Support instructs you to do so.

Document batch size
The default number of text documents that are retrieved from the input queue in each call.

To set the parameter value:

      configTool configureParams -configPath path -documentBatchSize document-batch-size

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.
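
As an illustration of the command syntax, the following is a minimal sketch that only prints a filled-in command so that it can be reviewed before anyone runs it. The helper name `print_config_cmd`, the configuration path `/opt/ibm/css/config`, and the batch size of 50 are assumptions for the example and do not come from this documentation. The sketch also assumes that options are introduced by a plain ASCII hyphen.

```shell
#!/bin/sh
# Hypothetical helper: print (do not run) a configureParams command.
# $1 = configuration path
# $2 = parameter name without the leading hyphen
# $3 = parameter value
print_config_cmd() {
    printf 'configTool configureParams -configPath %s -%s %s\n' "$1" "$2" "$3"
}

# Assumed example values, not defaults or recommendations.
print_config_cmd /opt/ibm/css/config documentBatchSize 50
```

Printing the command instead of running it keeps the example consistent with the instruction to change values only under the guidance of IBM Customer Support.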

Document retry number
The number of times that text preprocessing is reattempted for a text document if the initial attempt times out. An attempt times out if its duration exceeds the value that is specified by the Document timeout parameter. The Document timeout parameter applies to each attempt. If all attempts fail, an error is generated for the text document.

To set the parameter value:

      configTool configureParams -configPath path -documentRetryNumber document-retry-number

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.
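
Because the Document timeout parameter applies to the initial attempt and to each retry, the longest time that preprocessing can spend on one text document before an error is generated is roughly the timeout multiplied by one plus the retry number. The following sketch works through that arithmetic. The function name and the sample values (a 60-second timeout and 2 retries) are assumptions for illustration, and the calculation ignores any time spent between attempts.

```shell
#!/bin/sh
# Upper bound on preprocessing time for one text document:
# the initial attempt plus every retry, each limited by the timeout.
# $1 = Document timeout in seconds
# $2 = Document retry number
worst_case_seconds() {
    echo $(( $1 * (1 + $2) ))
}

# Assumed example values: 60-second timeout, 2 retries.
worst_case_seconds 60 2   # prints 180
```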

Document timeout
The maximum number of seconds that is allowed for each attempt at text preprocessing for a text document. If this value is exceeded, the Document retry number parameter determines the number of times that preprocessing is reattempted.

To set the parameter value:

      configTool configureParams -configPath path -documentTimeout document-timeout

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.

Input queue memory size
The memory size of the input queue on the index server. The input queue contains text documents that are waiting for preprocessing. A larger memory size permits faster server operation but consumes more system resources. The default memory size is 15 MB.

To set the parameter value:

      configTool configureParams -configPath path -inputQueueMemorySize input-queue-memory-size

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.

Maximum document size
The maximum number of characters that are indexed for a text document.

To set the parameter value:

      configTool configureParams -configPath path -maxDocumentSize maximum-document-size

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.

Number of indexer threads
The number of indexing threads that run on the server.

To set the parameter value:

      configTool configureParams -configPath path -numberOfIndexerThreads number-of-indexer-threads

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.

Number of preprocessing threads
The number of preprocessing threads that run on a server. Each preprocessing thread can include the following steps: document constructor, text extraction, language identification, tokenization, and language analysis.

To set the parameter value:

      configTool configureParams -configPath path -numberOfPreprocessingThreads number-of-threads

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.

Number of tokenizers
The number of tokenizer threads that run for parsing input into tokens. The tokens are based upon an analysis of the text to identify word stems and other language constructs. Typically, you set the same number of tokenizer threads as preprocessing threads.

To set the parameter value:

      configTool configureParams -configPath path -numberOfTokenizers number-of-tokenizers
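
To keep the tokenizer count equal to the preprocessing thread count, both parameters can be given the same value. The following sketch prints the two commands for review rather than running them. The helper name `print_matched_thread_cmds`, the configuration path, and the thread count of 4 are assumptions for illustration, not recommended settings, and the sketch assumes that options are introduced by a plain ASCII hyphen.

```shell
#!/bin/sh
# Print (do not run) the two commands that keep the tokenizer and
# preprocessing thread counts equal.
# $1 = configuration path
# $2 = thread count that is used for both parameters
print_matched_thread_cmds() {
    for param in numberOfPreprocessingThreads numberOfTokenizers; do
        printf 'configTool configureParams -configPath %s -%s %s\n' "$1" "$param" "$2"
    done
}

# Assumed example values.
print_matched_thread_cmds /opt/ibm/css/config 4
```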

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.

Output queue memory size
The memory size of the output queue on the index server. The output queue contains text documents that are waiting to be indexed after preprocessing. A larger memory size permits faster server operation but consumes more system resources. The default is 15 MB.

To set the parameter value:

      configTool configureParams -configPath path -outputQueueMemorySize output-queue-memory-size

For more information about this tool or the meaning of the variables that are used in the command syntax, see Configuration tool usage.