Internet-style Parser


The internet-style parser, available with Verity Information Server, is designed to let users enter familiar commands from the Web to search for information. The internet-style parser supports searching over the full text of documents in addition to document zones and fields. The internet-style parser does not support Verity query language except topics as described below.

In a search form enabled with the internet-style query parser, users can enter words, phrases, and plain language. Zone and field searches can be performed as discussed.

Words

To search with words, separate the words with spaces. Enter words without capital letters to searches for words with and without capitalization. When you enter a word with capitalization or punctuation, the Verity internet-style Query Parser looks for the word exactly as you typed it. A string of capitalized words is assumed to be a name and is automatically searched for as a phrase.

Phrases

To search for an exact phrase, surround the phrase with double quotation marks. A string of capitalized words is assumed to be a name. Separate a series of names with commas. Commas aren't needed when the phrases are surrounded by quotation marks. The following example searches for a document that contains the phrases "San Francisco" and "sourdough bread".

San Francisco "sourdough bread"

Plain Language

To search with plain language, enter a question or concept and the Verity internet-style Query Parser identifies the important words and searches for them. For example, enter a question such as:

Where is the sales office in San Francisco?

This query produces the same results as entering:

sales office San Francisco

Expanded Queries

Searches can be expanded by using wildcards and topics in an internet-style query. Use one asterisk (*) to substitute for one or more characters.

When a topic has been defined, it can be used in queries. To prevent a word from being interpreted as a topic, surround the word with single quotation marks. The Verity internet-style query parser will retrieve only documents that exactly match the word. The following table shows how the Verity internet-style query parser interprets various queries when a topic named "domestic-violence" exists.

Search Term
Interpreted as a Topic
domestic-violence
Yes
Domestic Violence
Yes
"domestic violence"
Yes
`domestic violence'
No
domestic violence in America
No

Including and Excluding Search Terms

You can limit searches by excluding or requiring search terms or limiting the areas of the document that are searched. Documents that contain optional words have higher scores than documents that contain only the required word.

A minus sign (-) immediately preceding a search term (word or phrase) excludes documents containing the term. A plus sign (+) immediately preceding a search term (word or phrase) return documents containing the term.

Zone Searches

The internet-style parser allows users to perform zone searches. The zones that are available for searching depend on the type of documents in the collection. Zones are available in Markup Language documents (such as HTML and SGML) and Internet Message format documents (such as standard email and Usenet newsgroup messages). The online search tips identify the zones available in each type of document.

To search a document zone, type the name of the zone, a colon (:), and the search terms.

Field Searches

The internet-style parser allows users to perform field searches. The fields that are available for searching depend on field extraction rules based on document type of documents in the collection. For more information, see the Verity Information Server Administration Guide.

To search a document field, type the name of the field, a colon (:), and the search terms.

NOTE: Field searches are enabled by an entry in the vdk30.qp file located here.

verity/common/english/vdk30.qp

In the vdk30.qp file, the /config modifier must be present, as shown below, in order for field searches to function properly:

/config="enableField=YES"

If the vdk30.qp file does not contain the /config="enableField="YES" entry, then no documents are returned for field searches. Verity Information Server V5.0.1 and greater includes the above /config modifier in the default vdk30.qp file so field searches function properly without the need for configuration.





Copyright © 2001, Verity, Inc. All rights reserved.