Operator
|
Modifiers
|
Automatically Relevance-ranked
|
---|---|---|
SOUNDEX
|
MANY, NOT
|
No
|
STEM
|
MANY, NOT
|
No
|
THESAURUS
|
MANY, NOT
|
No
|
TYPO/N
|
NOT
|
|
WILDCARD
|
CASE, MANY, NOT
|
No
|
WORD
|
CASE, MANY, NOT
|
No
|
Operator
|
Modifiers
|
Automatically Relevance-ranked
|
---|---|---|
IN
|
NOT, WHEN
|
|
NEAR
|
NOT
|
Yes
|
NEAR/N
|
NOT, ORDER
|
Yes
|
PARAGRAPH
|
MANY, NOT, ORDER
|
Yes
|
PHRASE
|
MANY, NOT
|
Yes
|
SENTENCE
|
MANY, NOT, ORDER
|
Yes
|
Operator
|
Modifiers
|
Automatically Relevance-ranked
|
---|---|---|
ACCRUE
|
NOT
|
Yes
|
ALL
|
NOT, ORDER
|
No
|
AND
|
NOT
|
Yes
|
ANY
|
NOT
|
No
|
OR
|
NOT
|
Yes
|
The
ACCRUE
operator scores retrieved documents according to the presence of each search element in the document using "the more, the better" approach; the more search elements found in the document, the better the document's score. The following examples illustrate the search syntax.For example, to select documents containing stemmed variations of the words "computers" and "laptops," enter any of the following:
For example, to select documents which contain stemmed variations of the phrase "pharmaceutical companies" and stemmed variations of the word "stock," enter the following:
AND
operator are relevance-ranked. For example, to select documents which contain stemmed variations of the phrase "pharmaceutical companies" and stemmed variations of the word "stock," enter the following:
For example, to select documents that contain stemmed variations of the word "election" or the phrases "national elections" or "senatorial race", enter the following:
IN
operator works only if document zones have been defined in your collections. If you use the IN
operator to search collections without defined zones, no documents will be selected. Also, the zone name you specify must match the zone names defined in your collections. Consult your collection administrator to determine which zones have been defined for specific collections.The
IN
operator can be qualified with the WHEN
operator to search for a term only within the one or more zones upon which certain conditions have been placed. Use of the WHEN
operator is described below.The following query expression searches document zones named "summary" for the word "safety."
Say you want to search for the word "here" in a zone named "A," whose HREF attribute contains the string "verity," and the text looks like this:
WHEN
operator must be enclosed in parentheses, as shown above. A query condition can include one or more Verity operators; it takes the form:
STARTS>
, <ENDS>
, <CONTAINS>
, <=>
, or <MATCHES>
. Except for =
, all operators must be surrounded by angle brackets.Attribute test operators can be combined with the combination operators
<AND>
or <OR>
. For example, you can search for the string "IBM" in a zone named "Company," when the attribute named "reference" is either equal to "major" or "significant" by using the following query:
For example, if the search expression includes two words, and those words occur next to each other in a document (so that the region size is two words long), then the score assigned to that document is 1.0. Thus, the document with the smallest possible region containing all search terms always receives the highest score. As search terms appear further apart, the score drops toward zero. A document receives a zero score only if it does not contain all search terms.
The
NEAR
operator is similar to the other proximity operators in the sense that the search words you enter must be found within close proximity of one another. However, unlike other proximity operators, the NEAR
operator calculates relative proximity and assigns scores based on its calculations. To retrieve relevance-ranked documents that contain stemmed variations of the words "war" and "peace" within close proximity to each other, enter the following:
For example, if the search expression
NEAR/5
is used to find two words within five words of each other, a document that has the specified words within three words of each other is scored higher than a document that has the specified words within five words of each other.The N variable can be an integer between 1 and 1,024, where
NEAR/1
searches for two words that are next to each other. If N is 1,000 or above, you must specify its value without commas, as in NEAR/1000
. You can specify multiple search terms using multiple instances of NEAR/
N, as long as the value of N is the same.For example, to retrieve relevance-ranked documents that contain stemmed variations of the words "commute," "bicycle," "train," and "bus" within 10 words of each other, enter the following:
NEAR/
N operator with the ORDER
modifier to perform ordered proximity searches. For more information about the ORDER
modifier, see "ORDER" in Chapter 4.
OR
Selects documents that show evidence of at least one of your search elements. Documents selected using the OR
operator are relevance-ranked.
To retrieve relevance-ranked documents that contain stemmed variations of the word "drug" and the phrase "cancer treating" in the same paragraph, enter the following:
PARAGRAPH
operator between each word or phrase.You can use the
PARAGRAPH
operator with the ORDER
modifier to perform ordered proximity searches. For more information about the ORDER
modifier, see "ORDER" in Chapter 4.
PHRASE
Selects documents that include a phrase you specify. A phrase is a grouping of two or more words that occur next to each other in a specific order.
To retrieve relevance -ranked documents that contain stemmed variations of the words "American," and "innovation" within the same sentence, enter the following:
SENTENCE
operator with the ORDER
modifier to perform ordered proximity searches. For more information about the ORDER
modifier, see "ORDER" in Chapter 4.
SOUNDEX
Selects documents that include one or more words that "sound like," or whose letter pattern is similar to, the word specified. Words must start with the same letter as the word you specify to be selected. SOUNDEX
operator. See your collection administrator for information.
MANY
modifier is used, as in:
For example, to retrieve documents containing a variation of the word "film," enter the following:
MANY
modifier is used, as in:
For example, to retrieve documents containing synonyms of the word "altitude" enter the following:
MANY
modifier is used, as in:
TYPO/
N operator performs "approximate pattern matching" to identify similar words. This makes it ideal for use in an environment where documents have been scanned using an Optical Character Reader (OCR). The optional N variable in the operator name expresses the maximum number of errors between the query term and a matched term, a value called the error distance. If N is not specified, an error distance of 2 is used.
The error distance between two words is based on the calculation of errors, where an error is defined to be a character insertion, deletion, or transposition. For example, for these sets of words, the second word matches the first within an error distance of 1:
- mouse, house (m\xde h)
- agreed, greed (a is deleted)
- cat, coat (o is inserted)
TYPO/
N operator must scan the collection's word list to find candidate matching words. This makes it impractical for use in large collections (greater than 100,000 documents unless a current spanning word list is available) or in performance-sensitive environments. Performance can be improved by generating a spanning word list for the collections to be used. NOTE: Please note these limitations. A query term specified with
TYPO/
N can have a maximum length of 32 characters. Also, TYPO/
N is not supported with multi-byte character sets.
WILDCARD
operator lets you define a wildcard string, which can be used to locate related word matches in documents. A wildcard string consists of special characters. For example, to retrieve documents that contain words such as, "pharmaceutical," "pharmacology," and "pharmacodynamics," enter the following:
MANY
modifier is used, as in:
WILDCARD
operator explicitly with any of the characters in the following table.
style.lex
file used to create the collections you are searching is configured to recognize the characters you want to locate. Consult your collection administrator for more information.
style.lex
file is configured for the collections to be searched, you can search for a word containing a wildcard character such as "/" or "*" by preceding the wildcard character with a backslash. For example, when you enter the following search string:
When you want to match a literal backslash, you must enter two backslashes.
style.lex
file used to create the collections you are searching is configured to recognize the backquote character. Consult your collection administrator for information.
For example, to search for documents that contain the word "rhetoric," without also considering the words "rhetorical" and "rhetorician," enter the following:
MANY
modifier is used, as in: