Content Engine includes an XML Classification Plug-in, called the XML Classifier in the Enterprise Manager. When properly configured, this XML plug-in automatically populates Document object properties with values based on data extracted from incoming XML document content. Administrators build XSL scripts that map XML tags in the incoming document to Content Engine properties. Each script is registered for a specific XML document type. When processing an XML document of a particular document type, the XML Classifier invokes the script associated with the same document type. The script identifies the tags and values of the incoming XML document and transforms them into the properties of the class assigned to the XML document.
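The mapping idea described above can be sketched with the XSLT support that ships in the JDK. This is a minimal illustration only: the `invoice` document, the `InvoiceNumber` and `CustomerName` property names, and the name=value output layout are all hypothetical, and the output format that the XML Classifier actually expects from a mapping script is not shown here.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

class MappingScriptSketch {

    // Hypothetical incoming XML document; element names are invented for illustration.
    static final String INVOICE_XML =
        "<invoice><number>INV-1001</number><customer>Acme</customer></invoice>";

    // Hypothetical XSLT 1.0 script that maps XML tags to property names.
    // The name=value text output is an assumption, not the classifier's real format.
    static final String MAPPING_XSL =
        "<xsl:stylesheet version=\"1.0\""
      + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
      + "<xsl:output method=\"text\"/>"
      + "<xsl:template match=\"/invoice\">"
      + "InvoiceNumber=<xsl:value-of select=\"number\"/>;"
      + "CustomerName=<xsl:value-of select=\"customer\"/>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Applies the stylesheet to the document using the JDK's built-in XSLT processor.
    static String transform(String xml, String xsl) {
        try {
            TransformerFactory factory = TransformerFactory.newInstance();
            Transformer transformer =
                factory.newTransformer(new StreamSource(new StringReader(xsl)));
            StringWriter out = new StringWriter();
            transformer.transform(
                new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalArgumentException("Transformation failed", e);
        }
    }

    public static void main(String[] args) {
        // prints InvoiceNumber=INV-1001;CustomerName=Acme
        System.out.println(transform(INVOICE_XML, MAPPING_XSL));
    }
}
```

Running a draft script against a sample document in this way, outside Content Engine, is one way to confirm that the expected tags are selected before the script is registered.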
XSL scripts are checked into an object store as objects of the XML Property Mapping Script class. This class is based on the root Document class and can be subclassed. Objects of this class are versionable.
You create scripting objects using the Create New XML Property Mapping Script wizard. The important classification-related properties of the XML Property Mapping Script class are:
Automatic classification appears as an option in the Create New Document wizard in Enterprise Manager. The XML Classifier operates only on the first content element of a document; additional content elements are ignored. The XML Classifier does not support binary properties, multi-value properties, or object-valued properties. If the XML document contains a schema reference, the XML Classifier uses the schema to validate the document. If there is no schema reference, validation does not occur. In either case, the XML document must be well-formed.
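Because a document that is not well-formed cannot be classified, it can be useful to check well-formedness before adding the document. The following is a minimal sketch that uses the JDK's standard parser; the `WellFormedCheck` class and `isWellFormed` method are hypothetical names and are not part of Content Engine. The parser is non-validating, so this check covers well-formedness only, not schema validation.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

class WellFormedCheck {

    // Returns true if the text parses as well-formed XML, false otherwise.
    static boolean isWellFormed(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            DocumentBuilder builder = factory.newDocumentBuilder();
            // DefaultHandler rethrows fatal errors and keeps the parser
            // from printing its own messages to standard error.
            builder.setErrorHandler(new DefaultHandler());
            builder.parse(new InputSource(new StringReader(xml)));
            return true;
        } catch (SAXException e) {
            // A well-formedness violation is reported as a fatal parse error.
            return false;
        } catch (IOException | ParserConfigurationException e) {
            throw new IllegalStateException("Parser could not be used", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(isWellFormed("<order><item/></order>"));  // closed tags
        System.out.println(isWellFormed("<order><item></order>"));   // unclosed <item>
    }
}
```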
The XML Classifier uses the XML features provided by the JDK. XSL scripts must therefore be compliant with the XSLT Version 1.0 standard.
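Since the scripts are processed with the JDK's XML features, a script can be compiled with the JDK's own XSLT processor as a rough pre-check before it is checked in. The sketch below assumes nothing about Content Engine itself; `StylesheetPrecheck` and `compiles` are hypothetical names. A successful compile shows only that the JDK processor accepts the script, not that the script produces the mapping you intend.

```java
import java.io.StringReader;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamSource;

class StylesheetPrecheck {

    // Returns true if the JDK's XSLT processor can compile the stylesheet.
    static boolean compiles(String xsl) {
        try {
            TransformerFactory.newInstance()
                .newTemplates(new StreamSource(new StringReader(xsl)));
            return true;
        } catch (TransformerException e) {
            // Compilation problems surface as TransformerConfigurationException,
            // a subclass of TransformerException.
            return false;
        }
    }

    public static void main(String[] args) {
        // Hypothetical minimal XSLT 1.0 script used only to exercise the check.
        String script =
            "<xsl:stylesheet version=\"1.0\""
          + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
          + "<xsl:template match=\"/\"><xsl:value-of select=\"name(*)\"/></xsl:template>"
          + "</xsl:stylesheet>";
        System.out.println(compiles(script));
    }
}
```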