IBM FileNet P8, Version 5.2.1            

Understanding the XML Classifier

Content Platform Engine includes an XML Classification Plug-in called the XML Classifier. When properly configured, this XML plug-in automatically populates Document object properties with values based on data extracted from incoming XML document content. Administrators build XSL scripts that map XML tags in the incoming document to Content Platform Engine properties. Each script is registered for a specific XML document type. When processing an XML document of a particular document type, the XML Classifier invokes the script associated with the same document type. The script identifies the tags and values of the incoming XML document and transforms them into the properties of the class assigned to the XML document.

XSL scripts are checked into an object store as objects of the XML Property Mapping Scripts class. Based on the root Document class, the XML Property Mapping Scripts class can be subclassed. Objects of this class are versionable.

The important classification-related properties of the XML Property Mapping Script class are:
XML Document Type:
A string that identifies the document type of the source XML document. This value can be a UUID, a URI identifying an XML DTD or Schema, or the name of the root element. Maximum length is 255 characters.
Target Class:
The class the XML document is assigned to if automatic classification is successful. You can change the value of the Target Class on the XPMS tab of the property sheet of instances of the XML Property Mapping Script class.

You specify automatic classification when you create a document, such as when check in a document with the Content Engine API. Also, automatic classification appears as an option in the Create New Document wizard. The XML Classifier operates only on the first content element of a document; additional content elements are ignored. The XML Classifier does not support binary properties, multi-valued properties, or object-valued properties. However, the XML document must be well-formed.

The XML Classifier utilizes the XML features that are provided by the JDK and must be compliant with the XSLT Version 1.0 standards.



Last updated: October 2015
adc_understanding_the_xml_classifier.htm

© Copyright IBM Corporation 2015.