Classification flowchart

The diagram below illustrates the primary architectural components of the Automatic Classification sub-system and the steps involved using the XML Classifier as the plug-in.

Classification flowchart using the XML Classifier

<BEGIN TRANSACTION>

  1. The client checks in the target document with the automatic classification flag set.

    NOTE  The document's initial class is set at this point, along with the target document's security descriptor which is not changed during automatic classification. Automatic classification has not yet taken place.

  2. Content Engine persists properties and content. The classification status on the document is set to "Pending".
  3. Content Engine queues a classification request. The request includes a reference to the target document.
    <END TRANSACTION>

<BEGIN TRANSACTION>

  1. The dequeueing thread dispatches the classification request to Classification Manager.
  2. The Classification Manager determines the MimeType from the source document and queries the database for a Document Classification Action with a MimeType of text/xml.
  3. The Classification Manager invokes the XML Classifier, passing it a reference to the target document.
  4. The XML Classifier parses the target document to determine its XML document type and queries the catalog for an XML Property Mapping Script that matches this type. (If the XML Classifier cannot determine XML document type, it fails gracefully.) For more information, see How the XML Classifier chooses which XSL script to use.
  5. The XML Classifier applies the XSL script to the target document to determine the property mappings.
  6. The XML Classifier changes the class of the target document and updates its properties based on the property types and values determined by the mapping script.
  7. Control returns to the Classification Manager.
  8. The Classification Manager sets the ClassificationStatus on the target document to Complete or Failed depending on the return value from the XML Classifier.
    <END TRANSACTION>

Document classification architecture