As it processes data, the system tracks statistics about system
performance and about the data in the incoming data source files that
it loads. These statistics are summarized in two reports: the Data
Source Summary report and the Load Summary report.
About this task
The statistics on these reports can help you quickly verify
that the system is processing all the incoming data records, make
operational decisions about system performance, evaluate the quality
of the incoming data, and see how many new identities, new entities,
new relationships, and new alerts resulted from processing the data
files.
- In the Configuration Console, select .
- Required: From the Report list,
choose a statistical report:
- Data Source Summary Report - This report
provides a quick statistical summary by data source of the records
loaded and processed. Use it to see the total number of records loaded
by data source file, the total number of new identity records processed
by data source file, and the total number of new entities based on
the data in this data source file. The Data Source Summary report
is sorted by load date, load ID, data source, and data source file.
- Load Summary Report - This report summarizes
statistics and quality characteristics for one or more data sources. Use
the report to see load performance information, quality of the data
source file, and summaries of the data values used to resolve entities,
detect relationships, and generate alerts. This report can help you
determine the quality of the data being loaded from a particular data
source. Lower quality data can indicate that the data in this data
source requires additional cleansing, either before being loaded into
the system or during entity resolution by applying specific DQM (data
quality management) rules to the data. The Load Summary report is
sorted by load ID.
- In the From Date field, enter the
starting date for the report using mm/dd/yyyy format. By default,
this field contains the current date. This field can be left blank,
which means that the system reports all data within all other
specified criteria beginning with the date the system became
operational.
- In the Thru Date field, enter the
ending date for the report using mm/dd/yyyy format. By default,
this field contains the current date. This field can be left blank,
which means that the system reports all data within all other
specified criteria through the current date.
- Optional: In Data Source Code,
enter a specific data source code to report on. The data
source code you enter must exactly match a configured data source
code.
This field can be left blank, which means that the system
reports statistics for all data sources within all other specified
criteria.
- Required: Click Run Report to
generate the selected report.
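The way the three report criteria combine can be sketched in a few lines of Python. This is an illustration only, not the product's implementation: the function names, parameter names, and the idea of testing one load record at a time are assumptions made for the example. It shows the mm/dd/yyyy date format, a blank From Date meaning no lower bound, a blank Thru Date meaning through the current date, and a blank Data Source Code meaning all data sources.

```python
from datetime import date, datetime
from typing import Optional

DATE_FORMAT = "%m/%d/%Y"  # mm/dd/yyyy, the format the date fields expect


def parse_report_date(text: str) -> Optional[date]:
    """Return a date for mm/dd/yyyy input, or None when the field is blank."""
    text = text.strip()
    if not text:
        return None
    # Raises ValueError if the text is not a valid mm/dd/yyyy date.
    return datetime.strptime(text, DATE_FORMAT).date()


def matches_criteria(
    load_date: date,
    data_source_code: str,
    from_text: str = "",
    thru_text: str = "",
    code_filter: str = "",
) -> bool:
    """Hypothetical check: would a load with this date and code be reported?"""
    start = parse_report_date(from_text)               # blank: no lower bound
    end = parse_report_date(thru_text) or date.today()  # blank: through today

    if start is not None and load_date < start:
        return False
    if load_date > end:
        return False
    # Assumption: "exactly match" is treated as a case-sensitive comparison.
    if code_filter and data_source_code != code_filter:
        return False
    return True
```

For example, `matches_criteria(date(2024, 3, 15), "CRM", "03/01/2024", "03/31/2024", "CRM")` evaluates to `True`, while the same call with a From Date of `04/01/2024` evaluates to `False`. The data source code `CRM` is a made-up value for the example.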
Results
The system generates the selected statistical report based
on all specified criteria and displays the report in a separate window.
What to do next
Use the statistical information on this report to help tune
the system or data files.