Database

The relational database contains the cluster data, organised into tables, for reporting and analysis.

About the database

The relational database contains the cluster operations data for reporting and analysis. Platform Analytics components input and output data from the tables within the database. Apart from Vertica, Platform Analytics supports Oracle 9i, 10g, and 11g databases.

Default behavior

Data is stored and organized in tables within the database. The organization of this data is defined in the data schema of the tables.

The database and its data schema are partitioned for Platform Analytics data. A partitioned database has tables divided into multiple, smaller tables. This improves database performance for larger clusters.

In a large database, purging old job records, transforming data, and other database maintenance tasks can have a significant effect on database performance. Purging old job records and transforming data from smaller tables has less of an impact on the system performance of active tables than on larger tables.

The database tables are partitioned by quarter. Platform Analytics keeps three years of data in the database. Every month, Platform Analytics has a scheduled task that drops any partition that is older than three years by quarter.

Database interactions

All interactions between Platform Analytics and the database are through the JDBC connection as defined by the data sources.

The following diagram illustrates the interaction between the database and other components.