Obtaining variability estimates between or within groups

Use the Analysis of Variance (ANOVA) transformer to produce statistical calculations in two tables based on a small number of parameters. There are three types of ANOVA: one-way ANOVA, two-way ANOVA, and three-way ANOVA.

You can only use this step with tables that exist in the same database. In the Process Modeler, link your ANOVA step to two Warehouse targets. The ANOVA statistical calculations are written to tables on each of these Warehouse targets. The ANOVA transformer drops the existing database table and recreates it during each run. Each time you run a step using this transformer, the existing data is replaced.

You can make changes to the step only when the step is in Development mode.

../byb.gif Authorities and privileges

To define an ANOVA transformer:

  1. Open the step notebook.

  2. Specify information for your step:

  3. On the Parameters page, select the column or columns from the Available Columns field that will be the Grouping columns for the ANOVA transformer. Depending on the type of ANOVA transformer that you want to create, select either one, two, or three columns and click the > button next to the Grouping columns field.

  4. From the Available Columns field, click a column name and click > next to the Data column field. This column cannot be used as a grouping column. This column must contain numeric data.

  5. Select target tables for your transformer:

  6. Optional: On the Column Mapping page, you can view the mappings between the output columns that result from the transformations that you defined on the Parameters page and the columns on your target table. You cannot change these mappings.

    If the output table is not used by any steps that are in Test or Production, you can rename target columns. To rename a target column, double-click the column name and type the new name.

  7. On the Processing Options page, in the Agent Site list, select an agent site where you want your step to run. The selections in this list are agent sites that are common to the source tables, the target tables, and the transformer or program that you are defining.

  8. If you want to have the option to to run your step at any time, select the Run on demand checkbox. Your step must be in test or production mode before you can run it.

  9. Optional: Select the Populate externally check box if the step is populated externally, meaning that it is invoked in some way other than by the Data Warehouse Center. The step does not have to have any other means of running in the Data Warehouse Center in order to change the mode to production.

    If Populate externally is not selected, then the step must either have a schedule, be linked to a transient table that is input to another step, or be started by another program in order to change the mode to production.

  10. In the Retry area, specify how many times you want the step to run again if it needs to be retried and the amount of time that you want to pass before the next run of the step.

  11. In the Log table field, specify a log table.

  12. Optional: In the Trace level field, specify a trace level.

  13. Click OK to save your changes and close the step notebook.

Related information

Moving and transforming data

Population type descriptions

List of steps and step subtypes

Data Warehouse Center concepts