Operator SpecifiedStatistics

An operator which computes certain statistical values for the TheInputConcept. The computed values appear in a ColumnSet which contains exactly one row with the statistical values, and which belongs to TheOutputConcept.

The sum of all values in an attribute can be computed by specifying a BaseAttribute with the parameter AttributesComputeSum. There can be more such attributes; the sum is computed for each. TheOutputConcept must contain a BaseAttribute for each sum which is computed; their names must be those of the input attributes, followed by the suffix "_SUM".

The total number of entries in an attribute can be computed by specifying a BaseAttribute with the parameter AttributesComputeCount. There can be more such attributes; the number of entries is computed for each. TheOutputConcept must contain a BaseAttribute for each count which is computed; their names must be those of the input attributes, followed by the suffix "_COUNT".

The number of unique values in an attribute can be computed by specifying a BaseAttribute with the parameter AttributesComputeUnique. There can be more such attributes; the number of unique values is computed for each. TheOutputConcept must contain a BaseAttribute for each number of unique values which is computed; their names must be those of the input attributes, followed by the suffix "_UNIQUE".

Further, for a BaseAttribute specified with AttributesComputeDistrib, the distribution of its values is computed. For example, if a BaseAttribute contains the values 2, 4 and 6, three output BaseAttributes will contain the number of entries in the input where the value was 2, 4 and 6, respectively. For each BaseAttribute whose value distribution is to be computed, the possible values must be given with the parameter DistribValues. One entry in this parameter is a comma-separated string containing the different values; in the example, the string would be "2,4,6". Thus, the number of entries in AttributesComputeDistrib and DistribValues must be equal. TheOutputConcept must contain the corresponding number of BaseAttributes (three in the example); their names must be those of the input attributes, followed by the suffix "_<value>". In the example, TheOutputConcept would contain the BaseAttributes "inputBaName_2", " inputBaName_4" and "inputBaName_6".

Parameter

Parameter Object Type optional min_arg max_arg Remarks
TheInputConcept Concept Input no 1 1 inherited
GroupBy BaseAttribute Input yes 0
AttributesComputeSum BaseAttribute Input yes 0 numeric
AttributesComputeCount BaseAttribute Input yes 0 (see
AttributesComputeUnique BaseAttribute Input yes 0  
AttributesComputeMin BaseAttribute Input yes 0
AttributesComputeMax BaseAttribute Input yes 0
AttributesComputeDistrib BaseAttribute Input yes 0 text)
DistribValues Value Input yes 0  
TheOutputConcept Concept Output no 1 1 inherited