DataFlux Data Management Studio 2.6: User Guide
You can use the Metrics dialog to select the metrics for a profiling run from the standard set of default metrics. The Metrics dialog contains the following elements:
Frequency distribution - When selected, determines the distinct count and percentage that each different value makes up in your data.
Pattern frequency distribution - When selected, determines the distinct count and percentage of every different word or character pattern within your data. Frequency distribution is supported for both characters and numbers.
Percentiles - When selected, determines a numeric layout of your data at a percentile interval that you specify. Specifically, the pth percentile is a value such that approximately p% of the data is smaller and (100-p)% of the data is larger.
Outliers - When selected, extracts a number of specified outlier minimum and maximum values from your data. Furthermore, you can perform a drill-down operation for an exact view of the outlier records.
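The four elements above can be illustrated outside the product with a minimal Python sketch. This is not how Data Management Studio computes these metrics. The function names are hypothetical, the pattern symbols (9 for a digit, A for an uppercase letter, a for a lowercase letter) are an assumed convention, and the percentile uses the nearest-rank method, which is only one of several common definitions.

```python
from collections import Counter
import math


def frequency_distribution(values):
    """Return {value: (count, percent of all records)} for each distinct value."""
    counts = Counter(values)
    total = len(values)
    return {v: (c, 100.0 * c / total) for v, c in counts.items()}


def char_pattern(value):
    """Reduce a value to a character pattern (assumed symbols: 9, A, a)."""
    out = []
    for ch in str(value):
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A" if ch.isupper() else "a")
        else:
            out.append(ch)  # punctuation and spaces are kept as-is
    return "".join(out)


def pattern_frequency_distribution(values):
    """Frequency distribution of the patterns rather than the raw values."""
    return frequency_distribution([char_pattern(v) for v in values])


def percentile(values, p):
    """Nearest-rank pth percentile (an assumption; other definitions exist)."""
    if not values:
        raise ValueError("percentile requires at least one value")
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def outliers(values, n):
    """Return the n smallest and the n largest values."""
    ordered = sorted(values)
    n = min(n, len(ordered))
    return ordered[:n], ordered[len(ordered) - n:]
```

With these definitions, char_pattern("919-555-0100") yields "999-999-9999", and percentile([15, 20, 35, 40, 50], 40) yields 20.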
The Column Profiling list contains the following metrics:
Count - When selected, determines the total number of records in your data.
Unique Count - When selected, determines the number of distinct and unique records in your data.
Null Count - When selected, determines the number of records that have the null value.
Non-null Count - When selected, determines the number of records that do not have the null value.
Blank Count - When selected, determines the number of records that are blank.
Pattern Count - When selected, determines a count of different word or character patterns in your data.
Minimum Value - When selected, determines the minimum value in your data.
Minimum Length - When selected, determines the minimum string length in your data.
Maximum Value - When selected, determines the maximum value in your data.
Maximum Length - When selected, determines the maximum string length in your data.
Mean - When selected, calculates the mean value of your numeric data content. This value is calculated by dividing the sum of all the numbers by the total count of numbers.
Median - When selected, calculates the median value of your numeric content. The median is the middle of a distribution: half the scores are above the median and half are below the median.
Mode - When selected, calculates the mode value of your numeric content. The mode is the most frequently occurring score in a distribution. If you have multimodal distributions, this feature is disabled.
Standard Deviation - When selected, calculates the standard deviation of your numeric content. The standard deviation measures the spread of the data about the mean value. It is useful in comparing sets of data which might have the same mean but a different range.
Standard Error - When selected, calculates the standard error of the mean of your numeric data. The standard error is the standard deviation of the sampling distribution of a statistic. Thus, the standard error of the mean is the standard deviation of the sampling distribution of the mean.
Data Type - When selected, describes the type of your data as it was created in the design of your table.
Data Length - When selected, describes the length of your data as it was created in the design of your table.
Ordinal Position - When selected, displays the actual location of a field within the table.
Primary Key Candidate - When selected, indicates whether the field contains distinct values, making it a candidate as a primary key.
Nullable - When selected, indicates whether the values in a field can be set to null.
Decimal Places - When selected, determines the scale (the number of decimal places) of a decimal or floating-point variable.
Actual Type - When selected, determines the actual type of a variable from its SQL type. For example, varchar is seen as a string and currency is seen as numeric.
Uniqueness - When selected, determines the percentage of unique, distinct records out of the total number of records.
Percent Null - When selected, determines the percentage of null records out of the total number of records.
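Several of the Column Profiling metrics above are simple counts and statistics, and a minimal Python sketch can make the definitions concrete. It illustrates the definitions only and is not the product's implementation. The function names and dictionary keys are hypothetical, and the sketch makes these assumptions: None stands for a null value, a blank is an empty or whitespace-only string, nulls are not counted as a distinct value, and the standard deviation is the sample standard deviation.

```python
import math
import statistics


def profile_column(values):
    """Count-style metrics for one column; None represents a null value."""
    count = len(values)
    non_null = [v for v in values if v is not None]
    null_count = count - len(non_null)
    # Assumption: nulls are not counted as a distinct value.
    unique_count = len(set(non_null))
    # Assumption: a blank is an empty or whitespace-only string.
    blank_count = sum(1 for v in non_null if isinstance(v, str) and not v.strip())
    return {
        "count": count,
        "null_count": null_count,
        "non_null_count": len(non_null),
        "blank_count": blank_count,
        "unique_count": unique_count,
        "uniqueness": 100.0 * unique_count / count if count else 0.0,
        "percent_null": 100.0 * null_count / count if count else 0.0,
        # Every record holds a different non-null value.
        "primary_key_candidate": count > 0 and unique_count == count,
    }


def numeric_stats(numbers):
    """Mean, median, mode, standard deviation, and standard error of the mean."""
    n = len(numbers)
    if n == 0:
        raise ValueError("numeric_stats requires at least one number")
    modes = statistics.multimode(numbers)
    # Assumption: sample standard deviation (divides by n - 1).
    std_dev = statistics.stdev(numbers) if n > 1 else 0.0
    return {
        "mean": sum(numbers) / n,  # sum of all numbers divided by their count
        "median": statistics.median(numbers),
        # No single mode is reported for a multimodal distribution.
        "mode": modes[0] if len(modes) == 1 else None,
        "std_dev": std_dev,
        "std_error": std_dev / math.sqrt(n),
    }
```

For the column ["a", "b", "a", None, ""], this sketch reports a Count of 5, a Null Count of 1, a Blank Count of 1, a Unique Count of 3, a Uniqueness of 60.0, and a Percent Null of 20.0.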