DataFlux Data Management Studio 2.6: User Guide
You can use the Cluster tab to review and select the clusters in the entity resolution file. The tab contains the following tabs:
Clusters - Displays the list of clusters contained in the entity resolution file. The list contains the following columns:
The toolbar at the top of the clusters list enables you perform the following tasks:
Cluster Analysis Pane - Displays graphic views of the clusters in the entity resolution file. The toolbar enables you to select either a bubble plot view or a bar chart view. Note that you can put your cursor over the data points in the bubble plot or the bars in the bar chart to see more information.
Details Pane - Displays detailed information about a selected cluster. The Details pane contains the following tabs:
You can click Show Details in the toolbar adjacent to the menu bar to display the Details pane.
Confidence values are usually derived from scores that are generated by the Match Code node in a data job, but only if the underlying Match definition is configured to give scores. The CI 2011A QKB is the first production QKB that supports Match definitions that can be configured to give scores.
A Confidence value shown in the Cluster tab is the lowest confidence value among the records in the cluster (i.e. for that match code). For example, assume that records 1 and 2 generate the following match codes and scores:
The resulting clusters will be as follows:
Documentation Feedback: yourturn@sas.com
|
Doc ID: dfU_EntityResViewCluster.html |