Access Control Entry
An Access Control Entry (ACE) is an entry of user information made to the Access Control Lists (ACLs) which is used to secure access to individual DataFlux Data Management Server objects.
Access Control Lists
Access Control Lists (ACLs) are used to secure access to individual DataFlux Data Management Server objects.
address verification
Address verification (validation) is the process of comparing a physical address to a reference database of known physical addresses so the original address can be standardized and corrected according to postal authority standards.
AIC
Analyze, Improve, Control (AIC) - DataFlux enables organizations to analyze, improve, and control their data from a single data quality integration platform. DataFlux tools and approaches can help you build a comprehensive set of business rules that can create a unified view of your enterprise data and enhance the effectiveness of CDI, CRM, ERP, legacy data migration, or compliance initiatives.
AMAS
Address Matching Approval System (AMAS) is the program the Australia Post administers to certify address verification software.
API
Application Programming Interface (API) is a set of software protocols, routines, and/or tools used when building software applications.
APO
Army/Air Force post office (APO) is an indication for the USPS.
ASCII
ASCII (American Standard Code for Information Interchange) is a character set based on the English alphabet
basic category
A basic category is a category that represents a single word. Basic categories are the basic building blocks of Grammar rules. Every basic category in a Grammar corresponds to a category in an ordered word list. For this reason, you should design Grammar rules in parallel with word-analysis logic.
batch processing
The application of data management routines to data source records in what are often very large groups, usually in processes that require no manual user intervention. Contrast with real-time processing.
business functions
These are expressions which are written in a generic manner so they can be reused from multiple rules or applications.
business rule
A conditional statement that tells a system running a business process how to react to a particular situation.
case definition
A set of logic used to accurately change the case of an input value, accounting for unique values that need to be case sensitive, such as abbreviations and business names.
CASS
Coding Accuracy Support System (CASS) is the program the United States Postal Service (USPS) administers to certify address verification software.
CBSA
Census Bureau Statistical Areas (CBSA)
CEDA
Cross-Environment Data Access (CEDA)
census string
The census string is a US Census Bureau designation for the boundary area in which the centroid exists. The census string contains state, county, and other census-type information.
centroid
A centroid is the approximate mathematical center of the ZIP or ZIP+4 boundary.
checks
These are built-in checks (expressions) that provide a template to the user to build common standard expressions.
chop table
A proprietary file type used by DataFlux as a lex table to separate characters in a subject value into more usable segments.
CMRA
US Commercial Mail Receiving Agency (CMRA)
CMSA
Consolidated Metropolitan Statistical Areas (CMSA)
Comments
Comments are text within a code segment that is not executed. Comments can be either C-style (starts with /* and ends with */) or C++ style (starts with // and continues to the end of a line).
Core Fields
Default logic to handle data such as name and address, which inform the identity management process.
CPC
Canadian Post Certification (CPC) is the SERP program administered by the Canadian Post. This is similar to the CASS certification administered by the USPS.
CRM
Customer Relationship Management (CRM)
custom metrics
Custom metrics may be used when the standard metrics do not contain the rules you need to accomplish the desired results.
dashboard
The dashboard is a Web-based view of the task grid and graphs in the Monitor Viewer.
data profiling
A discovery process that uncovers potential problem areas in large amounts of structured data.
data type
Not used in the sense of a database data type ("varchar" for instance) but used to describe sets of data values that follow certain rules and conventions. "Name" and "Address" are two examples of data types.
database
A collection of tables containing data that can be accessed easily by a computer system.
definition
An algorithm available to a DataFlux application.
derived category
A derived category is a category composed of one or more other categories. The makeup of a derived category is described using rules.
dfIntelliServer
dfIntelliServer provides a real-time or transactional mechanism for communicating with the MCRD through the Architect API. dfIntelliServer has several client libraries (including a Web services client) that can be called from a number of different applications in many different computing environments. dfIntelliServer allows one at a time queries and modifications to the MCRD. dfIntelliServer allows organizations to access Architect jobs through an API that can accept one group of data elements at a time rather than a complete table. This functionality takes advantage of the power of encapsulation of discreet chunks of work in Architect, so a programmer need only make one call to the client API to perform a related set of activities.
DPV
Delivery Point Validation (DPV) specifies if the given address is a confirmed delivery point as opposed to being within a valid range of house numbers on the street.
DSN
Data Source Name (DSN)
EEL
Expression Engine Language (EEL)
ERP
Enterprise Resource Planning (ERP)
ETL
Extraction, Transformation, and Loading
event
An event represents an action which should be taken when a rule fails. Actions can include sending email messages, storing the offending row in the repository, or executing an external process.
Expression
This is the DataFlux syntax used in the Business Rule Manager to build business rules.
field
Also known as a "variable" or a "column," a single piece of data in a database table. Database tables can have many fields. The user defines the fields. Each field has a unique identifier in the repository. From a data monitoring standpoint, the fields are not tied to any specific database or table but are bound at the time of execution to the current data set or row.
field set
A field set is a collection of fields that belong together. These usually represent a table of data and are used to aid in building rules and viewing results.
FIPS
Federal Information Processing Standards (FIPS) - A 5-digit number assigned to each county in the U.S. by the Census Bureau. The first 2 digits are the state code, and the last 3 digits are the county number.
FPO
Fleet post office (FPO) indication for USPS used for military personnel.
gender analysis
An algorithm that can determine the gender of persons by their names.
gender analysis definition
A set of logic used to determine the probable gender of a name or identity-type input string.
grammar
A proprietary file type used to store hierarchical patterns pertinent to a specific subject area.
group rule
A group rule evaluates and applies all rules to groups of data (for example, data grouped by state and the rules evaluated for each state).
historical metrics
A historical metric is available when a business rule is run a second time under the same report name. You can view and compare the last two reports.
identification analysis
An algorithm that can determine from a known set of options what type of data is represented by a particular subject value.
identification analysis definition
A set of logic used to identify an input string as a member of a redefined or user-defined value group or category.
inputs
Input fields are the fields where you apply the checks specified in the Rule Manager. This list includes all the fields you have defined in the Business Rule Manager, including the Output fields from custom metrics and any grouped by field.
job
The saved configuration settings for a particular task in a Data Management Studio application. You can run jobs interactively or combine them with other jobs and schedule the set of jobs to run on a particular date or time.
LACS
US Locatable Address Conversion Service (LACS) is a product/system in a different USPS product line that allows mailers to identify and convert a rural route address to a "city-style" address.
locale
The country of origin based on an address or country code.
locale guessing
A process that attempts to identify the country of origin of a particular piece of data based on an address, country code, or other field.
Master Data Management
MDM focuses on managing master data that is shared by several different systems and groups.
match
The process of identifying data strings that can be different representations of the same semantic information. For example, the strings Mr. Bob Brauer, Robert J., and Brauer can be considered to match each other.
match cluster
A set of records grouped together based on some commonality. Cluster IDs are numeric values used to refer to these clusters. You can append cluster IDs to records in a database to document matches.
match codes
The end result of passing data through a match definition. A normalized, encrypted string that represents portions of a data string that are considered to be significant with regard to the semantic identity of the data. Two data strings are said to "match" if the same match code is generated for each.
match definition
A set of logic used to generate a match code for a data string of a specific data type.
match value
A string representing the value of a single token after match processing.
MCD
Minor Civil Division (MCD)
MDM
Master Data Management (MDM) focuses on master data shared by several different systems and groups.
merge
The process of joining records and eliminating duplicate records from a table based on user-specified conditions and rules.
metadata
Information that describes the properties of data, for example when was last accessed or the size of the data value.
micropolitan
This term is used in US Census data and refers to a population area including a city with 10,000 to 50,000 residents and surrounding areas.
MSA
Metropolitan Statistical Areas (MSA) - The MSA code assigned by the Office of Management and Budget. Use this code as an index key in the MSA file.
namespace
A namespace is a unique container created to hold a logical grouping of identifiers.
Object
An object is anything that can be stored in the Data Management Studio Navigator and accessed by the Data Management Studio applications.
objects
Objects are individual jobs and services.
ODBC
Open Database Connectivity (ODBC) - an open standard application programming interface (API) for accessing databases.
OFAC
Office of Foreign Assets Control (OFAC) - Federal regulations related to the Patriot Act.
OLAP
Online Analytical Processing (OLAP)
organization
A company, university, or other type of institution. For example: IBM Corporation, University of Connecticut, or St. Joseph's Hospital
outputs
The output field is the field(s) used to apply the rule in the custom metric. Set your output field to serve as the field where the results from your custom metric are collected.
parse
The process of dividing a data string into a set of token values. For example: Mr. Bob Brauer, Mr. = Prefix, Bob = Given, Brauer = Family
parse definition
A name for a context-specific parsing algorithm. A parse definition determines the names and contents of the sub-strings that will hold the results of a parse operation.
pattern analysis definition
A regular expression library that forms the basis of a pattern recognition algorithm.
phonetics
An algorithm applied to a data string to reduce it to a value that will match other data strings with similar pronunciations.
PMB
A private mailbox (PMB) is categorized as a mailbox located at a mail center other than the post office or home.
PMSA
Principal Metropolitan Statistical Areas (PMSA)
Primary Key
Primary key is a unique identifier assigned to a database field. Social Security Numbers or a ISBNs are examples of possible primary keys.
QAS
QuickAddress Software (QAS)
QKB
The Quality Knowledge Base (QKB) is a collection of files and configuration settings that contain all DataFlux data management algorithms. The QKB is directly editable using Data Management Studio.
qMDM
Quality Master Data Management (qMDM)
Quality Knowledge Base Locales
The Quality Knowledge Base (QKB) locales contain the files, file relationships, and metadata needed to correctly parse, match, standardize, and otherwise process data.
RDBMS
Relational Database Management System (RDBMS) allows you to access data in a database in unique ways, such as adding tables and records, and joining tables.
RDI
Residential Delivery Indicator (RDI)
real-time processing
Processing a record or data one piece at a time as it enters a computer system, for financial transactions, for example. Contrast with batch processing.
record
Also called a "row" or "observation," one complete set of fields in a database table.
regular expression
A mini-language composed of symbols and operators that enables you to express how a computer application should search for a specified pattern in text.�A pattern may then be replaced with another pattern, also described using the regular expression language.
repository
A Data Management Studio repository is a hierarchical data storage mechanism.
row rule
A row rule evaluates every row of data passed into the Monitoring node.
RP
Software Evaluation and Recognition Program is a program the Canada Post administers to certify address verification software.
rule
A single rule can be either a row level rule or a data set level rule. A row level rule is applied to each row which enters the system while a data set level rule is applied to an entire data set or a portion of a data set.
rule set
A rule set is a set of one or more rules which are applied together as a group. Use a rule set when you find you are using a few rules together frequently.
SDK
Software Development Kit (SDK)
sensitivity
Regarding matching procedures, sensitivity refers to the relative tightness or looseness of the expected match results. A higher sensitivity indicates you want the values in your match results to be very similar to each other. A lower sensitivity setting indicates that you would like the match results to be "fuzzier" in nature.
SERP
The Software Evaluation and Recognition Program (SERP) is a program the Canadian Post administers to certify address verification software.
Service Oriented Architecture
Service Oriented Architecture (SOA) - All of the interaction with the master customer reference database is through a service-oriented architecture that enables any system to talk to the customer database and request or update information.
set rule
A set rule evaluates and applies rules to all of the input data completely (for example, it will evaluate all 1000 rows of data as a set).
SQL
Structured Query Language (SQL) is a language used to request information from database systems.
standard metrics
Standard metrics are pre-defined rules (expressions) set in Data Management Studio. Most of the time, this is enough to achieve the results for your job.
standardization definition
A set of logic used to standardize a string.
standardization scheme
A collection of transformation rules that typically apply to one subject area, like company name standardization or province code standardization.
standardize
The process of transforming a data string so each of the string's token values conforms to a preferred standard representation: IBM Corporation = IBM CORP; Mister Bob Brauer, Junior = MR BOB BRAUER JR.
Statement of Accuracy
Statement of Accuracy (SoA) is the form used for Canadian Post Certification (CPC) standards.
table
A table is a collection of records in a database.
tasks
Tasks contain the rules and the events that go with your individual rule. Tasks associate alert events with a rule that are triggered after a rule fails.
token
Used by DataFlux to designate the output strings of a parse process. The output string of a parse process. A word or atomic group of words with semantic meaning in a data string. A set of expected tokens is defined for each data type.
Unicode
An industry standard used to allow text and symbols from languages around the world.
unified
This is the version of the repository you are using. The term "unified" means the repository contains data for Data Management Studio Profile reports, Business Rules, and Data Monitoring results.
URI
Uniform Resource Identifier (URI) is a string of characters identifying a resource or file path.
USPS
United States Postal Service (USPS) provides postal services in the United States. The USPS offers address verification and standardization tools.
vocabulary
A proprietary file type used for categorizing data look-ups pertinent to a specific subject area.