concept - DataStaple

A DataStaple is a definition consisting of a group of characteristics that are used to interpret the priority and relevance of similar data.

A DataStaple defines how to identify unique data, which data source to value most in order to ensure integrity, and how to merge data to maximise value.

Groups of DataStaples can be applied to the input data to maximise data quality and minimise data loss. Each DataStaple in a group has a weighting that determines the sequence in which it is processed: DataStaples are processed in descending order of weighting, from highest to lowest.
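
The processing order described above can be illustrated with a minimal sketch. The class layout, the field names (name, weighting) and the example staple names are assumptions made for illustration only; they are not taken from any DataStaple implementation.

```python
from dataclasses import dataclass


@dataclass
class DataStaple:
    # Hypothetical fields: only the idea of a weighting comes from the text.
    name: str
    weighting: int


def processing_order(group):
    """Return the staples sorted from highest weighting to lowest."""
    return sorted(group, key=lambda staple: staple.weighting, reverse=True)


# Hypothetical group of three staples with illustrative weightings.
group = [
    DataStaple("email_match", 10),
    DataStaple("name_and_postcode", 50),
    DataStaple("phone_match", 30),
]

ordered_names = [staple.name for staple in processing_order(group)]
print(ordered_names)  # ['name_and_postcode', 'phone_match', 'email_match']
```

In this sketch the staple with weighting 50 is applied first and the one with weighting 10 last.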

The matching characteristics of a DataStaple can be simple logic operations (=, >, <, >=, <=) or fuzzy, allowing partial data matches to be considered.
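
As a rough sketch of the two kinds of matching, the example below maps the listed operators onto Python's operator module and uses difflib.SequenceMatcher from the standard library as one possible fuzzy measure. The function names, the 0.8 threshold and the choice of similarity measure are assumptions for illustration; the actual fuzzy algorithm used by a DataStaple is not specified here.

```python
import operator
from difflib import SequenceMatcher

# The simple logic operations listed in the text, mapped to Python functions.
OPERATORS = {
    "=": operator.eq,
    ">": operator.gt,
    "<": operator.lt,
    ">=": operator.ge,
    "<=": operator.le,
}


def simple_match(left, op, right):
    """Apply one of the simple logic operations to two values."""
    return OPERATORS[op](left, right)


def fuzzy_match(left, right, threshold=0.8):
    """Treat two strings as a match when their similarity ratio reaches
    the threshold (hypothetical value), ignoring letter case."""
    ratio = SequenceMatcher(None, left.lower(), right.lower()).ratio()
    return ratio >= threshold


print(simple_match(5, ">=", 5))             # True
print(fuzzy_match("Jonathan", "Jonathon"))  # True (ratio 0.875)
print(fuzzy_match("Smith", "Jones"))        # False (ratio 0.2)
```

A lower threshold accepts looser partial matches at the cost of more false positives, so the value would need tuning against real data.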

A simple DataStaple may check for a duplicate first name, surname and post code: if all three match, the records are considered duplicates and the data will be merged according to the data value.
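
The duplicate check and merge in this example might be sketched as follows. Everything beyond the three matched fields is an assumption: the record layout, the source names, the source_value ranking, the whitespace and case normalisation, and the rule that the more valued source wins while gaps are filled from the other record.

```python
KEY_FIELDS = ("first_name", "surname", "post_code")


def normalise(value):
    """Assumed normalisation: drop whitespace and ignore letter case."""
    return "".join(str(value).split()).lower()


def is_duplicate(a, b):
    """Records are duplicates only if all three key fields match."""
    return all(normalise(a[f]) == normalise(b[f]) for f in KEY_FIELDS)


def merge(a, b, source_value):
    """Prefer fields from the more valued source; keep the other record's
    value wherever the preferred record has nothing to offer."""
    preferred, other = sorted(
        (a, b), key=lambda r: source_value[r["source"]], reverse=True
    )
    merged = dict(other)
    merged.update({k: v for k, v in preferred.items() if v not in (None, "")})
    return merged


# Hypothetical records and source ranking (higher number = more valued).
crm = {"source": "crm", "first_name": "Ann", "surname": "Lee",
       "post_code": "AB1 2CD", "email": "", "phone": "01234 567890"}
web = {"source": "web", "first_name": "ann", "surname": "LEE",
       "post_code": "ab12cd", "email": "ann@example.com", "phone": "00000"}
source_value = {"crm": 2, "web": 1}

if is_duplicate(crm, web):
    merged = merge(crm, web, source_value)
    print(merged["phone"], merged["email"])  # 01234 567890 ann@example.com
```

Here the phone number comes from the more valued source, while the email address is retained from the less valued record because the preferred record has none, so no data is lost in the merge.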
