Database - a collection of interrelated tables / datasets / files.
Database Integrity - generally refers to the technical / relational condition of a database. (Codd).
Keys (Primary / secondary) - the way in which records are inter linked / referenced.
Records - a collection of data items identified by a key or keys.
Data cleaning - the process of ensuring that a database is valid in technical and / or literal terms.
Data Mining - the process of extracting meaningful information from a database / collection of related databases or data mart / warehouse.
Data Mart / Warehouse - an aggregated layer of datasets abstracted from multiple sources eg operational systems, external data, management reporting systems.
Matching - the process of determining automatically the probability of one record from one database being equivalent to a similar record from another database where unique keys are missing or damaged.
RDBs - Relational Databases (most modern databases are relational).
|