Default classifications and datasets
Published 06 November 2023
PII Linked Information
Looking for a classification type, but can't find it among our defaults?
Our default classification types are designed to cover NIST's definition of linked information.
By masking these types of data, you minimize the chance of any other parts of the record being identifiable as the individual.
If you still want to assign classification types or datasets that do not appear in our defaults, see Custom configuration.
A predefined set of classification types and datasets are provided. For each column that the tool classifies as containing sensitive data, one of the classification types and corresponding datasets in the table below will be used.
For some datasets, it is useful to enable determinism by default. Those columns will be set to deterministic: true
in the Masking file by default.
Name of default classification type* | Deterministic by default | Example Output |
---|---|---|
Cities | Chesapeake | |
Countries | Maldives | |
CreditCardNumbers | 237904675622786 | |
DatesOfBirth | 1992-04-08** | |
EmailAddresses | ||
FamilyNames | Doe | |
FullNames | Jane Doe | |
GivenNames | Jane | |
IPAddresses | 247.302.475.685 | |
PassportNumbers | 737586078 | |
PhoneNumbers | 060 463-5989 | |
PostCodes | N8L 1FX | |
StreetAddresses | 734 Fieldston Boulevard | |
UKCounties | Greater Manchester | |
UKNationalInsuranceNumbers | SB018518C | |
USSocialSecurityNumbers | 989-11-9389 | |
ZipCodes | 67018 |
*The corresponding default dataset has the same name
**Dates are moved to a random point +/-90 days from the original by default
Other datasets
Some extra datasets are provided. These aren't used as part of automated classification, but are present to help with the masking of databases in specific situations.
Name of dataset | Generated value |
---|---|
NullValue | A NULL value. |
EmptyValue | An empty string. |
LoremIpsum | A random lorem ipsum string with a length up to the maximum width of the column. |
USStates | One of the US states. |