# Built-in Identifier Reference

Immuta comes with a pack of built-in identifiers that look for common data types. These identifiers were written by Immuta's research and development team and cannot be deleted or edited by users. However, users can add these built-in identifiers to their own domains and edit the tags applied by them.

Identifiers must match at least 90% of the sampled data to be tagged, with three exceptions noted below. See the [How competitive pattern analysis works](https://documentation.immuta.com/2025.1/configuration/manage-data-metadata/data-discovery/reference-guides/sdd-scoring) guide for more information about sampling and thresholds.

## Identifier descriptions and default resulting tags

<table><thead><tr><th width="249">Identifier</th><th>Description</th><th>Resulting tags from the default identifier</th></tr></thead><tbody><tr><td>ARGENTINA_DNI_NUMBER</td><td>Detects strings consistent with Argentina's National Identity (DNI) Number. Requires an eight-digit number with periods after the second and fifth digits.</td><td><ul><li><code>Discovered.Country.Argentina</code></li><li><code>Discovered.Entity.DNI Number</code></li></ul></td></tr><tr><td>AUSTRALIA_MEDICARE_NUMBER<br><a data-footnote-ref href="#user-content-fn-1"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects numeric strings consistent with Australian Medicare Number. Requires a ten- or eleven-digit number. The starting digit must be between 2 and 6, inclusive. Spaces must be placed between the fourth and fifth and ninth and tenth digits. Optional eleventh digit separated by a <code>/</code> or a space. <a data-footnote-ref href="#user-content-fn-2">Examples</a></td><td><ul><li><code>Discovered.Country.Australia</code></li><li><code>Discovered.Entity.Medicare Number</code></li></ul></td></tr><tr><td>AUSTRALIA_PASSPORT<br><a data-footnote-ref href="#user-content-fn-3"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with the Australian Passport number. A string of 8 or 9 characters is required, with a starting uppercase character (A, B, C, D, E, F, G, H, J, L, M, N, R, X, or U) or a two-character alphabetic prefix (P followed by A, B, C, D, E, F, U, W, X, or Z) followed by seven numeric digits. <a data-footnote-ref href="#user-content-fn-4">Examples</a></td><td><ul><li><code>Discovered.Country.Australia</code></li><li><code>Discovered.Entity.Passport</code></li></ul></td></tr><tr><td>BELGIUM_NATIONAL_ID_CARD_NUMBER</td><td>Detects numeric strings consistent with Belgium's National ID card. Requires a twelve-digit number with a required hyphen (<code>-</code>) between the third and fourth digits. Allows for an optional hyphen between the tenth and eleventh digits.</td><td><ul><li><code>Discovered.Country.Belgium</code></li><li><code>Discovered.Entity.National ID Card Number</code></li></ul></td></tr><tr><td>BELGIUM_NATIONAL_REGISTRATION_NUMBER<br><mark style="color:green;"><strong>New</strong></mark></td><td>Detects numeric strings consistent with Belgium's National Registration Number. Requires 11 characters in the form YY.MM.DD-NNN-XX, where YY.MM.DD corresponds to birth date, NNN is a number, and XX is a checksum digit. <a data-footnote-ref href="#user-content-fn-5">Example</a></td><td><ul><li><code>Discovered.Country.Belgium</code></li><li><code>Discovered.Entity.National Registration Number</code></li></ul></td></tr><tr><td>BITCOIN_INVOICE_ADDRESS</td><td>Detects strings consistent with the following Bitcoin Invoice Address formats: P2PKH, P2SH, and Bech32.</td><td><ul><li><code>Discovered.Entity.CRYPTO</code></li></ul></td></tr><tr><td>BRAZIL_CPF_NUMBER<br><a data-footnote-ref href="#user-content-fn-6"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects a numeric string consistent with Brazil's CPF (Cadastro de Pessoas Físicas) number. An eleven-digit numeric string with optional non-numeric separators (<code>.</code>, <code>-</code>, or space) after the third, sixth, and ninth digits. <a data-footnote-ref href="#user-content-fn-7">Examples</a></td><td><ul><li><code>Discovered.Country.Brazil</code></li><li><code>Discovered.Entity.CPF Number</code></li></ul></td></tr><tr><td>CANADA_BC_PHN</td><td>Detects numeric strings consistent with British Columbia's Personal Health Number (PHN). Requires a ten-digit numeric string with hyphens (<code>-</code>) or spaces after the fourth and seventh digits.</td><td><ul><li><code>Discovered.Country.Canada</code></li><li><code>Discovered.Entity.British Columbia Health Network Number</code></li></ul></td></tr><tr><td>CANADA_OHIP</td><td>Detects alphanumeric strings consistent with Ontario's Health Insurance Plan (OHIP). Requires a twelve-digit capitalized alphanumeric code. Optional hyphens (<code>-</code>) or spaces can appear after the fourth, seventh, and tenth digits.</td><td><ul><li><code>Discovered.Country.Canada</code></li><li><code>Discovered.Entity.Ontario Health Insurance Number</code></li></ul></td></tr><tr><td>CANADA_PASSPORT<br><a data-footnote-ref href="#user-content-fn-8"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with the Canadian Passport Number format. Allows for two formats. One format requires two capital letters followed by six digits. The other format requires one letter, followed by six digits, and ends in two letters. <a data-footnote-ref href="#user-content-fn-9">Examples</a></td><td><ul><li><code>Discovered.Country.Canada</code></li><li><code>Discovered.Entity.Passport</code></li></ul></td></tr><tr><td>CANADA_QUEBEC_HIN</td><td>Detects alphanumeric strings consistent with Quebec's Health Insurance Number (HIN). Requires four alphabetic characters followed by an optional space or hyphen (<code>-</code>), and then eight digits with an optional hyphen or space after the fourth digit.</td><td><ul><li><code>Discovered.Country.Canada</code></li><li><code>Discovered.Entity.Quebec Health Insurance Number</code></li></ul></td></tr><tr><td>COUNTRY<br><mark style="color:green;"><strong>New</strong></mark></td><td>Detects strings consistent with the names of all countries in the world. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Location</code></li></ul></td></tr><tr><td>CREDIT_CARD_NUMBER<br><a data-footnote-ref href="#user-content-fn-10"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with a credit card number with prefixes matching major credit card companies.</td><td><ul><li><code>Discovered.Entity.Credit Card Number</code></li></ul></td></tr><tr><td>DATE<br><a data-footnote-ref href="#user-content-fn-11"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with dates in <a data-footnote-ref href="#user-content-fn-12">over 30 different formats</a> or date type: date, date+time, or timestamp. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Date</code></li></ul></td></tr><tr><td>DOMAIN_NAME<br><a data-footnote-ref href="#user-content-fn-13"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings that begin with a letter and are no more than 225 characters. A full domain can have one to four labels separated by a <code>.</code>. Each label can be one to 63 alphanumeric characters long. And each label after the first must be in the dictionary list of possible labels. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Domain Name</code></li></ul></td></tr><tr><td>EMAIL_ADDRESS</td><td>Detect strings consistent with an email address. Usernames are required to be fewer than 255 characters, follow by <code>@</code>, a domain of fewer than 255 characters, and a top level domain of between 2 and 20 characters.</td><td><ul><li><code>Discovered.Entity.Electronic Mail Address</code></li></ul></td></tr><tr><td>ETHNIC_GROUP</td><td>Detects strings consistent with the US Census <a data-footnote-ref href="#user-content-fn-14">race designations</a>. This identifier allows for dashes to be used in place of spaces and is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Ethnic Group</code></li></ul></td></tr><tr><td>FDA_CODE<br><a data-footnote-ref href="#user-content-fn-15"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects a string consistent with a drug or ingredient registered with the Food and Drug Administration (FDA). Must start with between 4 to 5 digits, followed by a hyphen, followed by 3 to 4 digits, followed by a hyphen, and finishing with 1 to 2 digits.</td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.FDA Code</code></li></ul></td></tr><tr><td>FINANCIAL_INSTITUTIONS <mark style="color:green;"><strong>New</strong></mark></td><td>Detects strings consistent with names of financial institutions based on lists provided by the FDIC and OCC, includes alternative names.</td><td><ul><li><code>Discovered.Entity.Financial Institutions</code></li></ul></td></tr><tr><td>FRANCE_NIR<br><a data-footnote-ref href="#user-content-fn-16"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects numeric strings consistent with France's National ID number (Numéro d'Inscription au Répertoire). Requires a fifteen-digit numeric string. An optional hyphen (-) or space can appear after the 13th digit.</td><td><ul><li><code>Discovered.Country.France</code></li><li><code>Discovered.Entity.NIR</code></li></ul></td></tr><tr><td>FRANCE_PASSPORT</td><td>Detects alphanumeric strings consistent with the French Passport number. Requires two numbers followed by two uppercase letters and ends with five digits.</td><td><ul><li><code>Discovered.Country.France</code></li><li><code>Discovered.Entity.Passport</code></li></ul></td></tr><tr><td>GENDER<br><a data-footnote-ref href="#user-content-fn-17"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with <a data-footnote-ref href="#user-content-fn-18">gender types</a> and common abbreviations. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Gender</code></li></ul></td></tr><tr><td>GERMANY_DRIVERS_LICENSE_NUMBER</td><td>Detects alphanumeric strings consistent with Germany's driver's license number. Requires an eleven-element string of the format CDDCCCCCCDC where C is an uppercase Latin letter and D is a numeric digit.</td><td><ul><li><code>Discovered.Country.Germany</code></li><li><code>Discovered.Entity.Drivers License Number</code></li></ul></td></tr><tr><td>GREAT_BRITAIN_DRIVERS_LICENSE<br><a data-footnote-ref href="#user-content-fn-19"><mark style="color:green;"><strong>New</strong></mark></a></td><td>Detects alphanumeric strings consistent with the United Kingdom's driver's license number. Requires either a 16- or 18-character string. The first five characters represent the driver's surname, padded with <code>9</code>s, followed by a single digit for decade of birth, two digits for month of birth (incremented by 50 for female drivers), two digits for day of birth, one digit for year of birth, two letters, an arbitrary digit, and two digits. Two additional digits can be present for each license issuance. <a data-footnote-ref href="#user-content-fn-20">Examples</a></td><td><ul><li><code>Discovered.Country.UK</code></li><li><code>Discovered.Entity.Drivers License Number</code></li></ul></td></tr><tr><td>IBAN_CODE</td><td>Detects strings consistent with an International Bank Account Number (IBAN). Requires a string in the form ZZ-DD-BBAN, where ZZ is a country code, DD is two numeric digits, and BBAN is a Basic Bank Account Number comprising two to seven groups of three to five uppercase alphanumeric characters, optionally separated by space or dash, and optionally followed by a final group of length one to three.</td><td><ul><li><code>Discovered.Entity.IBAN Code</code></li></ul></td></tr><tr><td>ICD10_CODE<br><a data-footnote-ref href="#user-content-fn-21"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with codes from the International Statistical Classification of Diseases and Related Health Problems (ICD), as drawn from the Clinical Modification lexicon from the year 2025. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.ICD10 Code</code></li></ul></td></tr><tr><td>ICD_10_PCS<br><mark style="color:green;"><strong>New</strong></mark></td><td>Detects strings consistent with procedure codes from the International Statistical Classification of Diseases and Related Health Problems (ICD), as drawn from the Clinical Modification lexicon from 2020. <a data-footnote-ref href="#user-content-fn-22">Example</a></td><td><ul><li><code>Discovered.Entity.ICD10 Procedure Code</code></li></ul></td></tr><tr><td>IMEI_HARDWARE_ID<br><a data-footnote-ref href="#user-content-fn-23"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with an International Mobile Equipment Identity (IMEI) number. Must contain 15 or 16 digits with optional hyphens or spaces after the 2nd, 8th, and 14th digits. <a data-footnote-ref href="#user-content-fn-24">Examples</a></td><td><ul><li><code>Discovered.Entity.IMEI</code></li></ul></td></tr><tr><td>IP_ADDRESS</td><td>Detects IP Addresses in the V4 and V6 formats. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.IP Address</code></li></ul></td></tr><tr><td>LOCATION</td><td>Detects ISO3166 formatted locations. This identifier must match at least 80% of the data sampled.</td><td><ul><li><code>Discovered.Entity.Location</code></li></ul></td></tr><tr><td>MAC_ADDRESS<br><a data-footnote-ref href="#user-content-fn-25"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with a Media Access Control (MAC) address. Must contain twelve hexadecimal digits, with every two digits separated by a colon or hyphen. <a data-footnote-ref href="#user-content-fn-26">Examples</a></td><td><ul><li><code>Discovered.Entity.MAC Address</code></li></ul></td></tr><tr><td>NAICS_CODE<br><mark style="color:green;"><strong>New</strong></mark><br></td><td>Detects strings consistent with North American Industry Classification System (NAICS). A two-digit number represents a basic sector and each preceding digit represents a more specific sub sector with a maximum of six digits. <a data-footnote-ref href="#user-content-fn-27">Examples</a></td><td><ul><li><code>Discovered.Entity.NAICS Code</code></li></ul></td></tr><tr><td>PERSON_NAME<br><a data-footnote-ref href="#user-content-fn-28"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with a dictionary of people's names. The name dictionary is US-centric with person names drawn from the US Social Security database, covering 80% of the U.S. population. This identifier must match at least 45% of the data sampled. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Person Name</code></li></ul></td></tr><tr><td>PHONE_NUMBER<br><a data-footnote-ref href="#user-content-fn-29"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with telephone numbers. Primarily looks for strings consistent with the United States telephone numbers naming convention. Optional area codes allowed.</td><td><ul><li><code>Discovered.Entity.Telephone Number</code></li></ul></td></tr><tr><td>POSTAL_CODE<br><a data-footnote-ref href="#user-content-fn-30"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with a valid US Zip code with an optional +4 separated by a dash. Only valid five-digit zip codes are detected. This identifier is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Postal Code</code></li></ul></td></tr><tr><td>SEC_STOCK_TICKER <mark style="color:green;"><strong>New</strong></mark></td><td>Detects strings consistent with the stock tickers recognized by the U.S. Securities and Exchange Commission (SEC).</td><td><ul><li><code>Discovered.Entity.Stock Ticker Symbol</code></li></ul></td></tr><tr><td>SPAIN_NIF_NUMBER<br><a data-footnote-ref href="#user-content-fn-31"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with Spain's Tax Identification number. Requires a string with nine alphanumeric characters. Requires either eight digits followed by an optional hyphen or space and a single uppercase letter or the initial character must be X, Y, or Z, followed by an optional dash or space, seven numeric digits, followed by an optional dash or space, and finally, by a single uppercase letter. <a data-footnote-ref href="#user-content-fn-32">Examples</a></td><td><ul><li><code>Discovered.Country.Spain</code></li><li><code>Discovered.Entity.NIF Number</code></li></ul></td></tr><tr><td>SPAIN_PASSPORT</td><td>Detects string consistent with Spain's Passport Number. Requires a eight- or nine-character string starting with either two or three uppercase letters followed by six numeric digits.</td><td><ul><li><code>Discovered.Country.Spain</code></li><li><code>Discovered.Entity.Passport</code></li></ul></td></tr><tr><td>SWIFT_CODE</td><td>Detects alphanumeric strings consistent with a SWIFT code (or Bank Identifier Code (BIC)) format. Requires values consistent with AAAAAACCDDD, where A is an uppercase letter, C is an uppercase letter or numeric digit, and DDD is an optional three-character sequence of uppercase letters or numeric digits.</td><td><ul><li><code>Discovered.Entity.Swift Code</code></li></ul></td></tr><tr><td>TIME<br><a data-footnote-ref href="#user-content-fn-33"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with times in various formats or data type: time. If date is included in the time, it will not match. Use the <code>DATE</code> identifier instead.</td><td><ul><li><code>Discovered.Entity.Date</code></li></ul></td></tr><tr><td>UK_NATIONAL_INSURANCE_NUMBER<br><a data-footnote-ref href="#user-content-fn-34"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects alphanumeric strings consistent with the United Kingdom's National Insurance Number. Requires a nine-character string. The first two digits must be uppercase letters, followed by an optional space, then six digits with optional spaces or hyphens (<code>-</code>) every two digits, ending with A, B, C, or D.</td><td><ul><li><code>Discovered.Country.UK</code></li><li><code>Discovered.Entity.National Insurance Number</code></li></ul></td></tr><tr><td>URL<br><a data-footnote-ref href="#user-content-fn-35"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects string consistent with a URL. String must begin with a common schema, followed a string and ending with a top level domain of no more than 128 alphanumeric characters.</td><td><ul><li><code>Discovered.Entity.URL</code></li></ul></td></tr><tr><td>US_DEA_NUMBER</td><td>Detects alphanumeric strings consistent a Drug Enforcement Administration (DEA) number is assigned to a health care provider. It must have a length of nine characters. The first two digits must be uppercase alphanumeric characters, and the last seven characters are numeric digits. The first character may not be <code>I</code>, <code>N</code>, <code>O</code>, <code>Q</code>, <code>V</code>, <code>W</code>, <code>Y</code>, or <code>Z</code>.</td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.DEA Number</code></li></ul></td></tr><tr><td>US_EMPLOYER_IDENTIFICATION_NUMBER</td><td>Detects numeric string consistent United States Employer Identification Number (EIN). Strings must contain nine digits with a hyphen after the second digit.</td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.Employer ID Number</code></li></ul></td></tr><tr><td>US_HEALTHCARE_NPI<br><a data-footnote-ref href="#user-content-fn-36"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects 10-digit numeric strings consistent with US National Provider Identifier (NPI). It must either start with 80840 followed by a 1 or 2, or it must begin with a 1 or 2.</td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.Healthcare NPI</code></li></ul></td></tr><tr><td>US_PERSON_FULL_NAME<br><mark style="color:green;"><strong>New</strong></mark></td><td>Detects strings consistent with a person's {first name} space {last name}. Uses the same names from the PERSON_NAME identifier. This identifier must match at least 20% of the data sampled and is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Person Name</code></li></ul></td></tr><tr><td>US_PREPARER_TAXPAYER_IDENTIFICATION_NUMBER</td><td>Detects strings consistent with a Preparer Taxpayer ID number. Strings must have nine characters, starting with a <code>P</code> that is followed by eight digits.</td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.Preparer Taxpayer ID Number</code></li></ul></td></tr><tr><td>US_SOCIAL_SECURITY_NUMBER<br><a data-footnote-ref href="#user-content-fn-37"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with a US Social Security Number. Strings must contain nine digits and comprise three parts: the three left-most digits designating the area number, the middle two digits designating the group number, and the four right-most digits designating the serial number. For a column to be tagged, none of these parts can contain all zeroes, and area numbers must not be 666 or in the range of 900-999. <a data-footnote-ref href="#user-content-fn-38">Examples</a></td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.Social Security Number</code></li></ul></td></tr><tr><td>US_STATE<br><a data-footnote-ref href="#user-content-fn-39"><mark style="color:blue;"><strong>Improved</strong></mark></a></td><td>Detects strings consistent with either a full name or two-letter abbreviation of a US state or territory.</td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.State</code></li></ul></td></tr><tr><td>US_STREET_ADDRESS<br><a data-footnote-ref href="#user-content-fn-40"><mark style="color:green;"><strong>New</strong></mark></a></td><td>Detects strings consistent with U.S. street addresses. Requires the street naming convention of {address_number} {street_name} {unit number (optional)} with an optional road suffix after the street name. The maximum length for street name is 20 alphanumeric characters. This identifier must match at least 80% of the data sampled and is case-insensitive.</td><td><ul><li><code>Discovered.Entity.Location</code></li></ul></td></tr><tr><td>VEHICLE_IDENTIFICATION_NUMBER</td><td>Detects strings consistent with Vehicle Identification Numbers. A valid World Manufacturer Identifier is required.</td><td><ul><li><code>Discovered.Country.US</code></li><li><code>Discovered.Entity.Vehicle Identifier or Serial Number</code></li></ul></td></tr></tbody></table>

[^1]: Now, the last separator is optional and can be a space.

[^2]: `6609 84432 7`, `2773 03455 4`, `2827 37441 6`

[^3]: Previously, only included the "N, E, D, F, A, C, U, X" passport types. Now, includes the "B, G, H, J, L, M, N, R, T" passport types.

[^4]: `J3578114`, `R1044589`, `G8213624`

[^5]: `24.01.01-123-45`

[^6]: Previously, allowed either `^` or a digit. Now, the separator is either a `.` or space for the first two separators and then either a `-` or space for the third.

[^7]: `400197.82055`, `992.35316100`, `260645.20398`

[^8]: Now, a new variant is detected with one letter, six numbers, and ending in two letters.

[^9]: `I158861QF`, `S744523IN`, `C164246TN`

[^10]: Previously, was restricted to card numbers than can currently be issued. Now, can detect credit card numbers that were formerly issued. Additionally, China Union Pay, InterPayment, and UATP formats are detected.

[^11]: Previously, only detected on string data types. Now, will match on all date data types in addition to strings that match the regex. Additionally, the number of detected date formats in string columns has increased.

[^12]: In addition to day/month/year date formats, the following formats will also be detected: Syslog, common log, SQL server, ISO 8601 UTC, and ISO week date.

[^13]: Now, must begin with a letter.

[^14]: `White`, `Black`, `African American`, `American Indian`, `Alaska Native`, `Asian`, `Native Hawaiian`, and `Pacific Islander`

[^15]: Previously, detected a string starting with 4 to 6 digits, followed by a hyphen, followed by 3 to 4 digits, followed by a hyphen, and finishing with one to two digits. Now, detects three more-specific patterns.

[^16]: Now, the separator is required and all month digits in the ID must be valid.

[^17]: Previously detected single-letter abbreviations of common genders: `M`, `F`, and `O` (for other). Now, those single-letter abbreviations have been removed.

[^18]: agender, androgyne, bi-gender, bigender, cis-gender, cis, cisgender, enby, female, femme, fluid, gender-binary, gender-expansive, gender-fluid, gender non-binary, gender-non-conforming, gender-nonconforming, gender-queer, genderbinary, genderfluid, genderless, gendernonconforming, gender-outlaw, genderqueer, intersex, male, masc, masculine-of-center, nb, nogender, non-binary, nonbinary, nonconforming, neutral gender, neutrois, null-gender, omnigender, other, pangender, polygender, queer, questioning, trans, transgender, two-spirit, uncertain, unsure

[^19]: Previously named `UK_DRIVERS_LICENSE_NUMBER`. Now, renamed because it does not detect license numbers from Northern Ireland.

[^20]: `36234895`, `56114513`, `23878877`

[^21]: Previously, the codes were from 2020. Now, detects codes from the 2025 ICD master list.

[^22]: `0WWR81Z`

[^23]: Previously, only one digit was allowed at the end. Now, allows for either one or two digits at the end.

[^24]: `4330041011360011`, `5203284653967219`, `5478020339989783`

[^25]: Combined two previous identifiers: `MAC_ADDRESS_LOCAL` and `MAC_ADDRESS`.

[^26]: `eB-22-Ae-D1-7E-d8`, `9d-99-CA-E2-Db-FA`, `42-ac-FE-7A-E9-9A`

[^27]: `33`, `456`, `3425`, `43643`, `235780`

[^28]: Previously, detected common names that are also common words (Grant, Brown, White, Young, etc.). Now, most common words have been removed. Additionally, the variety of names has been increased.

[^29]: Previously, detected `#` extensions.

[^30]: Now, the first five digits must be a valid US zip code.

[^31]: Previously, detected only the first pattern with eight digits, optional separator, and ending in a character. Now, detects a second variant as described in the description.

[^32]: `X5547922I`, `X3453197C`, `Y7272972A`

[^33]: Previously, only detected on string data types. Now, will match on time data types in addition to strings that match the regex. Additionally, detects Redshift time formats.

[^34]: Now, the final digit must be `A`, `B`, `C`, or `D`.

[^35]: Previously, allowed for non-Latin characters in a URL and required specific schemas. Now, allows for most commonly accepted schemas.

[^36]: Previously, detected a 15- and 10-digit variant. Now, detects 10-digit variants beginning with specific digits.

[^37]: Previously, detected invalid US SSNs with digit groups of all zeros, or beginning with `666`, `734`, `998`, and other invalid area numbers.

[^38]: `884-322573`, `88538-0563`, `53941-7974`

[^39]: Now, detects US territory abbreviations.

[^40]: Previously named `STREET_ADDRESS`.
