PII detection
The pii detector
The pii detector identifies personally identifiable information (PII) and returns them together with their positions in the text as linked data in JSON-LD format (see also https://json-ld.org/).
The detector's output allows you to determine if a document contains potentially sensitive data and possibly create a new version of the text in which the PII is de-identified.
These are the information types pii can detect:
| Information type | Notes |
|---|---|
| Personal attributes | Of a real person or a fictional character |
| Postal address | |
| Bank account | |
| IP address | |
| E-mail address | |
| URL | |
| Financial product | Credit or debit card |
| Phone number |
These are the properties of each information type:
Useful resources:
- How to request information detection API resources.
- How to interpret
piidetector output.
Tip
To play with the JSON-LD object and get ideas for its possible uses, take a look at the JSON-LD playground site, where you can paste the JSON-LD object returned by the pii detector.