Updated on December 21, 2023

Identify and mask (Auto-detect)

This method automatically identifies and masks personal/sensitive data within specific sentences.

Endpoint and Authentication:

To use Protecto.ai’s token-based masking, you need to send a PUT request to the following endpoint:

Endpoint: https://trial.protecto.ai/api/vault/mask

For authentication, include the following token in the request headers:

Headers: {“Authorization”: “Bearer <AUTH_TOKEN>”}

Request Payload:

Here’s an example of a request payload to mask sensitive data:

Request Payload:
{"mask": [{ "value": "George Williams lives in Washington"}]}
Parameters for the Request Payload:
· value (string): Sensitive data value that needs to be masked.
Response :
Upon successful masking, you will receive a response with the masked data. Here’s an example of a response:
{
    "data": [
        {
            "value": "George Williams lives in Washington",
            "token_value": "<PER>8oGEcGVEkj 700ZSgo1pA</PER> lives in <ADDRESS>Yd0HtAN62H</ADDRESS>",
            "individual_tokens": [
                {
                    "value": "George Williams",
                    "pii_type": "PERSON",
                    "token": "8oGEcGVEkj 700ZSgo1pA",
                    "prefix": "<PER>",
                    "suffix": "</PER>"
                },
                {
                    "value": "Washington",
                    "pii_type": "GPE",
                    "token": "Yd0HtAN62H",
                    "prefix": "<ADDRESS>",
                    "suffix": "</ADDRESS>"
                }
            ]
        }
    ],
    "success": true,
    "error": {
        "message": ""
    }
}
 Response Parameters:
  • · token_value (string): Generated token replacing the sensitive data.
Advantage :
This approach is particularly useful when users may not be aware of all the sensitive data present in a given text or when dealing with large volumes of data where manual identification is impractical. Auto-detect masking offers the following advantages:
  1. Effortless Sensitive Data Identification: Users do not need to manually identify and specify sensitive data elements within the text. The system automatically detects patterns and formats indicative of sensitive information, such as names, phone numbers, credit card numbers, etc
  2. Comprehensive Data Protection: Auto-detect masking ensures that all instances of sensitive data within a dataset are masked, providing a comprehensive approach to data security. This is especially valuable for scenarios where users may not be aware of all the potential data types that need protection
  3. Reduced Human Error: Automated detection reduces the risk of human errors that can occur during manual identification and specification of sensitive data. It enhances accuracy and consistency in data protection processes.
  4. Time and Resource Savings: Auto-detect masking saves time and effort, particularly when dealing with large amounts of data. Users do not need to spend time identifying each instance of sensitive data, allowing them to focus on other tasks.

What are your feelings?
Scroll to Top