Skip to content

Setting Up Discovery Scanning

This section guides you through setting up and configuring Discovery for efficient and customizable scanning of your data sources.

Configure scan settings

  • Go to Discovery > Scan Setup.
  • Configure the following settings:
    • Application Status: Enable or disable applications for scanning.
    • System Classification: Set the global classification threshold for auto-classification.
    • Minimum Review: Set the minimum score for manual review of tagged resources.
    • Reduce Score: Adjust the score for empty data columns.
    • Rescan Type: Choose between incremental or full scans.

For more details, refer to the Scan Configuration documentation.

Managing Dictionaries, Models and Rules

Discovery relies on different types of dictionaries and heuristic models to match data and apply classification rules. These dictionaries and models are essential for identifying sensitive data patterns and formats.

  • Keyword dictionaries match based on column names or metadata.
  • Lookup dictionaries match values against predefined entries.
  • Patterns use regex to match formats like Emails, phone numbers, etc.
  • Heuristic models apply logic to identify structured data (e.g., SSNs, credit card numbers).

You can manage all of these under Discovery Menu:

  • Create new dictionaries or patterns.
  • Customize existing models or create new ones.
  • Create rules that combine dictionaries, patterns, and models for classification.

For more details, refer to the Classification Techniques documentation.

5. Defining Classification Rules

Rules tell Discovery how to identify specific types of sensitive data.

  • Go to Discovery > Rules.
  • Use existing rules or create new ones.
  • Rules use dictionaries, patterns, and models to detect data based on column names, values, or both.
  • Configure Discovery > Scan Setup to specify the minimum score for review or auto-classification, or adjust the score for certain conditions.

8. Validating and Monitoring

After setup:

  • View progress and status under Discovery > Scan Status.
  • Check classification results via Data Inventory.
  • Validate rule effectiveness by reviewing sample matches and tag distributions.
  • Accept or reject tags based on scan results.

Discovery setup is flexible and supports advanced use cases including:

  • Supporting multiple data sources and formats
  • Integration with classification techniques for improved accuracy

Once the setup is complete, Discovery continuously scans your environment and applies tagging logic based on your defined rules and classification models.


Comments