Managing extensive data sets within a limited timeframe demands precision and speed. But ongoing compliance with the latest regulations requires regularly scanning vast data repositories, often packed with static files. To achieve accurate and high-speed discovery, you need to know how to fine-tune the right configurations.
Tasked with implementing a High Speed Discovery (HSD) set-up, Data Loss Prevention (DLP) administrators may find themselves in one of these three scenarios:
- You’re new to HSD: Exploring or implementing the solution for the first time.
- You’re achieving desired scan speeds: The HSD cluster has been provisioned and is delivering the expected scan performance, but you’re unsure if the cluster is overprovisioned and can be safely scaled back.
- You’re not achieving desired scan speeds: The HSD cluster is provisioned but underperforming.
No matter where you fall, Symantec’s HSD solution addresses all these scenarios.
Ensuring success with lightning-fast scanning
Introduced with Symantec DLP 16, this enhancement in data-at-rest scanning is designed to deliver speeds of 1 TB per hour or more. However, provisioning the solution is only the first step—optimization is just as important. With insights gleaned from the Scan Details report, administrators can refine their data protection strategies and unlock the full potential of lightning-fast scanning.
In simple terms, an HSD solution comprises a Data Node and one or multiple Worker Nodes (WN) intended to scan large volumes of data. However, various factors can impact the scan throughput.
Among many parameters, scan throughput primarily depends on:
- Network speed
- Repository load
- Data type
- Policy complexity
- Hardware specs
- Disk I/O
Understanding how these factors affect scan throughput helps properly size a Network Discover Cluster for your data repository. The Scan Details report generated by HSD scan offers valuable insights into these key indicators that can help inform adjustments for optimization.
What insights can the Scan Details report provide?
Network Discover scans follow four phases to detect and remediate the sensitive data: crawling, content fetching, detection and remediation. The downloadable Scan Details report details specific parameters for each phase that reflect its performance and health.