Dataguise enhances DG for Hadoop

Dataguise has announced DG for Hadoop™ v4.3. Now the first and only solution of its kind to provide both masking and selective encryption for sensitive data in major Hadoop distributions, DG for Hadoop allows organizations to determine the most appropriate remediation technique based on privacy requirements. The new version also delivers expanded capabilities, including contextual-based search to identify sensitive data in unstructured files, simplified management with automatic notifications and detailed audit reporting to demonstrate compliance.

  • Wednesday, 3rd April 2013 Posted 11 years ago in by Phil Alsop

Organizations globally are exploring the advantages of Hadoop and its ability to enable the analysis of data patterns previously inaccessible. However, compliance and security officers are mindful of the sensitive information located in these large data repositories and the lack of controls to prevent unauthorized access. Traditional approaches to securing Hadoop fail because they are too complex, expensive, and incapable of selectively protecting the data that matters in these large and diverse environments. DG for Hadoop provides an efficient, economical and effective method of determining where and how to secure sensitive data in Hadoop.


DG for Hadoop v4.3, part of the DgSecure™ suite of products, identifies the unique characteristics of Big Data, processing multiple terabytes of structured, unstructured and semi-structured data in only a few hours to protect sensitive data at the source, during ingestion and in the Hadoop Distributed File System (HDFS). Key features available in the latest generation software include:
• Selective encryption: Complementing Dataguise masking technology for configurations where data mining needs to operate on actual data values. DG for Hadoop uses symmetric key based encryption of data and also encrypts the encryption keys themselves for stronger security.
• Contextual-based data discovery: The Dataguise contextual-based data discovery capability uses a "neural-like network" approach for highly accurate sensitive data search instead of a simple "rule-based" approach. As a result, information surrounding a given string is correlated and complex inferences are made to determine whether that string is relevant to the search.
• Consistent masking across a single or multiple Hadoop clusters: This capability preserves analytical value of information for trend analysis and aggregations.
• Simplified management: DG for Hadoop provides automatic notifications so that security personnel can be alerted by e-mail or SMS when a job is completed or when changes occur.
• Compliance audit reports: New reporting that compliance auditors can integrate in their analysis of the company’s overall compliance process and posture.


According to Gartner, “Dataguise DG for Hadoop is a security offering of great value in an insecure platform, which Hadoop certainly is today.” DG for Hadoop is deployed across Fortune 200 institutions and built for the enterprise to evaluate exposure risks and enforce the most appropriate remediation to prevent unauthorized access, financial penalties and negative brand impact. The solution allows the user to define and detect the data in a Hadoop installation that is sensitive in nature (credit card numbers, social security numbers, account numbers, personally identifiable information, etc.), analyze the company’s risk from the exposure of that data and protect the information with masking or encryption so the data can be used safely.


“The various distributions of Apache Hadoop provide a high performance platform for managing large volumes of data, helping organizations harness the potential of Big Data to make informed decisions,” said Ashar Baig, Founder & Principal Analyst, Analyst Connection. “For security solutions to be effective in this environment requires both the ability to secure the information effectively and do so without significant impact to operational performance. DG for Hadoop is a sophisticated solution that addresses these areas to provide the assurance and confidence in dealing with sensitive data.”


“With Hadoop deployments projected to grow in an upward direction for the foreseeable future, the threat to organizations that do not adopt a comprehensive approach to securing this data remains high,” said Manmeet Singh, CEO, Dataguise. “DG for Hadoop provides a feature set unmatched by comparable alternatives, helping users benefit from the promise of Big Data without the potential risks.”