What is Data Classification? | Best Practices & Data Types | Imperva (2024)

What is Data Classification

Data classification tags data according to its type, sensitivity, and value to the organization if altered, stolen, or destroyed. It helps an organization understand the value of its data, determine whether the data is at risk, and implement controls to mitigate risks. Data classification also helps an organization comply with relevant industry-specific regulatory mandates such as SOX, HIPAA, PCI DSS, and GDPR.

Blog: Top Challenges to Implementing Data Privacy: Nailing Down Discovery and Classification First is Key.

Data Sensitivity Levels

Data is classified according to its sensitivity level—high, medium, or low.

  • High sensitivity data—if compromised or destroyed in an unauthorized transaction, would have a catastrophic impact on the organization or individuals. For example, financial records, intellectual property, authentication data.
  • Medium sensitivity data—intended for internal use only, but if compromised or destroyed, would not have a catastrophic impact on the organization or individuals. For example, emails and documents with no confidential data.
  • Low sensitivity data—intended for public use. For example, public website content.

What is Data Classification? | Best Practices & Data Types | Imperva (1)

Data Sensitivity Best Practices

Since the high, medium, and low labels are somewhat generic, a best practice is to use labels for each sensitivity level that make sense for your organization. Two widely-used models are shown below.

SENSITIVITYMODEL 1MODEL 2
HighConfidentialRestricted
MediumInternal Use OnlySensitive
LowPublicUnrestricted

If a database, file, or other data resource includes data that can be classified at two different levels, it’s best to classify all the data at the higher level.

Solution Spotlight: Enable Data Discovery and Classification.

Types of Data Classification

Data classification can be performed based on content, context, or user selections:

  • Content-based classification—involves reviewing files and documents, and classifying them
  • Context-based classification—involves classifying files based on meta data like the application that created the file (for example, accounting software), the person who created the document (for example, finance staff), or the location in which files were authored or modified (for example, finance or legal department buildings).
  • User-based classification—involves classifying files according to a manual judgement of a knowledgeable user. Individuals who work with documents can specify how sensitive they are—they can do so when they create the document, after a significant edit or review, or before the document is released.

Data States and Data Format

Two additional dimensions of data classifications are:

  • Data states—data exists in one of three states—at rest, in process, or in transit. Regardless of state, data classified as confidential must remain confidential.
  • Data format—data can be either structured or unstructured. Structured data are usually human readable and can be indexed. Examples of structured data are database objects and spreadsheets. Unstructured data are usually not human readable or indexable. Examples of unstructured data are source code, documents, and binaries. Classifying structured data is less complex and time-consuming than classifying unstructured data.

Blog: How Organizations Manage to Understand Millions of Unstructured Data Files at Scale.

Data Discovery

Classifying data requires knowing the location, volume, and context of data. Most modern businesses store large volumes of data, which may be spread across multiple repositories:

  • Databases deployed on-premises or in the cloud
  • Big data platforms
  • Collaboration systems such as Microsoft SharePoint
  • Cloud storage services such as Dropbox and Google Docs
  • Files such as spreadsheets, PDFs, or emails

Before you can perform data classification, you must perform accurate and comprehensive data discovery. Automated tools can help discover sensitive data at large scale. See our article on Data Discovery for more information.

The Relation Between Data Classification and Compliance

Data classification must comply with relevant regulatory and industry-specific mandates, which may require classification of different data attributes. For example, the Cloud Security Alliance (CSA) requires that data and data objects must include data type, jurisdiction of origin and domicile, context, legal constraints, sensitivity, etc. PCI DSS does not require origin or domicile tags.

Creating Your Data Classification Policy

A data classification policy defines who is responsible for data classification—typically by defining Program Area Designees (PAD) who are responsible for classifying data for different programs or organizational units.

The data classification policy should consider the following questions:

  • Which person, organization or program created and/or owns the information?
  • Which organizational unit has the most information about the content and context of the
    information?
  • Who is responsible for the integrity and accuracy of the data?
  • Where is the information stored?
  • Is the information subject to any regulations or compliance standards, and what are the penalties associated with non-compliance?

Data classification can be the responsibility of the information creators, subject matter experts, or those responsible for the correctness of the data.

The policy also determines the data classification process: how often data classification should take place, for which data, which type of data classification is suitable for different types of data, and what technical means should be used to classify data. The data classification policy is part of the overall information security policy, which specifies how to protect sensitive data.

Data Classification Examples

Following are common examples of data that may be classified into each sensitivity level.

Sensitivity LevelExamples
HighCredit card numbers (PCI) or other financial account numbers, customer personal data, FISMA protected information, privileged credentials for IT systems, protected health information (HIPAA), Social Security numbers, intellectual property, employee records.
MediumSupplier contracts, IT service management information, student education records (FERPA), telecommunication systems information, internal correspondence not including confidential data.
LowContent of public websites, press releases, marketing materials, employee directory.

See how Imperva Data Security Solutions can help you with data classification.

Request demo Learn more

Imperva Data Protection Solutions

Imperva provides automated data discovery and classification, which reveals the location, volume, and context of data on premises and in the cloud.

In addition to data classification, Imperva protects your data wherever it lives—on premises, in the cloud and in hybrid environments. It also provides security and IT teams with full visibility into how the data is being accessed, used, and moved around the organization.

Our comprehensive approach relies on multiple layers of protection, including:

  • Database firewall—blocks SQL injection and other threats, while evaluating for known vulnerabilities.
  • User rights management—monitors data access and activities of privileged users to identify excessive, inappropriate, and unused privileges.
  • Data masking and encryption—obfuscates sensitive data so it would be useless to the bad actor, even if somehow extracted.
  • Data loss prevention (DLP)—inspects data in motion, at rest on servers, in cloud storage, or on endpoint devices.
  • User behavior analytics—establishes baselines of data access behavior, uses machine learning to detect and alert on abnormal and potentially risky activity.
  • Data discovery and classification—reveals the location, volume, and context of data on premises and in the cloud.
  • Database activity monitoring—monitors relational databases, data warehouses, big data and mainframes to generate real-time alerts on policy violations.
  • Alert prioritization—Imperva uses AI and machine learning technology to look across the stream of security events and prioritize the ones that matter most.
What is Data Classification? | Best Practices & Data Types | Imperva (2024)
Top Articles
What the Tech: App of the day - CoinSnap
Retirer Bitcoin: Découvrez La Méthode Qui Vous Convient!
Jazmen Jafar Linkedin
Die Windows GDI+ (Teil 1)
Sunday World Northern Ireland
Espn Expert Picks Week 2
Planets Visible Tonight Virginia
Iron Drop Cafe
Capitulo 2B Answers Page 40
What Happened To Maxwell Laughlin
Healing Guide Dragonflight 10.2.7 Wow Warring Dueling Guide
Evil Dead Rise Showtimes Near Regal Columbiana Grande
finaint.com
N2O4 Lewis Structure & Characteristics (13 Complete Facts)
Mani Pedi Walk Ins Near Me
Urban Dictionary: hungolomghononoloughongous
Walmart stores in 6 states no longer provide single-use bags at checkout: Which states are next?
Equibase | International Results
Candy Land Santa Ana
1989 Chevy Caprice For Sale Craigslist
Best Mechanics Near You - Brake Masters Auto Repair Shops
Bekijk ons gevarieerde aanbod occasions in Oss.
Winco Employee Handbook 2022
Uncovering The Mystery Behind Crazyjamjam Fanfix Leaked
Chime Ssi Payment 2023
Drying Cloths At A Hammam Crossword Clue
Star Wars Armada Wikia
Solo Player Level 2K23
Obsidian Guard's Skullsplitter
How Much Is An Alignment At Costco
Was heißt AMK? » Bedeutung und Herkunft des Ausdrucks
Calculator Souo
Netherforged Lavaproof Boots
Federal Student Aid
Otter Bustr
The Vélodrome d'Hiver (Vél d'Hiv) Roundup
Craigslist List Albuquerque: Your Ultimate Guide to Buying, Selling, and Finding Everything - First Republic Craigslist
888-333-4026
Flipper Zero Delivery Time
Gotrax Scooter Error Code E2
Paul Shelesh
Sour OG is a chill recreational strain -- just have healthy snacks nearby (cannabis review)
22 Golden Rules for Fitness Beginners – Barnes Corner Fitness
Catchvideo Chrome Extension
Unblocked Games - Gun Mayhem
Killer Intelligence Center Download
Doe mee met ons loyaliteitsprogramma | Victoria Club
5103 Liberty Ave, North Bergen, NJ 07047 - MLS 240018284 - Coldwell Banker
300+ Unique Hair Salon Names 2024
King Fields Mortuary
Used Curio Cabinets For Sale Near Me
Latest Posts
Article information

Author: Duane Harber

Last Updated:

Views: 5559

Rating: 4 / 5 (71 voted)

Reviews: 94% of readers found this page helpful

Author information

Name: Duane Harber

Birthday: 1999-10-17

Address: Apt. 404 9899 Magnolia Roads, Port Royceville, ID 78186

Phone: +186911129794335

Job: Human Hospitality Planner

Hobby: Listening to music, Orienteering, Knapping, Dance, Mountain biking, Fishing, Pottery

Introduction: My name is Duane Harber, I am a modern, clever, handsome, fair, agreeable, inexpensive, beautiful person who loves writing and wants to share my knowledge and understanding with you.