Securing our dataset is extremely important for both protecting individuals' data and ensuring that our data stays compliant with global data regulations.

From the PDL Side

We're committed to ensuring that our bulk data dumps don't get exposed. We're extremely sensitive to this and have multiple white-hat security partners who are searching the Internet in an effort to find vulnerable datasets and clamp down on them before nefarious actors discover them.

We are also continuing to update our Services Agreement and contracts in an effort to encourage our customers to secure the data and take initiative in ensuring that the data is protected for everyone.

From the Customer Side

The most common way that data gets exposed is during its storage and processing. We understand the need to store such large datasets off-premise but encourage you to read the following documentation if you are planning to use any of the below services:

MongoDB
Elasticsearch
Hadoop
Jupyter
Amazon S3

We do not recommend any services outside of these.

In addition to our standard customer onboarding, we're happy to hop on a call to help your engineering team ensure that you are properly securing your data. We can also perform a security consultation.