October 2021 Release Notes (v16.0)

Release Name | Dataset Version | Publish Date
October 2021 | v16.0 | 10/11/2021

Person v16.0 was released on 10/11/2021

❗️

DEPRECATION NOTICE -- v4 API ENDPOINT

Support has officially ended for the v4 (Schema 4) Person Enrichment API. We will no longer provide bug fixes for this endpoint. We strongly urge all customers currently using the Schema 4 Person Enrichment endpoint (v4/person) to transition to the Schema 5 endpoint by October 31, 2021. This endpoint will have an End of Life date of January 1, 2022.

Please see our v4 (Schema 4) to v5 (Schema 5) migration guide for more information, and reach out to your Customer Success team if you have any questions or need assistance with the migration.

❗️

DEPRECATION NOTICE -- Canonical Data

This is the final release in which we will publish the "canonical" location, school, and company files to s3://pdl-prod-schema. We will continue to provide this relational data through complimentary access to our Cleaner Endpoints and Autocomplete API for all customers.

❗️

DEPRECATION NOTICE -- Version Status from API Responses

People Data Labs will be launching monthly data updates for API users by Jan 1, 2022 and will be deprecating the Version Status field (along with its nested fields) from API responses as part of that change. We will continue supporting version status for our data license customers in our quarterly releases.

Additionally, the ID Changelog provided to all customers through S3 will continue to be updated through our major quarterly releases, but will not be updated in monthly API releases for minor version updates.

Please reach out to your Customer Success team at PDL if you have any questions or concerns.

Person Schema Changes

last_initial Field

As of the October 2021 release (this release), we will allow records that have no last_name and only a last_initial (a new field).

🚧

last_name is now a nullable field

Previously, full_name, first_name, and last_name were all non-nullable. Now the minimum requirements for a record are full_name, first_name, and last_initial.

Example Name

{
  "full_name": "ben e",
  "first_name": "ben",
  "last_name": null,
  "last_initial": "e",
  ...
}

This change should increase our coverage of profiles in our Resume Data. It will also yield more matches for customers who want a match even when a name is not fully populated. Person Enrichment customers who need a full name should use the required parameter to request the fields their use case depends on.
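A minimal sketch of how a consumer might handle the nullable last_name, assuming records arrive as plain dictionaries. The helper names are hypothetical, and the email input parameter and the "last_name" value passed to required are assumptions about the Person Enrichment request format rather than details stated in these notes.

```python
from typing import Any, Dict, Mapping

# Fields these release notes list as the minimum for a v16 record.
MINIMUM_NAME_FIELDS = ("full_name", "first_name", "last_initial")


def has_minimum_name_fields(record: Mapping[str, Any]) -> bool:
    """Return True if every minimum name field is present and non-empty."""
    return all(record.get(field) for field in MINIMUM_NAME_FIELDS)


def has_full_last_name(record: Mapping[str, Any]) -> bool:
    """Return True only when last_name is populated (it may now be null)."""
    return bool(record.get("last_name"))


def enrichment_params(email: str, require_last_name: bool = False) -> Dict[str, str]:
    """Build query parameters for a Person Enrichment request.

    Assumption: the API takes an `email` input and a `required` parameter
    naming fields that must be populated for a match to be returned.
    """
    params = {"email": email}
    if require_last_name:
        params["required"] = "last_name"
    return params


example = {"full_name": "ben e", "first_name": "ben", "last_name": None, "last_initial": "e"}
print(has_minimum_name_fields(example))  # True
print(has_full_last_name(example))       # False
print(enrichment_params("ben@example.com", require_last_name=True))
```

Checking last_name explicitly on the client side keeps downstream code from breaking on null values, whether or not the required parameter is used on the request.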

Indexing Date Fields as Dates

As of this release (October 2021 - v16), we are changing the mapping of all date fields to the Date type. This allows Search API users to run range searches on dates. See the Search API Field Changes section below for more information.
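To illustrate what a range search on a date field could look like, here is a small sketch that builds an Elasticsearch-style range clause. It assumes the Search API accepts Elasticsearch query syntax in a top-level query key; the helper name date_range_query is hypothetical, and the sketch only builds the request body without sending it.

```python
import json
from typing import Optional


def date_range_query(field: str, gte: Optional[str] = None, lte: Optional[str] = None) -> dict:
    """Build an Elasticsearch-style range clause for a date field."""
    bounds = {}
    if gte is not None:
        bounds["gte"] = gte
    if lte is not None:
        bounds["lte"] = lte
    if not bounds:
        raise ValueError("at least one of gte or lte is required")
    return {"query": {"range": {field: bounds}}}


# Profiles whose current job started during 2020.
query = date_range_query("job_start_date", gte="2020-01-01", lte="2020-12-31")
print(json.dumps(query, indent=2))
```

Range clauses of this kind only behave as date comparisons once the field is mapped as a date, which is the change described above.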

Changing founded and birth_year to Integer

As of this release (October 2021 - v16), we return birth_year, job_company_founded, and experience.company.founded to data license customers as integers instead of strings. See the Data License Field Changes section below for more information.
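Pipelines that read both older and newer flat files may see these fields as either strings or integers. The sketch below shows one way to normalize them; the helper name to_year is hypothetical, and treating the literal string "None" as missing is an assumption based on the job_company_founded bug listed under Bug Fixes.

```python
from typing import Optional, Union


def to_year(value: Union[str, int, None]) -> Optional[int]:
    """Normalize a year that may arrive as a string (pre-v16) or an integer (v16)."""
    if value is None:
        return None
    if isinstance(value, int):
        return value
    text = value.strip()
    # Treat empty strings and the literal "None" as missing values.
    if text == "" or text == "None":
        return None
    return int(text)


print(to_year("1999"))  # 1999
print(to_year(2004))    # 2004
print(to_year("None"))  # None
```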

Company Schema Changes

employee_count Field

As of the October 2021 release (this release - v16) we are adding a new employee_count field to the Company Schema:

Company Schema

Field Name | Field Type | Field Description
employee_count | Int | This is a new field representing the current number of employees working at the company.

New Products, Features and Integrations

We launched a number of new API endpoints and integrations this past quarter, and we are excited to get them into the hands of current and potential customers. Check them out!

New Endpoints

Autocomplete API [open-access beta]

  • Access via: v5/autocomplete endpoint
  • Description: The Autocomplete API allows users to retrieve suggestions for Search API query values along with the number of available records for each suggestion (for example, schools starting with "stanf"). This endpoint is in open-beta and we would love to hear your feedback as you use it!
  • Resources:
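As a rough sketch of the "stanf" example above, the following builds an Autocomplete request URL. Only the v5/autocomplete path comes from these notes; the base URL and the field, text, and size parameter names are assumptions, and authentication is left out entirely.

```python
from urllib.parse import urlencode

# Assumed base URL; only the v5/autocomplete path is given in these notes.
BASE_URL = "https://api.peopledatalabs.com"


def autocomplete_url(field: str, text: str, size: int = 10) -> str:
    """Build a URL asking for suggestions for `field` that start with `text`."""
    query = urlencode({"field": field, "text": text, "size": size})
    return f"{BASE_URL}/v5/autocomplete?{query}"


print(autocomplete_url("school", "stanf"))
```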

Person Retrieve API [open-access beta]

  • Access via: v5/person/retrieve endpoint
  • Description: The Person Retrieve API allows users to pull profiles based on PDL ID in order to “refresh” their record. This endpoint is in open-beta and we would love to hear your feedback as you use it!
  • Resources:
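A minimal sketch of building a Person Retrieve request URL from a PDL ID. Only the v5/person/retrieve path comes from these notes; the base URL and the placement of the ID as the final path segment are assumptions, and the ID shown is a made-up placeholder.

```python
from urllib.parse import quote

# Assumed base URL; only the v5/person/retrieve path is given in these notes.
BASE_URL = "https://api.peopledatalabs.com"


def retrieve_url(person_id: str) -> str:
    """Build a URL for refreshing the record with the given PDL ID."""
    if not person_id:
        raise ValueError("person_id must be a non-empty string")
    # Escape the ID so it is always a single, safe path segment.
    return f"{BASE_URL}/v5/person/retrieve/{quote(person_id, safe='')}"


print(retrieve_url("abc123_XYZ"))  # "abc123_XYZ" is a hypothetical ID
```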

New Integrations and Tools (No Coding Required!)

Zapier Integration [open access to all customers]

Query Builder [open access to all customers]

New Page Updates

API Status Page [publicly viewable]

  • Access via: People Data Labs System Status Page
  • Description: We recently launched our updated API status page, which provides a refreshed view of each of our endpoints along with detailed reporting on outages and other incidents. Users can now also subscribe to receive real-time notifications of outages and scheduled downtime.
  • Resources: N/A

Data Field Changes

Data License Field Changes

Data license customers will see the following field changes in their flat file dataset:

Person Schema

Field Name | Field Type | Field Description
birth_year | Int | We have changed the field type from String to Int
job_company_founded | Int | We have changed the field type from String to Int
experience.company.founded | Int | We have changed the field type from String to Int

Search API Field Changes

Person Schema

Field Name | Field Type | Field Description
birth_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
birth_year | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
certifications.end_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
certifications.start_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
education.end_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
education.start_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
experience.company.founded | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
experience.end_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
experience.start_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
job_company_founded | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
job_last_updated | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
job_start_date | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
location_last_updated | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping

Company Schema

Field Name | Field Type | Field Description
founded | Date | We have changed the type from Keyword to Date in the Elasticsearch mapping
employee_count | Int | This is a new field representing the current number of employees working at the company.

Freshness

This quarter, we made huge strides in refreshing our datasets and have updated job titles for over 208mm of our global profiles and locations for over 182mm in our resume dataset. We also updated jobs for 57mm of our United States profiles and locations for 62mm in our resume dataset. Our street_address slice, in particular, saw a significant number of location updates from our efforts this past quarter leading to a total of 449mm updates globally in the API.

Coverage

We are continuing to make strides to link more PII to our core datasets. See some highlights below and click the links on each slice to see the full set of stats for each.

Resume Dataset

Linkage | Coverage in v15 | Coverage in v16 | Increase (%)
total_records | 641,883,288 | 666,506,879 | 3.84%
work_email | 42,575,902 | 50,293,158 | 18.13%
job_title | 414,447,226 | 448,958,801 | 8.33%

API Dataset

Linkage | Coverage in v15 | Coverage in v16 | Increase (%)
total_records | 2,983,095,918 | 3,045,094,718 | 2.08%
work_email | 43,637,063 | 51,396,508 | 17.78%

Street Address Dataset

Linkage | Coverage in v15 | Coverage in v16 | Increase (%)
total_records | 230,869,583 | 252,027,862 | 9.16%
work_email | 945,388 | 1,109,694 | 17.38%
languages | 23,360,999 | 26,015,540 | 11.36%
personal_emails | 76,307,278 | 84,355,805 | 10.55%
interests | 8,054,382 | 8,810,820 | 9.39%

Mobile Phone Dataset

Linkage | Coverage in v15 | Coverage in v16 | Increase (%)
location_names | 142,500,446 | 190,126,562 | 33.42%
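The Increase (%) figures in the coverage tables are consistent with a simple relative change between the v15 and v16 counts, rounded to two decimal places. The sketch below reproduces two of the published values under that assumption; the function name percent_increase is illustrative.

```python
def percent_increase(old: int, new: int) -> float:
    """Return the relative change from old to new as a percentage, rounded to 2 places."""
    if old <= 0:
        raise ValueError("old must be a positive count")
    return round((new - old) / old * 100, 2)


# Resume Dataset total_records, v15 to v16
print(percent_increase(641_883_288, 666_506_879))  # 3.84
# Mobile Phone Dataset location_names, v15 to v16
print(percent_increase(142_500_446, 190_126_562))  # 33.42
```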

Commentary

  • We increased our current work email coverage across all our datasets including linkages between work_email <> B2B info and work_email <> mobile phone.
  • We significantly improved our linkages between mobile_phone and location_names (and other related location information) as well as linkages between mobile_phone and work_email.
  • We increased our coverage of records in the resume dataset by almost 4% and our coverage of job titles for that dataset by ~8%.
  • Overall, we had a significant lift in new jobs, emails, and locations this quarter.

Improvements

  • We merged ~1.5mm records together in the resume dataset, reducing our rate of duplication.
  • We added a new last_initial field to our data which allows users to retrieve profile matches for records even without a fully populated name. See notice above.
  • We added an employee_count field to our company data representing the current employee headcount as an integer value. See the Company Schema Changes section above.
  • We improved the matching logic for the Company Enrichment API, which now also supports enriching on ticker and location.
  • We improved the accuracy of our estimation for inferred_salary.
  • We added the ability for users to exclude datasets when using the Search API (see code example).
  • We added clearer messaging when hitting rate limits in Search APIs.
  • We updated our Search APIs to support queries containing uppercased strings.

Bug Fixes

  • We fixed a bug in our data that caused null values in job_company_founded to appear as the string "None".
  • We fixed a bug where the Bulk Person Enrichment API did not handle nested required and min_likelihood parameters.
  • We fixed a bug where the Bulk Person Enrichment API returned incorrect metadata values for unmatched (404) requests.
  • We fixed a bug in how the Person Enrichment API handled hashed emails with capitalized characters.
  • We corrected a bug in our data with undesired characters appearing in experience summaries.
  • We fixed a bug in our build process parsing for names with false-positive inputs.