July 2024 Release Notes - People Data Labs Documentation

Release Name	Dataset Version	Publish Date
July 2024	`v27.0`	07/02/2024

v27.0 was released on July 2, 2024. Welcome to our July 2024 release notes! We have exciting updates to share to kick off the second half of the year! Here are some of the key highlights:

Premium field bundles for our self-serve customers
A Company Changelog to help track updates across releases
New linkedin_follower_count field in our Company Schema
Nearly 200 million job updates and 12 million job changes tracked in our dataset over the past quarter!

Excited yet? Read on to learn more, or jump to a specific section using the table of contents below.

📣 Key Announcements

Schema Changes

`linkedin_follower_count` (Company Schema)

FIELD NAME	FIELD TYPE	FIELD DESCRIPTION	EXAMPLE
`linkedin_follower_count`	`Integer (>= 0)`	The number of followers on a company’s LinkedIn profile	`"linkedin_follower_count": 5880`

We are adding a new linkedin_follower_count field to our Company Data schema that tracks the number of followers on a company’s LinkedIn profile. This field is available in our Premium and Comprehensive field bundles, and is immediately provided to current customers using those bundles. To get access to this field, please reach out to your Customer Success team.
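
As a rough illustration of consuming the new field, the sketch below reads `linkedin_follower_count` from a company record that has already been parsed into a Python dict. The helper name `get_linkedin_follower_count` and the sample records are hypothetical; the only details taken from the schema table above are the field name and its `Integer (>= 0)` type. Records delivered without the Premium or Comprehensive bundle are assumed to simply lack the key.

```python
from typing import Optional


def get_linkedin_follower_count(company: dict) -> Optional[int]:
    """Return the follower count, or None when the field is absent, null, or invalid."""
    value = company.get("linkedin_follower_count")
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(value, int) and not isinstance(value, bool) and value >= 0:
        return value
    return None


# Hypothetical records for illustration only.
record = {"name": "example co", "linkedin_follower_count": 5880}
print(get_linkedin_follower_count(record))                 # 5880
print(get_linkedin_follower_count({"name": "no bundle"}))  # None
```

Returning `None` instead of `0` keeps "field not provided" distinct from "company has zero followers".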

❗Breaking Changes (Going Live This Month)

❗Deprecation of `job_last_updated`

Products Impacted: Person Schema
Previous Announcements: v26 / April 2024, v26.1 / May 2024, v26.2 / June 2024

As part of the new resume timestamps released last quarter, the job_last_updated field is now fully deprecated and has been removed from our Person Schema in the v27.0 release. Customers previously using this field should use our new job_last_verified field, which provides the same functionality. Please see the guide prepared by our Technical Services team for detailed instructions and best practices on this transition.

Breaking Changes Guide: Deprecation of job_last_updated

For further support, please reach out to your Customer Success and Technical Services teams.
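
For pipelines that may still hold person records from builds before v27.0, one possible transition pattern is to read `job_last_verified` first and fall back to the removed field only when the new one is missing. This is a minimal sketch, not the approach from the Breaking Changes Guide: the helper name `job_freshness_date` is hypothetical, the sample date strings are made up, and the value is treated as an opaque string because its format is not stated here.

```python
from typing import Optional


def job_freshness_date(person: dict) -> Optional[str]:
    """Prefer job_last_verified; fall back to the deprecated field for older records."""
    verified = person.get("job_last_verified")
    if verified is not None:
        return verified
    # job_last_updated no longer exists in v27.0+ records, so this branch
    # only matters for data stored from earlier dataset versions.
    return person.get("job_last_updated")


# Hypothetical records for illustration only.
new_record = {"job_last_verified": "2024-06-15"}
old_record = {"job_last_updated": "2024-01-10"}
print(job_freshness_date(new_record))  # 2024-06-15
print(job_freshness_date(old_record))  # 2024-01-10
```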

⚠️ Upcoming Breaking Changes

Upcoming breaking changes in future versions may impact your current processes. We are announcing them here to provide ample time for you to adjust your processes accordingly.

⚠️ New Role and Sub Role Job Title Taxonomy (Person / Company / IP)

Change expected in: v29.1 / February 2025
Previous Announcements: v26 / April 2024
Products Impacted: Person / Company / IP Schema

Person Fields Impacted	Company Fields Impacted	IP Fields Impacted
job_title_role, job_title_sub_role, experience.title.role, experience.title.sub_role	average_tenure_by_role, employee_count_by_month_by_role, employee_count_by_role, recent_exec_departures, recent_exec_hires, top_next_employers_by_role, top_previous_employers_by_role	person.job_title_role, person.job_title_sub_role

Over the next 2 quarters, we will be making significant changes to our job_title_role and job_title_sub_role enum values in order to improve our fill rates and categorization of job titles. These changes will include a revamped taxonomy for role and subrole values containing additions, renamings, recategorizations, removals and other modifications to the current set of canonical role and subrole values. For the specific details on these changes, see the Resources at the end of this notice.

As indicated in the table above, this change will impact our Person, Company, and IP data fields. Customers using the Company Insights fields shown above (such as for visualization, modeling or other uses) will need to update their code to handle the new, deleted, and renamed role values. While this is a significant change for many of our customers, it is necessary to improve our data quality and to provide a better overall user experience.

Timeline

Given the scope of the changes, our goal is to provide clear visibility on the process and ample opportunity to work through this transition together. The projected timeline for this release is as follows:

V27.0 (July 2024) - Breaking Change Announcement and Resource Launch:

Public notice of our planned role / subrole transition and initial resources provided (see below)

V27.1 (August 2024) - Beta:

We will open up beta access to the new role / subrole taxonomy as well as a new data field title.class
Customers will be able to test sample data provided by our Technical Services team to explore the new taxonomy and the potential data impacts
We will release a guide documenting our recommended best practices for transitioning to the new taxonomy with the beta release as well

V28.0 (October 2024) - General Availability:

We will make the new role / subrole taxonomy generally available for all customers and begin the deprecation process for the previous taxonomy
Customers can opt in to accessing the new taxonomy via API and flat file deliveries (but will have the option to delay transitioning until their systems are updated)

V29.1 (February 2025) - Final Deprecation:

We will fully deprecate and officially end support for the previous taxonomy
All new and existing customers will be moved onto the new role / subrole taxonomy.

Resources

Please use the following resources to better understand the upcoming changes and to start preparing for the transition. As always, reach out to your Customer Success and Technical Services teams for questions and support.

The new set of canonical classes, roles, and subroles is here:

The mapping from the current role/subrole taxonomy to the improved taxonomy is here: Role / Subrole Taxonomy Restructure [post-v27.1]
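
To make "handle the new / deleted / renamed role values" concrete, here is a hedged sketch of translating a role-keyed count (for example, the shape of an `employee_count_by_role` value) from one taxonomy to another. The function `remap_role_counts` and the placeholder names `role_a`, `role_b`, and `role_c` are hypothetical and stand in for entries from the published mapping; no actual new role names are implied.

```python
from collections import Counter
from typing import Dict, Set, Tuple


def remap_role_counts(
    counts: Dict[str, int],
    renamed: Dict[str, str],
    removed: Set[str],
) -> Tuple[Dict[str, int], Dict[str, int]]:
    """Translate role -> count data into a new taxonomy.

    Returns (remapped counts, counts for roles that were removed).
    Roles in neither `renamed` nor `removed` pass through unchanged.
    """
    remapped: Counter = Counter()
    dropped: Dict[str, int] = {}
    for role, count in counts.items():
        if role in removed:
            dropped[role] = count
            continue
        # Two old roles may map to the same new role, so sum rather than overwrite.
        remapped[renamed.get(role, role)] += count
    return dict(remapped), dropped


# Placeholder role names, NOT real taxonomy values.
old_counts = {"role_a": 10, "role_b": 5, "role_c": 2}
new_counts, dropped = remap_role_counts(
    old_counts, renamed={"role_b": "role_a"}, removed={"role_c"}
)
print(new_counts)  # {'role_a': 15}
print(dropped)     # {'role_c': 2}
```

Keeping the removed counts separate, rather than silently discarding them, makes it easier to reconcile totals before and after the transition.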

⚠️ Location Country Enum Updates

Change expected in: v28.0 / October 2024
Products Impacted: Person / Company / IP Schema

Person Fields Impacted	Company Fields Impacted	IP Fields Impacted
location_country, countries, street_addresses.country, possible_street_addresses.country, job_company_location_country, experience.company.location.country, education.school.country	location.country, employee_count_by_country	ip.location.country, ip.company.location.country

Next quarter, we will be updating the set of canonical country values to better accommodate geographical renamings and to correct redundancies in our set of country values. This change is part of an ongoing effort to improve the overall location standardization process within our data. As such, it will impact the location country values in our Person, Company and IP datasets.

The updated set of country values that will be released in v28.0 is here:

Location Countries [post-v28.0]

In addition, a mapping from the current country values to the upcoming country values can be found here:

Country (pre-v28.0)	Change Type	Country (post-v28.0)	Comments
`swaziland`	Renamed	`eswatini`
`antarctica`	Deleted	–
`macedonia`	Renamed	`north macedonia`
`pitcairn`	Renamed	`pitcairn islands`
`gambia`	Renamed	`the gambia`
`ivory coast`	Deleted	–	Redundant with `côte d'ivoire`
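
The mapping table above can be applied to stored country values with a small lookup. The sketch below encodes exactly the six rows shown; the function name `migrate_country` is hypothetical, and folding `ivory coast` into `côte d'ivoire` is an assumption based on the "Redundant with" comment rather than a documented rule. `antarctica` has no listed replacement, so it maps to `None`.

```python
from typing import Optional

# Renamed rows from the pre-v28.0 to post-v28.0 mapping table.
COUNTRY_RENAMES = {
    "swaziland": "eswatini",
    "macedonia": "north macedonia",
    "pitcairn": "pitcairn islands",
    "gambia": "the gambia",
}

# Deleted rows from the same table.
COUNTRY_DELETIONS = {"antarctica", "ivory coast"}

# Assumption: a deleted value marked as redundant is replaced by its duplicate.
DELETED_REPLACEMENTS = {"ivory coast": "côte d'ivoire"}


def migrate_country(value: Optional[str]) -> Optional[str]:
    """Map a pre-v28.0 country value to its post-v28.0 equivalent."""
    if value is None:
        return None
    if value in COUNTRY_RENAMES:
        return COUNTRY_RENAMES[value]
    if value in COUNTRY_DELETIONS:
        # None when the table lists no replacement (e.g. antarctica).
        return DELETED_REPLACEMENTS.get(value)
    return value  # unchanged by this release


for country in ["swaziland", "ivory coast", "antarctica", "canada"]:
    print(country, "->", migrate_country(country))
```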

⚠️ Company Type Enum Change

Change expected in: v28.0 / October 2024
Products Impacted: Person / Company Schema

Person Fields Impacted	Company Fields Impacted
job_company_type experience.company.type	type

Next quarter, we will be adding a new public_subsidiary value to the set of canonical company type values:

Canonical Company Types (pre-v28.0)	Canonical Company Types (post-v28.0)
educational, government, nonprofit, private, public	educational, government, nonprofit, private, public, public_subsidiary

This change is part of an ongoing effort to improve our coverage of stock ticker fields and how we enable customers to roll up Company Insights information to public companies. The addition of the public_subsidiary company type specifically is intended to help provide customers a mechanism to easily filter and pull all public companies and their subsidiaries.
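
Once the new value is live, filtering for public companies together with their subsidiaries could look like the following sketch. It operates on company records already loaded as Python dicts; the helper name `is_public_or_subsidiary` and the sample records are hypothetical, and only the `type` field and its canonical values come from the tables above.

```python
PUBLIC_TYPES = frozenset({"public", "public_subsidiary"})


def is_public_or_subsidiary(company: dict) -> bool:
    """True when the company's type is public or (post-v28.0) public_subsidiary."""
    return company.get("type") in PUBLIC_TYPES


# Hypothetical records for illustration only.
companies = [
    {"name": "parent corp", "type": "public"},
    {"name": "child llc", "type": "public_subsidiary"},
    {"name": "local shop", "type": "private"},
    {"name": "unknown org"},  # type missing
]

selected = [c["name"] for c in companies if is_public_or_subsidiary(c)]
print(selected)  # ['parent corp', 'child llc']
```

Code that currently switches on the five pre-v28.0 values should also tolerate the sixth value rather than treating it as invalid.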

✨ New Products and Features

Company Changelog

This quarter, we are excited to release our Company Changelog into Beta for all customers. Similar to our existing Person Changelog, the Company Changelog allows users to see which company records have been updated across each build and keep track of record merges and deletions.

Beta Release: The beta release of this product is the first feature-complete version of the Company Changelog that is publicly available. While we do not anticipate major changes to the product, we hope to collect customer feedback over the next few releases to determine any further improvements or refinements to make. If you have any feedback on the Company Changelog, please reach out to us or share it with your Customer Success team.

The Company Changelog is a public list of company record IDs that are categorized into the following groups:

Updated: Any record that had a value change to any non-insights field or had a record merged into it
Merged: A record that was merged into another record (and as a result no longer exists in the dataset)
Deleted: This record was deleted and no longer exists in the dataset
Added: This record did not exist in the previous dataset version and was added in the latest version

Note that changes to Company Insights fields are excluded from the update calculation by design, since these fields are expected to change as new periods are added each month.

The Company Changelog is helpful for customers looking to streamline their data update and ETL pipelines by filtering data ingestion to just the records that have changed in a release. The Changelog also allows customers to track which records and IDs have changed and how they’ve been updated across builds.

The Company Changelog is publicly available on our S3 bucket as a flat file and freely accessible for all customers to use. For more information, see our documentation.
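
As a sketch of how the four changelog groups might drive an incremental ingestion job, the function below turns changelog entries into a set of record IDs to re-fetch and a set to drop from a local copy. The actual layout of the flat file is not described here, so the `(record_id, change_type)` pair format, the function name `plan_ingestion`, and the sample IDs are all assumptions for illustration.

```python
from typing import Iterable, Set, Tuple


def plan_ingestion(rows: Iterable[Tuple[str, str]]) -> Tuple[Set[str], Set[str]]:
    """Split changelog entries into IDs to upsert and IDs to remove.

    Assumes each row is (record_id, change_type), with change_type one of
    the four groups: updated, merged, deleted, added.
    """
    to_upsert: Set[str] = set()
    to_remove: Set[str] = set()
    for record_id, change_type in rows:
        kind = change_type.strip().lower()
        if kind in ("added", "updated"):
            to_upsert.add(record_id)
        elif kind in ("merged", "deleted"):
            # Both groups mean the ID no longer exists in the dataset.
            to_remove.add(record_id)
        else:
            raise ValueError(f"unexpected change type: {change_type!r}")
    return to_upsert, to_remove


# Made-up IDs for illustration only.
sample = [("id-1", "Updated"), ("id-2", "Merged"), ("id-3", "Deleted"), ("id-4", "Added")]
upsert, remove = plan_ingestion(sample)
print(sorted(upsert))  # ['id-1', 'id-4']
print(sorted(remove))  # ['id-2', 'id-3']
```

Records absent from the changelog can then be left untouched, which is the ingestion saving described above.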

Self-Serve Premium Fields

We are excited to announce that our premium field bundles will be available in early July through the API Dashboard for all self-serve Pro plans. This means that self-serve customers will be able to access premium fields across our person and company datasets, such as job summary, company revenue data, company funding data, and more. Previously, these fields were only accessible to enterprise customers.

Each field bundle can be added to a new or existing Pro plan, so teams can immediately start building, testing and evaluating more of our data without committing to a large upfront package. To get started, log into the API Dashboard and select the field bundles you would like to add by clicking the Manage button on the Plans & Billing page.

Note that existing enterprise customers will not be able to self-serve premium fields through the API Dashboard. Instead, please reach out to your Customer Success team to add or update your access to premium fields.

🚀 Data Updates

Freshness

The number of jobs and locations verified in our datasets over the past month (based on the job_last_verified and location_last_updated fields).

Dataset	Geography	Field	Records Updated
Resume	Global	`experience`	198,448,154
Resume	Global	`location`	318,427,940
Resume	United States	`experience`	72,866,214
Resume	United States	`location`	103,534,938

Job Changes

The number of person records where the primary job experience changed in our Person Dataset over the past month (based on the job_last_changed field).

Dataset	Geography	Records Updated
Resume	Global	11,466,607
Resume	United States	3,696,119

Coverage (Full Stats: Person, Company)

Resume Dataset

Linkage	Coverage in v26	Coverage in v27	Change (%)
`total_records`	744,191,278	721,091,212	-3.10%
name_aliases	28,268,384	36,551,556	29.30%
twitter_username	10,345,484	11,774,688	13.81%
phones	69,139,249	76,973,049	11.33%
job_company_ticker	47,044,362	42,694,043	-9.25%

API Dataset

Linkage	Coverage in v26	Coverage in v27	Change (%)
`total_records`	3,178,815,044	2,794,528,725	-12.09%
twitter_url	205,195,859	55,415,644	-72.99%
github_url	5,543,742	3,693,024	-33.38%
emails	1,108,590,476	890,326,394	-19.69%
experience.company.id	648,645,145	569,128,028	-12.26%

Company Dataset

Linkage	Coverage in v26	Coverage in v27	Change (%)
`total_records`	62,109,427	66,496,323	7.06%
summary	41,703,717	20,466,945	-50.92%
alternative_domains	4,504,390	4,971,197	10.36%
headline	10,310,725	11,086,177	7.52%
linkedin_id	61,249,183	65,727,018	7.31%
website	29,047,677	30,726,681	5.78%
ticker	26,709	23,633	-11.52%
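
The percentage column in the coverage tables above is consistent with a simple relative change between the v26 and v27 counts. The short sketch below reproduces a few rows; the function name `percent_change` is hypothetical, and rounding to two decimal places is an assumption inferred from how the tables are displayed.

```python
def percent_change(before: int, after: int) -> float:
    """Relative change from `before` to `after`, in percent, rounded to 2 places."""
    return round((after - before) / before * 100, 2)


# Rows taken from the Resume and Company tables above.
print(percent_change(744_191_278, 721_091_212))  # total_records (Resume): -3.1
print(percent_change(28_268_384, 36_551_556))    # name_aliases: 29.3
print(percent_change(41_703_717, 20_466_945))    # summary (Company): -50.92
```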

Commentary

Increase in the number of Twitter URLs tied to LinkedIn profiles in our Resume dataset
Increased number of phone numbers tied to LinkedIn profiles in our Resume dataset
12% decrease in the size of our API dataset, as well as a relative decrease in the number of records with some social URLs, as part of our ongoing deduplication efforts to improve data quality and accuracy
Decreased number of company summaries by 50% by filtering out autogenerated summaries (see Improvements below)
Drop in stock tickers across company and person records, due to a bug fix (see Bug Fixes below)

🛠 Improvements and Bug Fixes

Improvements

Removed auto-generated summaries from our company dataset, to help ensure all summaries in our data have been written by the company.
Improved our canonicalization of people tied to Twitter/X profiles in our data
Improved our matching / canonicalization logic for schools

Bug Fixes

Removed a legacy source of Stock Tickers which decreased our ticker coverage (but improved our quality)
Fixed a bug where our levels tagging did not occur deterministically.
Cleaned / Removed a few small sets of LinkedIn URLs that we’ve determined are invalid in all cases.

​📣 Key Announcements

​Schema Changes

​linkedin_follower_count (Company Schema)

​❗Breaking Changes (Going Live This Month)

​❗Deprecation of job_last_updated

​⚠️ Upcoming Breaking Changes

​⚠️ New Role and Sub Role Job Title Taxonomy (Person / Company / IP)

​⚠️ Location Country Enum Updates

​⚠️ Company Type Enum Change

​✨ New Products and Features

​Company Changelog

​Self-Serve Premium Fields

​🚀 Data Updates

​Freshness

​Job Changes

​Coverage (Full Stats: Person, Company)

​Commentary

​🛠 Improvements and Bug Fixes

​Improvements

​Bug Fixes
