September 2024 Release Notes (v27.2)
Release Name | Dataset Version | Publish Date |
---|---|---|
September 2024 | v27.2 | 09/03/2024 |
September 2024 is a Minor Release
Since January 2022, PDL has been releasing data updates every month with a major release every quarter. Minor releases typically contain fewer product updates or key changes, but still contain important data improvements.
This data version was released on 9/3/2024.
Welcome to our September 2024 release notes! Here are some of the key highlights this month:
- Improvements to our Stock Ticker data and new Stock Ticker fields for public subsidiaries in our Company Dataset
- Exciting additional company data improvements (including improvements to our inferred_revenue estimates and easier parent / subsidiary aggregations)
- Big increases in the quality and coverage of our mobile phone data
- Decrease in low-quality data records in our email and phone datasets as a result of our QA improvements
Excited yet? Read on to learn more.
New Stock Ticker Fields (Company Schema)
Field Name | Field Type | Field Description |
---|---|---|
ultimate_parent_ticker | String | Stock symbol of the company's ultimate parent (only for subsidiaries of public companies) |
ultimate_parent_mic_exchange | String | MIC exchange code that corresponds to the stock exchange of the company's ultimate parent (only for subsidiaries of public companies) |
This month, we are adding two new stock-ticker fields to our Company Data Schema. The ultimate_parent_ticker and ultimate_parent_mic_exchange fields are designed to make it easier to search the Company Dataset for all the subsidiaries of a particular parent company. These fields will be populated for company records that are public subsidiaries. In our next major release (v28.0), those records will also be tagged with the type public_subsidiary (see the Company Type Enum Change section below for additional details).
These new fields are available in our Premium and Comprehensive field bundles, and are immediately provided to current customers using those bundles. To get access to these field bundles, please reach out to your Customer Success team.
Upcoming Breaking Changes
These are upcoming breaking changes in future versions that may impact your current processes. We are announcing them here to provide ample time for you to adjust your processes accordingly.
Change expected in: February 2025
Previous Announcements: April 2024, July 2024, August 2024
Products Impacted: Person / Company / IP Schema
Person Fields Impacted | Company Fields Impacted | IP Fields Impacted |
---|---|---|
job_title_role job_title_sub_role experience.title.role experience.title.sub_role | average_tenure_by_role employee_count_by_month_by_role employee_count_by_role recent_exec_departures recent_exec_hires top_next_employers_by_role top_previous_employers_by_role | person.job_title_role person.job_title_sub_role |
As announced in our July 2024 release notes, we are in the process of revising our job_title_role and job_title_sub_role enums by implementing a new job role taxonomy. This breaking change will impact customers using any of the fields listed in the table above.
Last month, we opened up beta access to our updated taxonomy along with a migration guide to support customers through the transition to the new taxonomy. As a reminder, all customers must be fully transitioned to the new taxonomy by February 2025.
The full timeline is shown below:
Timeline
Given the scope of the changes, our goal is to provide clear visibility on the process and ample opportunity to work through this transition together. The projected timeline for this release is as follows:
- ✅ July 2024 - Breaking Change Announcement and Resource Launch:
  - Public notice of our planned role / subrole transition and initial resources provided (see below)
- ✅ August 2024 - Beta:
  - Beta access is now available for the new role / subrole taxonomy, which includes a new data field, title.class
  - Customers are now able to test sample data through our Technical Services team to explore the new taxonomy and the potential data impacts
  - We have released a guide documenting our recommended best practices for transitioning to the new taxonomy with the beta release as well
  - See section below for more details
- October 2024 - General Availability:
  - We will make the new role / subrole taxonomy generally available for all customers and begin the deprecation process for the previous taxonomy
  - Customers can opt in to accessing the new taxonomy via API and flat file deliveries (but will have the option to delay transitioning until their systems are updated)
- February 2025 - Final Deprecation:
  - We will fully deprecate and officially end support for the previous taxonomy
  - All new and existing customers will be moved onto the new role / subrole taxonomy.
Please use the following resources to better understand the upcoming changes and to start preparing for the transition. As always, reach out to your Customer Success and Technical Services teams for questions and support.
[✨NEW] Migration Guide: A document that walks customers through transitioning their codebase and datasets to the new taxonomy is here:
The new set of canonical classes, roles, and subroles is here:
- Job Title Class [post-update]
- Job Title Roles [post-update]
- Job Title Subroles [post-update]
- Mapping Job Title Class to Roles to Subroles [post-update]
The mapping from the current role/subrole taxonomy to the improved taxonomy is here:
Sample records using the updated role / subrole taxonomy are here:
- Example Person Record [New Role / Subrole Taxonomy]
- Example Company Record [New Role / Subrole Taxonomy]
- Example IP Record [New Role / Subrole Taxonomy]
Change expected in: October 2024
Previous Announcements: July 2024, August 2024
Products Impacted: Person / Company / IP Schema
Person Fields Impacted | Company Fields Impacted | IP Fields Impacted |
---|---|---|
countries street_addresses.country possible_street_addresses.country job_company_location_country experience.company.location.country education.school.country | location.country employee_count_by_country | ip.location.country ip.company.location.country |
In October 2024, we will be updating the set of canonical country values to better accommodate geographical renamings and to correct redundancies in the current set.
This change is part of an ongoing effort to improve the overall location standardization process within our data. As such, it will impact the location country values in our Person, Company, and IP datasets.
The updated set of country values that will be released in October is here:
Additionally, a mapping from the current country values to the upcoming country values can be found here:
Country (pre-October) | Change Type | Country (post-October) | Comments |
---|---|---|---|
swaziland | renamed | eswatini | |
antarctica | deleted | -- | |
macedonia | renamed | north macedonia | |
pitcairn | renamed | pitcairn islands | |
gambia | renamed | the gambia | |
ivory coast | deleted | -- | Redundant with côte d'ivoire |
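For customers who store country values downstream, the rows above could be applied with a small lookup like the one below. This is a minimal sketch covering only the six rows shown in the table: the function name migrate_country and the fold_redundant option are hypothetical, and folding ivory coast into côte d'ivoire is an assumption based on the table's comment, not a documented rule.

```python
from typing import Optional

# Renames taken from the mapping table above.
RENAMED_COUNTRIES = {
    "swaziland": "eswatini",
    "macedonia": "north macedonia",
    "pitcairn": "pitcairn islands",
    "gambia": "the gambia",
}

# Values removed from the canonical set, per the table above.
DELETED_COUNTRIES = {"antarctica", "ivory coast"}

# Assumption: a value flagged as redundant can be folded into its
# surviving equivalent instead of being dropped.
REDUNDANT_WITH = {"ivory coast": "côte d'ivoire"}


def migrate_country(value: Optional[str], fold_redundant: bool = True) -> Optional[str]:
    """Translate a pre-October country value into its post-October form.

    Returns None when the value is missing or no longer canonical.
    """
    if value is None:
        return None
    key = value.strip().lower()
    if fold_redundant and key in REDUNDANT_WITH:
        return REDUNDANT_WITH[key]
    if key in DELETED_COUNTRIES:
        return None
    # Values not listed in the table pass through unchanged.
    return RENAMED_COUNTRIES.get(key, key)
```

Calling migrate_country with fold_redundant=False treats ivory coast strictly as deleted, which may be preferable if you want to review those records by hand.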
Change expected in: October 2024
Products Impacted: Person / Company Schema
Person Fields Impacted | Company Fields Impacted |
---|---|
job_company_type experience.company.type | type |
Next month, we will be adding a new public_subsidiary value to the set of canonical company type values:
Canonical Company Types (current) | Canonical Company Types (from October 2024) |
---|---|
• educational • government • nonprofit • private • public | • educational • government • nonprofit • private • public • public_subsidiary --> new type |
This change builds on this month's release of improved stock ticker coverage and is meant to better enable customers to roll up Company Insights information to public companies. The public_subsidiary company type specifically is intended to give customers a mechanism to easily filter and pull all public companies and their subsidiaries.
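Once the new value is available, filtering for public companies together with their subsidiaries could look roughly like this. The sketch assumes company records are plain dictionaries with a type key; the sample records and the helper name are hypothetical and only illustrate the idea.

```python
from typing import Iterable, List

# Canonical company types after the planned update (from the table above).
CANONICAL_COMPANY_TYPES = {
    "educational",
    "government",
    "nonprofit",
    "private",
    "public",
    "public_subsidiary",
}

# The two types that together cover public companies and their subsidiaries.
PUBLIC_FAMILY_TYPES = {"public", "public_subsidiary"}


def public_companies_and_subsidiaries(records: Iterable[dict]) -> List[dict]:
    """Keep only records typed as public or public_subsidiary."""
    return [r for r in records if r.get("type") in PUBLIC_FAMILY_TYPES]


# Hypothetical records, for illustration only.
sample = [
    {"name": "example parent", "type": "public"},
    {"name": "example subsidiary", "type": "public_subsidiary"},
    {"name": "example startup", "type": "private"},
]
public_family = public_companies_and_subsidiaries(sample)
```

Any code that validates type against a fixed list of five values will need the sixth value added before October, otherwise public_subsidiary records may be rejected.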
Company Data Improvements (Company Schema)
This month, we are excited to release a series of improvements to our company data:
Stock Ticker Improvements
We’ve improved the accuracy of our stock ticker and stock ticker exchange fields across our company dataset. This fixes some data quality issues in previous builds, but more importantly also allows us to more regularly update and refresh this information, providing better coverage of recent changes to public companies. Furthermore, we now provide more reliable stock ticker information for companies trading on multiple exchanges by prioritizing US exchanges and primary tickers.
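One way to picture the prioritization described above is a simple ordering over a company's listings. The sketch below is illustrative only and is not PDL's build logic: the listing dictionaries, the is_primary flag, the ticker values, and the short US_MICS set are all assumptions made for the example.

```python
from typing import List, Optional

# MIC codes for NYSE and Nasdaq. Illustrative, not an exhaustive list
# of US exchanges.
US_MICS = {"XNYS", "XNAS"}


def pick_listing(listings: List[dict]) -> Optional[dict]:
    """Choose one listing, preferring US exchanges, then primary tickers."""
    if not listings:
        return None
    # False sorts before True, so US listings win first and primary
    # tickers break ties within the same group.
    return min(
        listings,
        key=lambda l: (l["mic_exchange"] not in US_MICS, not l.get("is_primary", False)),
    )


# Hypothetical company listed on two exchanges.
listings = [
    {"ticker": "EXMP", "mic_exchange": "XLON", "is_primary": True},
    {"ticker": "EXMP", "mic_exchange": "XNYS", "is_primary": False},
]
chosen = pick_listing(listings)
```

In this ordering the exchange preference outranks the primary flag, which is one possible reading of "prioritizing US exchanges and primary tickers"; the reverse ordering would be an equally simple change to the key.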
inferred_revenue Improvements
As part of these improvements, we also updated our methodology for estimating inferred_revenue ranges for companies and now weight public SEC data more heavily in our predictions. While our previous revenue estimation approach was regarded very positively by customers, we have seen meaningful improvements as a result of these updates.
Easier Parent/Subsidiary Aggregations
Our new ultimate_parent_ticker and ultimate_parent_mic_exchange fields make it easy to search the PDL Company dataset for all the subsidiaries of a particular parent company. For example, it is now possible to use our Company Search API to query for all the subsidiaries of a public company like Apple by using the search criteria ultimate_parent_ticker=AAPL. Additionally, the public_subsidiary company type arriving next month (see Upcoming Breaking Changes above) will make it easy for users to identify private companies that are owned by public companies in our dataset.
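As a rough sketch of the Apple example above, the snippet below builds a request body for a search on ultimate_parent_ticker. It assumes the Company Search API accepts an Elasticsearch-style query object and a size parameter; the helper name subsidiaries_query is hypothetical, and the expected casing of ticker values should be checked against the API documentation before relying on it.

```python
import json


def subsidiaries_query(parent_ticker: str, size: int = 10) -> dict:
    """Build a search body matching records whose ultimate parent has this ticker."""
    return {
        "query": {
            "bool": {
                "must": [
                    # Exact match on the new field added in v27.2.
                    {"term": {"ultimate_parent_ticker": parent_ticker}},
                ]
            }
        },
        # Assumption: number of records to return per request.
        "size": size,
    }


body = subsidiaries_query("AAPL")
payload = json.dumps(body)  # serialized form, ready to send with a request
```

The snippet deliberately stops before making a network call; sending the payload requires an API key and the endpoint details from the Company Search API reference.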
September 2024 is a Minor Release
As a reminder, September 2024 is a minor release. This means that the information reported below is based on the past month of updates (rather than the three months of updates we typically report for major releases).
The number of jobs and locations verified in our datasets over the past month (based on the job_last_verified and location_last_updated fields).
Dataset | Geography | Field | Records Updated |
---|---|---|---|
Resume | Global | experience | 43,052,999 |
Resume | Global | location | 108,828,647 |
Resume | United States | experience | 10,525,371 |
Resume | United States | location | 26,701,379 |
The number of person records where the primary job experience changed in our Person Dataset over the past month (based on the job_last_changed field).
Dataset | Geography | Field | Records Updated |
---|---|---|---|
Resume | Global | experience | 1,431,816 |
Resume | US | location | 450,280 |
Linkage | Coverage in August 2024 | Coverage in September 2024 | Increase (%) |
---|---|---|---|
total_records | 725,973,888 | 725,541,785 | -0.06% |
mobile_phone | 62,390,071 | 66,775,208 | 7.03% |
birth_date | 8,955,725 | 10,055,892 | 12.28% |
Linkage | Coverage in August 2024 | Coverage in September 2024 | Increase (%) |
---|---|---|---|
total_records | 489,934,338 | 565,883,659 | 15.50% |
Linkage | Coverage in August 2024 | Coverage in September 2024 | Increase (%) |
---|---|---|---|
total_records | 992,916,344 | 793,731,451 | -20.06% |
Linkage | Coverage in August 2024 | Coverage in September 2024 | Increase (%) |
---|---|---|---|
total_records | 823,900,163 | 636,335,238 | -22.77% |
personal_emails | 563,585,485 | 407,293,539 | -27.73% |
street_addresses | 353,693,755 | 238,152,475 | -32.67% |
Linkage | Coverage in August 2024 | Coverage in September 2024 | Increase (%) |
---|---|---|---|
total_records | 71,656,772 | 72,060,224 | 0.56% |
gics_sector | 21,523 | 27,700 | 28.70% |
mic_exchange | 22,956 | 27,776 | 21.00% |
ticker | 23,793 | 27,776 | 16.74% |
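The Increase (%) column in the tables above is consistent with a plain month-over-month percent change, rounded to two decimals. A minimal sketch for reproducing it from the two coverage counts (the function name pct_change is ours, not part of any PDL tooling):

```python
def pct_change(before: int, after: int) -> float:
    """Percent change from the earlier count to the later one, to two decimals."""
    return round((after - before) / before * 100, 2)


# Counts copied from the tables above.
mobile_phone = pct_change(62_390_071, 66_775_208)
ticker = pct_change(23_793, 27_776)
personal_emails = pct_change(563_585_485, 407_293_539)
```

Negative results correspond to the rows where coverage dropped, such as the email and phone record counts.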
- We saw a >16% increase in our coverage of stock tickers and a >20% increase in our coverage of exchange-related information in our company dataset as part of our stock ticker improvements this month.
- We saw a significant increase in our coverage of mobile phone information across both the resume (7%) and mobile phone (15%) datasets. This increase in coverage came despite our purging of low-quality phone numbers from our datasets and build sources (see Improvements section below).
- The Phone and Email datasets saw significant decreases in records as a result of the removal of low-quality data sources identified through our mobile phone QA updates.
  - We made the decision to remove these sources since most of the lift they provided was outdated data (e.g. earthlink.net emails, historic addresses, and home phone numbers)
  - We were also observing a material number of inaccuracies in the data
- We improved the accuracy of the stock ticker and ticker exchange fields in our company dataset and improved our build process to enable more frequent and accurate updates to these fields.
- We made several improvements to our mobile phone data, including removing numbers not registered to mobile phones and removing a source of low-quality mobile phone data from our build.
- We no longer allow generic work emails in the primary work_email field.
- We fixed a bug where PDL legacy company LinkedIn URLs were being selected over the current LinkedIn URL.
- We fixed a bug where experience objects with a start_date before the founding date of a company were being nullified (e.g. Ivan Zhao, founder of Notion, was missing his experience at Notion).