Person Schema
Breakdown of the PDL Fields
Overview
This page contains a listing of all the fields contained in the Person Dataset. Fields are broken out into the following sections:
- Base Person Fields: Common fields available to all customers by default
- Person Risk Attributes: Premium identity-risk-related fields with enhanced tracking and sourcing information
- Premium Resume Fields: Premium resume-related fields such as raw-text job titles and summary descriptions
For access to fields beyond the Base Person Fields, please speak to one of our data consultants.
Base Person Fields
Formatting Notes
The dot notation indicates that the property is one level deep inside a hash. If a field is not present then we’ll return an empty list or null value for that field.
FIELD NAME | FIELD TYPE | Persistence Commitments and Format | SHORT DESCRIPTION | EXAMPLE |
---|---|---|---|---|
id | String | PDL persistent ID | qEnOZ5Oh0poWnQ1luFBfVw_0000 | |
full_name | String | Free text field | The first and the last name fields appended with a space | sean thorne |
first_name | String | Free text field | A person's first name | sean |
middle_initial | String | Free text field, single character | A person's middle initial | f |
middle_name | String | Free text field | A person's middle name | fong |
last_initial | String | Free text field, single character | The first letter of a person's last name | t |
last_name | String | Free text field | A person's last name | thorne |
gender | String | Canonical Genders | The person's gender | male |
birth_year | Date * | Date string. YYYY-MM-DD. Represented as Int for Data License customers (see October 2021 Release Notes (v16)) | Approximated birth date associated with this person profile. If a profile has a birth_date, the birth_year field will match | 1990 |
birth_date | Date | Date string in YYYY-mm-dd format. Can be accurate to the day (YYYY-MM-DD), month (YYYY-MM) or year (YYYY). If this exists, birth_year will agree with this | Birth date associated with this person profile | 1990-12-03 |
linkedin_url | String | Url of the form linkedin.com/in/<linkedin_username> -- not necessarily an active profile | Main linkedin profile for this record based on source agreement | linkedin.com/in/seanthorne |
linkedin_username | String | Free text field with blacklisted values | Main linkedin username for this record based on source agreement | seanthorne |
linkedin_id | String | Main linkedin profile id for this record based on source agreement | 145991517 | |
facebook_url | String | Url of the form facebook.com/<facebook_username> | facebook profile | facebook.com/deseanthorne |
facebook_username | String | facebook username | deseanthorne | |
facebook_id | String | persistent facebook id associated with a person's facebook profile | 1089351304 | |
twitter_url | String | Twitter URL | twitter.com/seanthorne5 | |
twitter_username | String | Twitter Username | seanthorne5 | |
github_url | String | Url of the form github.com/<github_username> | Main github profile for this record based on source agreement | github.com/deseanathan_thornolotheu |
github_username | String | Main github profile username for this record based on source agreement | deseanathan_thornolotheu | |
work_email | String | Valid email address formatting | Current Professional email | [email protected] |
personal_emails | Array (String) | List of all emails tagged as type = personal | ["[email protected]", "[email protected]"] | |
mobile_phone | String | Highly confident direct dial mobile phone associated with this person | 17095944554 | |
industry | Enum (String) | The most relevant industry for this record based primarily on their tagged personal industries and secondarily on the industries of the companies that they have worked for | computer software | |
job_title | String | A person's current job title | co-founder and chief executive officer | |
job_title_role | Enum (String) | Canonical Job Roles | A person's current job title derived role | operations |
job_title_sub_role | Enum (String) | Canonical Job Sub Roles | A person's job title derived subrole. Each subrole maps to a role | logistics |
job_title_levels | Array (Enum (String)) | Canonical Job Title Levels | A person's current job title derived levels | ["cxo", "owner"] |
job_company_id | String | A person's current company's PDL ID | peopledatalabs | |
job_company_name | String | A person's current company's name | people data labs | |
job_company_website | String | A person's current company's website | peopledatalabs.com | |
job_company_size | Enum (String) | Canonical Company Sizes | A person's current company's size range | 51-200 |
job_company_founded | Date * | Date string. YYYY-MM-DD. Represented as Int for Data License customers (see October 2021 Release Notes (v16)) | A person's current company's founded date | 1911 |
job_company_industry | Enum (String) | Canonical Industries | A person's current company's industry | computer software |
job_company_linkedin_url | String | A person's current company's linkedin url | linkedin.com/company/peopledatalabs | |
job_company_linkedin_id | String | A person's current company's linkedin id | 1640694639 | |
job_company_facebook_url | String | A person's current company's facebook url | facebook.com/peopledatalabs | |
job_company_twitter_url | String | A person's current company's twitter url | twitter.com/peopledatalabs | |
job_company_location_name | String | A person's current company's HQ canonical location | san francisco, california, united states | |
job_company_location_locality | String | A person's current company's HQ locality | san francisco | |
job_company_location_metro | Enum (String) | Canonical Metros | A person's current company's HQ metro area | san francisco, california |
job_company_location_region | String | A person's current company's HQ region | california | |
job_company_location_geo | String | A person's current company's HQ geo | 37.77,-122.41 | |
job_company_location_street_address | String | A person's current company's HQ street_address | 455 Market Street Suite 1690 | |
job_company_location_address_line_2 | String | A person's current company's HQ address line 2 | suite 1690 | |
job_company_location_postal_code | String | A person's current company's HQ postal code | 94105 | |
job_company_location_country | Enum (String) | Canonical Countries | A person's current company's HQ country | united states |
job_company_location_continent | Enum (String) | Canonical Continents | A person's current company's HQ continent | north america |
job_last_updated | Date | Date string. YYYY-MM-DD | The timestamp a new data source contributed to this record for the current job indicating either new information or validation of existing information | 2018-11-05 |
job_start_date | Date | Date string. YYYY-MM-DD | Indicates the start period of the object. Can be accurate to the day (YYYY-MM-DD), month (YYYY-MM) or year (YYYY) | 2015-03 |
location_name | String | the current canonical location of the person | berkeley, california, united states | |
location_locality | String | the current locality of the person | berkeley | |
location_metro | Enum (String) | Canonical Metros | the current MSA of the person | san francisco, california |
location_region | String | the current region of the person | california | |
location_country | Enum (String) | Canonical Countries | the current country of the person | united states |
location_continent | Enum (String) | Canonical Continents | the current continent of the person | north america |
location_street_address | String | the current street address of the person | 455 fake st | |
location_address_line_2 | String | the current address line 2 of the person | apartment 12 | |
location_postal_code | String | the current postal code of the person | 94704 | |
location_geo | String | the current geo of the person | 37.87,-122.27 | |
location_last_updated | Date | Date string. YYYY-MM-DD | The timestamp a new data source contributed to this record for the current location indicating either new information or validation of existing information | 2018-11-05 |
phone_numbers | Array (String) | Phone numbers associated with this person profile in E164 format | 17095944554 | |
emails | Array (Object) | Emails associated with this person profile. | ||
emails.address | String | The full parsed email | [email protected] | |
emails.type | Enum (String) | The type of email either current_professional, professional, personal or null | professional | |
interests | Array (String) | Interests associated with the profile | ||
skills | Array (String) | Skills associated with the profile | ||
location_names | Array (String) | list of all canonical location names associated with the person | ["berkeley, california, united states", "san francisco, california, united states"] | |
regions | Array (String) | List of regions associated with the person | ["california, united states"] | |
countries | Array (Enum (String)) | Canonical Countries | list of countries associated with a person | ["united states"] |
street_addresses | Array (Object) | List of full parsed addresses associated with the person | List of full parsed addresses associated with the person | |
street_addresses.street_address | String | The street address associated with the location object | 455 fake st | |
street_addresses.address_line_2 | String | The secondary street address associated with the location object | apartment 12 | |
street_addresses.name | String | A string that appends location fields together to create a standard location field | berkeley, california, united states | |
street_addresses.locality | String | The administrative locality associated with the location object | berkeley | |
street_addresses.metro | Enum (String) | Canonical Metros | The metro area associated with the location object | san francisco, california |
street_addresses.region | String | The administrative region associated with the location object | california | |
street_addresses.postal_code | String | The postal code associated with the location object | 94704 | |
street_addresses.country | String | The country associated with the location object | united states | |
street_addresses.geo | String | The geolocation associated with the location object in latitude, longitude format | 37.87,-122.27 | |
street_addresses.continent | Enum (String) | Canonical Continents | The continent associated with the country in the location object | north america |
experience | Array (Object) | Experience objects associated with this person profile. When outputted as a csv, the indexing is based on recency and associativity | ||
experience.title | Object | A dictionary object that provides a canonicalized title, role and level. For raw titles, see the premium field experience.title.raw . | ||
experience.title.name | String | The inputted title from our data sources with some basic cleaning and mapping in order to help with merging | chief executive officer and co-founder | |
experience.title.levels | Array (Enum (String)) | Canonical Job Title Levels | Levels associated with a title | ["cxo"] |
experience.title.role | Enum (String) | Canonical Job Roles | A person's job title derived role | operations |
experience.title.sub_role | Enum (String) | Canonical Job Sub Roles | A person's job title derived subrole. Each subrole maps to a role | logistics |
experience.company | Object | A dictionary of information for the associated company | ||
experience.company.id | String | Our current NOT PERSISTENT ids that tie company data to the canonical data | peopledatalabs | |
experience.company.name | String | The name associated with the company | people data labs | |
experience.company.website | String | The website associated with the company | peopledatalabs.com | |
experience.company.founded | Date * | Date string. YYYY-MM-DD. Represented as Int for Data License customers (see October 2021 Release Notes (v16)) | The year that the company was founded | 1911 |
experience.company.size | Enum (String) | Canonical Company Sizes | The size range of the company | 51-200 |
experience.company.industry | Enum (String) | Canonical Industries | The industry associated with the company | computer software |
experience.company.linkedin_url | String | The linkedin url associated with the company | linkedin.com/company/peopledatalabs | |
experience.company.linkedin_id | String | The linkedin id associated with the company | 1640694639 | |
experience.company.facebook_url | String | The facebook url associated with the company | facebook.com/peopledatalabs | |
experience.company.twitter_url | String | The twitter associated with the company | twitter.com/peopledatalabs | |
experience.start_date | Date | Date string. YYYY-MM-DD | Indicates the start period of the object. Can be accurate to the day (YYYY-MM-DD), month (YYYY-MM) or year (YYYY) | 2015-03 |
experience.end_date | Date | Date string. YYYY-MM-DD | Indicates the end period of the object | 2011-07 |
experience.location_names | Array (String) | Canonical locations associated with this particular job/experience object (where the person is working, which may or may not be where the company is headquartered.) | ["san francisco, california, united states"] | |
experience.is_primary | Boolean | Indicates if the experience is the primary experience object in our dataset. This experience object will exist in the job_XXX fields | TRUE | |
experience.company.location | Object | A dictionary of information for the associated company location | ||
experience.company.location.street_address | String | Company HQ address | 455 Market Street Suite 1690 | |
experience.company.location.address_line_2 | String | The adress line 2 associated with the company HQ | suite 1690 | |
experience.company.location.name | String | The canonical location name associated with the company HQ | san francisco, california, united states | |
experience.company.location.locality | String | Company locality | san francisco | |
experience.company.location.metro | Enum (String) | Canonical Metros | Company metro area | san francisco, california |
experience.company.location.region | String | Company region | california | |
experience.company.location.country | Enum (String) | Canonical Countries | Company country | united states |
experience.company.location.postal_code | String | The postal code associated with the company HQ | 94105 | |
experience.company.location.continent | Enum (String) | Canonical Continents | The continent associated with the company HQ | north america |
experience.company.location.geo | String | The geo code associated with the company HQ | 37.77,-122.41 | |
education | Array (Object) | Education objects associated with this person profile. When outputted as a csv, the indexing is based on recency and associativity | ||
education.degrees | Array (Enum (String)) | Canonical Education Degrees | A list of canonical degrees associated with this education object | ["bachelors", "bachelors of arts"] |
education.majors | Array (Enum (String)) | Canonical Education Majors | A list of majors associated with this education object | ["entrepreneurship"] |
education.minors | Array (Enum (String)) | Canonical Education Majors | A list of minors associated with this education object | ["business"] |
education.school | Object | A dictionary of information for the associated school | ||
education.school.id | String | Our current NOT PERSISTENT ids that tie company data to the canonical data | a56df063-4562-4e59-bc4c-68b33c14df1e | |
education.school.name | String | The name associated with the school | university of oregon | |
education.school.website | String | The website associated with the school, could include subdomains | business.uoregon.edu | |
education.school.domain | String | The website associated with the school | uoregon.edu | |
education.school.type | Enum (String) | Canonical School Types | The type of school | post-secondary institution |
education.school.linkedin_url | String | The linkedin url associated with the school | linkedin.com/school/university-of-oregon | |
education.school.linkedin_id | String | The linkedin ID associated with the school | 19207 | |
education.school.facebook_url | String | The facebook url associated with the school | facebook.com/universityoforegon | |
education.school.twitter_url | String | The twitter url associated with the school | twitter.com/uoregon | |
education.school.location | Object | The location associated with the school | ||
education.school.location.name | String | The canonical name of the location associated with the school | eugene, oregon, united states | |
education.school.location.locality | String | The locality associated with the school | eugene | |
education.school.location.region | String | The region associated with the school | oregon | |
education.school.location.country | Enum (String) | Canonical Countries | The country associated with the school | united states |
education.school.location.continent | Enum (String) | Canonical Continents | The continent associated with the school | north america |
education.start_date | Date | Date string. YYYY-MM-DD | Indicates the start period of the object | 2008-09 |
education.end_date | Date | Date string. YYYY-MM-DD | Indicates the end period of the object | 2013-06 |
education.gpa | Float | The gpa associated with the given degree | 2.23 | |
profiles | Array (Object) | Social media profiles associated with this person profile | ||
profiles.url | String | The url of the social profile | linkedin.com/in/seanthorne | |
profiles.id | String | the persistent id related to this social profile (varies by social network) | 145991517 | |
profiles.network | Enum (String) | Canonical Profile Networks | The network the profile exists on | |
profiles.username | String | The username associated with the profile | seanthorne | |
operation_id | String | Currently only in data license deliveries. Allows PDL employees to identify the timestamp and operations performed on the internal data to return a record in a delivery. | acee3bde2e1a2cb7e75c57b80d5b7bc2d5de5b02e7ea51f91304c28df77251dc | |
version_status | Object | Deprecated in API Responses | Allows customers track the pervious and current dataset version, any other persistent IDs that were merged into this record using improved entity resolution, and the status of the record | |
version_status.contains | Array (String) | Deprecated in API Responses | List of IDs merged into this record since the last release | ["qEnOZ5Oh0poWnQ1luFBfVw_0000"] |
version_status.current_version | String | Deprecated in API Responses | Current version of the data | 15 |
version_status.previous_version | String | Deprecated in API Responses | Previous version of the data | 14 |
version_status.status | String | Deprecated in API Responses. Canonical Version Status | Explains what happened to this record between the previous release and the current release | updated |
dataset_version | String | Explains the current major or minor release number. | 17.1 |
Person Risk Attributes
Formatting Notes
The dot notation indicates that the property is one level deep inside a hash. If a field is not present then we’ll return an empty list or null value for that field.
FIELD NAME | FIELD TYPE | Persistence Commitments and Format | SHORT DESCRIPTION | EXAMPLE |
---|---|---|---|---|
profiles.first_seen | Date | Date string. YYYY-MM-DD | The date when this profile was first associated to this record | 2017-06-02 |
profiles.last_seen | Date | Date string. YYYY-MM-DD | The date when this profile was last associated to this record | 2019-07-18 |
profiles.num_sources | Integer | Greater than 0 | The number of sources that have contributed to the association of this profile to this record | 17 |
emails.first_seen | Date | Date string. YYYY-MM-DD | The date when this email was first associated to this record | 2017-06-02 |
emails.last_seen | Date | Date string. YYYY-MM-DD | The date when this email was last associated to this record | 2019-07-18 |
emails.num_sources | Integer | Greater than 0 | The number of sources that have contributed to the association of this email to this record | 17 |
phones | Array (Object) | A list of the phone numbers known to be associated with this record | ||
phones.number | String | A phone number in E164 format | An individual phone number associated with this record | 17095944554 |
phones.first_seen | Date | Date string. YYYY-MM-DD | The date when this phone number was first associated to this record | 2017-06-02 |
phones.last_seen | Date | Date string. YYYY-MM-DD | The date when this phone number was last associated to this record | 2019-07-18 |
phones.num_sources | Integer | Greater than 0 | The number of sources that have contributed to the association of this phone number to this record | 17 |
street_addresses.first_seen | Date | Date string. YYYY-MM-DD | The date when this street address was first associated to this record | |
street_addresses.last_seen | Date | Date string. YYYY-MM-DD | The date when this street address was last associated to this record | |
street_addresses.num_sources | Integer | Greater than 0 | The number of sources contributed that have contributed to the association of this street address to this record | 17 |
linkedin_connections | Integer | Typically 0-500, some are the true values over 500 | The number of linkedin connections the person has | 432 |
facebook_friends | Integer | Greater than or equal to 0 | The number of facebook friends the person has | 3912 |
name_aliases | Array (String) | Any additional associated names or aliases beyond the primary one currently displayed in the name field | ["andrew nichol", "r andrew nichol", "robert nichol"] | |
possible_emails | Array (Object) | Any additional associated emails to this person record with a lower level of confidence than the currently-displayed ones in the emails array | same as emails array | |
possible_phones | Array (String) | Any additional associated phones to the person record with a lower level of confidence to the currently-displayed ones in the phone_numbers array | same as phone_numbers array | |
possible_profiles | Array (Object) | Any additional associated profiles to the person record w/ a lower level of confidence to the currently-displayed ones in the profiles | same as profiles array | |
possible_street_addresses | Array (Object) | Any additional associated addresses to the person record with a lower level of confidence to the currently-displayed ones in the street_addresses array | same as street addresses array | |
possible_birth_dates | Array (String) | Any additional associated birth dates to the person record with a lower level of confidence to the currently-displayed one in the birth_date field | ["1991-05-26", "1992-05-26"] | |
possible_location_names | Array (String) | Inferred potential locations the person has lived in based on phone area codes, university location, other associations | same as location_names array | |
experience.first_seen | Date | Date string. YYYY-MM-DD | The date when this professional position was first associated to this record in our data | 2016-10-12 |
experience.last_seen | Date | Date string. YYYY-MM-DD | The date when this professional position was last associated to this record in our data | 2022-01-04 |
job_history | Array (Object) | Any additional professional positions associated to this person record beyond the ones we currently display in the experience array. Usually these are positions that have been removed or changed on resumes | [{"company": "twisselman enterprises", "title": "food service supervisor", "first_seen": "2016-05-17"}] | |
job_history.first_seen | Date | Date string. YYYY-MM-DD | The date when this professional position was first associated to this record in our data | 2016-05-17 |
job_history.last_seen | Date | Date string. YYYY-MM-DD | The date when this professional position was last associated to this record in our data | 2020-05-30 |
num_records | Integer | Greater than 0 | The number of unique raw records contributing to this specific PDL profile | |
num_sources | Integer | Greater than 0 | The number of unique sources contributing to this specific PDL profile | 172 |
first_seen | Date | Date string. YYYY-MM-DD | The date when this record was first created in our data | 2017-06-02 |
Premium Resume Fields
Formatting Notes
The dot notation indicates that the property is one level deep inside a hash. If a field is not present then we’ll return an empty list or null value for that field.
FIELD NAME | FIELD TYPE | Persistence Commitments and Format | SHORT DESCRIPTION | EXAMPLE |
---|---|---|---|---|
certifications | Array (Object) | Certification objects associated with this person profile | ||
certifications.end_date | Date | Date string. YYYY-MM-DD | Expiration of certification | 2018-03 |
certifications.name | String | Free Text Field | Inputted name of the certification | machine learning certification |
certifications.organization | String | Free Text Field | Inputted organization awarding certification | coursera |
certifications.start_date | Date | Date string. YYYY-MM-DD | Start date of certification | 2019-04 |
education.raw | Array (String) | Free Text Field | Raw education input information. Parsed into the degrees/majors/minors fields | bachelors of arts in entrepreneurship, business minor |
education.school.raw | Array (String) | Free Text Field | Raw school names | university of oregon |
education.summary | String | Free Text Field, Cased | User-inputted summary of education | when i was at oregon i volunteered at a local homeless shelter 3 days a week |
experience.company.raw | Array (String) | Free Text Field | Raw company names | google -- analytics team |
experience.company.ticker | String | Free Text Field, capitalized | Company Ticker, (only for public companies) | GOOG |
experience.company.type | Enum (String) | Canonical Company Types | Company Type | public |
experience.summary | String | Free Text Field, Cased | User-inputted summary of experience | worked on the "search analytics" team to understand our users better |
experience.title.raw | Array (String) | Free Text Field | Raw titles | chief executive officer and co-founder |
inferred_salary | Enum (String) | Canonical Inferred Salary Ranges | inferred salary range -- Contact Us to learn how we infer | 70,000-85,000 |
inferred_years_experience | Integer | Between 0 and 100 | Inferred years work experience | 7 |
job_company_ticker | String | Free Text Field, capitalized | Current Company Ticker | GOOG |
job_company_type | Enum (String) | Canonical Company Types | Current Company Type | public |
job_onet_code | String | The 8 digit O*NET code for a person’s current job title, following the 2018 SOC guidelines. | ”11-1011.00” | |
job_onet_major_group | String | The O*NET Major Group associated with a person’s current job title. | ”Management Occupations” | |
job_onet_minor_group | String | The O*NET Minor Group associated with a person’s current job title. | “Top Executives” | |
job_onet_broad_occupation | String | The O*NET Broad Occupation category associated with a person’s current job title. | “Chief Executives” | |
job_onet_specific_occupation | String | The O*NET Detailed Occupation category associated with a person’s current job title. | “Chief Executives” | |
job_onet_title | String | A more detailed job title for records where the specific occupation within O*NET's standard hierarchy isn't granular enough to accurately describe the job title. e.g. The highest level of granularity for c-suite positions is Chief Executives. | “Chief Technology Officer” | |
job_summary | String | Free Text Field, Cased | User-inputted summary of experience | worked on the "search analytics" team to understand our users better |
languages | Array (Object) | Self-identified languages spoken | ||
languages.name | String | Canonical Languages | Name of the canonical language the person inputted | english |
languages.proficiency | Enum (Integer) | Score 1-5 | Self-identified proficiency score 1 (limited) - 5 (fluent) | 5 |
linkedin_connections | Integer | Typically 0-500, some are the true values over 500 | Number of linkedin connections | 50 |
summary | String | Free Text Field, Cased | Self-written summaries tied to person profile (often linkedin summaries) | growth-hacker and digital nomad |
Updated 5 days ago