Companies
Summary
This collection of data includes over nine million global companies, with information such as names, domains, sizes, years founded, industries, localities, countries and LinkedIn URLs. All companies in this dataset have at least one employee in the PDL data, removing many of the companies in our Company Stats.
To download the data, go to http://www.peopledatalabs.com/company-dataset.
Fields
Field Name | Field Type | Persistence Commitments and Format | Short Description | Example |
---|---|---|---|---|
country | Enum (String) | Canonical Countries | The country of company's current headquarters. | united states |
founded | Integer | Greater than 0 | The foundation year of the company. | 2015 |
id | String | PDL company ID. This is currently non-persistent and generated from the company's primary LinkedIn username. | peopledatalabs | |
industry | Enum (String) | Canonical Industries | The self-reported industry -- the enum is from LinkedIn's standard industries. | computer software |
linkedin_url | String | The primary company LinkedIn URL. | linkedin.com/company/peopledatalabs | |
locality | String | The locality of company's current headquarters. | san francisco | |
name | String | The company's main common name. | people data labs | |
region | String | The region of company's current headquarters. | california | |
size | Enum (String) | Canonical Company Sizes | A range representing the number of people working at the company. | 11-50 |
website | String | The primary company website. | peopledatalabs.com |
Downloading the dataset
We provide the dataset in CSV, pipe-delimited and JSON formats. We have found that many customers prefer the CSV format, but this format is very large (nine million lines) and common programs like Excel and Numbers can't open it. To do so properly, you have to do it programmatically. For example, by using the Python CSV Library.
License
You may use this data for any purpose. We have released it under the terms of the Creative Commons Attribution license (CC BY 4.0 - https://creativecommons.org/licenses/by/4.0/).
Updated 5 months ago