Content
- Job titles taxonomy content:
  - 107 canonical titles
  - 136 aliases
  - 17 parent titles
  - 2 classification schemes for function:
    - Function (“Data Labeling” vs “Data Operations Management”)
    - Function-ISO-8K-150 (based on the ISO 8000-150 schema)
- Company data:
  - 282 companies
  - 40 industries
- Labelled dataset:
  - 27 listings labelled with v1 canonical titles
  - Taxonomy coverage estimate (based on this small dataset): 92%
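How the coverage figure was computed is not stated here. One plausible reading is the share of labelled listings that could be assigned a canonical title. The sketch below illustrates that reading only; the function name `coverage` and the toy titles are invented for illustration and are not taken from the taxonomy files.

```python
def coverage(labels, canonical_titles):
    """Return the share of labelled listings whose label is a canonical title.

    Assumption: a listing with an empty label, or a label that is not in the
    taxonomy, counts as not covered.
    """
    canonical = {title.strip().lower() for title in canonical_titles}
    labels = list(labels)
    if not labels:
        raise ValueError("no labelled listings to evaluate")
    covered = sum(
        1 for label in labels if label and label.strip().lower() in canonical
    )
    return covered / len(labels)


# Toy data, invented for illustration only.
canonical_titles = ["Data Annotator", "Data Operations Manager"]
labels = ["Data Annotator", "data operations manager", "", "Data Annotator"]

ratio = coverage(labels, canonical_titles)
print(f"{ratio:.0%}")  # 3 of the 4 toy listings are covered: 75%
```

With only 27 listings, an estimate of this kind moves by several percentage points per listing, so it is best read as a rough indicator.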
Data Access
Through git
- GitHub Project Space: https://github.com/DataOperationsIL/TitleTaxonomy
- V1 Location: [xxxxx]
- Files:
  - master.csv : the full denormalized taxonomy
  - taxonomy-schema.json : a JSON structure describing the taxonomy schema
  - dictionary.csv : a table including lists of unique values for taxonomy data validation
  - company.csv : a list of companies with a data operations department, and metadata
  - labelled-job-descriptions.csv : a labelled job descriptions sample for coverage evaluation
  - readme.json : version metadata
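As a rough illustration of how dictionary.csv might be used to validate master.csv, the sketch below checks each taxonomy row against a set of allowed values. The column names (`field`, `value`, `canonical_title`, `function`) and the sample rows are assumptions made for this example; the real layouts are described by taxonomy-schema.json and may differ.

```python
import csv
import io
from collections import defaultdict

# Hypothetical file contents; the real column names may differ.
DICTIONARY_CSV = """field,value
function,Data Labeling
function,Data Operations Management
"""

MASTER_CSV = """canonical_title,function
Example Title A,Data Labeling
Example Title B,Data Ops
"""


def load_allowed_values(dictionary_file):
    """Map each field name to the set of values the dictionary allows."""
    allowed = defaultdict(set)
    for row in csv.DictReader(dictionary_file):
        allowed[row["field"]].add(row["value"])
    return allowed


def find_invalid_rows(master_file, allowed):
    """Return (line_number, field, value) for every disallowed value."""
    problems = []
    # start=2 because line 1 of the file is the header row
    for line_no, row in enumerate(csv.DictReader(master_file), start=2):
        for field, values in allowed.items():
            if row.get(field) not in values:
                problems.append((line_no, field, row.get(field)))
    return problems


allowed = load_allowed_values(io.StringIO(DICTIONARY_CSV))
problems = find_invalid_rows(io.StringIO(MASTER_CSV), allowed)
print(problems)  # [(3, 'function', 'Data Ops')]
```

To run the same check against the repository files, replace the `io.StringIO` objects with files opened via `open(path, newline="", encoding="utf-8")`.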
Through Google Spreadsheet
Data Operations Job Titles Taxonomy Versioned
Suggested Usages
- Title search expansion: filter on a parent title to broaden a job or candidate search with the similar child titles grouped under that parent.
- Jobs data retrieval and labeling: retrieve online job listings and label them with taxonomy titles.
- Skill analysis on jobs and skill-based knowledge base creation
- Specialties analysis on jobs
- Data Operations IL Salary Survey: use classification schemes and parent-based title expansion to group survey participants into new buckets.
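The title search expansion usage can be sketched as follows. This is a minimal example that assumes master.csv holds one row per alias with `parent_title`, `canonical_title` and `alias` columns; those column names, the `expand_title` helper and the placeholder titles are all hypothetical.

```python
import csv
import io

# Hypothetical excerpt of a denormalized taxonomy: one row per alias.
MASTER_CSV = """parent_title,canonical_title,alias
Parent A,Title A1,Alias A1a
Parent A,Title A1,Alias A1b
Parent A,Title A2,
Parent B,Title B1,Alias B1a
"""


def expand_title(query, rows):
    """Return every canonical title and alias that shares a parent with `query`.

    The query may be a parent title, a canonical title or an alias;
    matching is case-insensitive.
    """
    q = query.strip().lower()
    if not q:
        return []
    # Step 1: find the parent(s) of any row that mentions the query.
    parents = {
        row["parent_title"]
        for row in rows
        if q in (
            row["parent_title"].lower(),
            row["canonical_title"].lower(),
            row["alias"].lower(),
        )
    }
    # Step 2: collect all titles and aliases under those parents.
    expanded = set()
    for row in rows:
        if row["parent_title"] in parents:
            expanded.add(row["canonical_title"])
            if row["alias"]:
                expanded.add(row["alias"])
    return sorted(expanded)


rows = list(csv.DictReader(io.StringIO(MASTER_CSV)))
print(expand_title("alias a1a", rows))
# ['Alias A1a', 'Alias A1b', 'Title A1', 'Title A2']
```

The same parent lookup could serve as the bucketing key for the salary survey usage, by grouping participants on the parent of their reported title.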
Schemes
V1 taxonomy schema
V1 classification schemes
Testing