<aside> 💡

Due to time and resource restrictions, and the limited data size, we implemented semi-automatic validation. For future versions, this process will be automated to run validations against dictionary.csv (data integrity) and master.csv (semantic relations integrity)

</aside>

Validation rules

Done (using Google spreadsheet validation dropdown menus in the taxonomy)

Done (no duplicate alias found in the taxonomy)

Done (in dictionary with Google spreadsheet function)

Done (in dictionary, using Find regex)

Done (in dictionary)

Done

Irrelevant (v1 as the baseline)

Done (vlookup)

Done (vlookup)

Done

Done

Done (136 unique aliases)

Done (pivot table)

OK

Done ( (vlookup)

OK

Done (pivot # of unique parent maps to canonical title)

Done (using Gemini)

Done

Done (split labels by “|”, then vlookup)