Issue Description
There is no pre-2023 data on native language; this may or may not be fixable, BUT this bigger deal is that the at least the Spanish speakers are recorded as being Estonian, and there are a number of codes which have no languages associated with them.
In addition, there are a number of languages that are marked as inactive in Slate, for no clear reason:
Troubleshooting/Research
https://tufts.box.com/s/igdq1h2w19if9r478lwj7wssrmink01t
There are only 1492 records in TUV that have a value in the Native Language field, and 2022 is the earliest round, but that is an absolute outlier--that was added by hand by RR after the cycle was over (and the student didn't even attend).
Resolution Steps
- activate all non-duplicate languages in Slate
- purge any duplicates
- add extended values as necessary
- review the 1492 records we have, and make sure their languages are correct.
The mappings from last year are wrong here:
CAS API → Slate
Czech → Chinese (Shanghai)
Gujarati → Guarani
Hmong → Vietnamese
Serbo-Croatian → Sinhala
Tagalog → Tshiluba
Tamil → Tagalog
- Pull the languages list from CAS, and make sure all are included, and map
0 Comments