Native Language Inconsistencies

Slate Instance

TUV

Requestor/Reporter

Elizabeth Storrs

Date

Apr 22, 2024

Status

COMPLETE

Bug Description

inconsistencies in native language fields

Issue Description

There is no pre-2023 data on native language; this may or may not be fixable, BUT this bigger deal is that the at least the Spanish speakers are recorded as being Estonian, and there are a number of codes which have no languages associated with them.

image-20240422-210036.png

In addition, there are a number of languages that are marked as inactive in Slate, for no clear reason:

image-20240422-204649.png

Troubleshooting/Research

https://tufts.box.com/s/igdq1h2w19if9r478lwj7wssrmink01t

There are only 1492 records in TUV that have a value in the Native Language field, and 2022 is the earliest round, but that is an absolute outlier--that was added by hand by RR after the cycle was over (and the student didn't even attend).

Resolution Steps

activate all non-duplicate languages in Slate
purge any duplicates
add extended values as necessary
review the 1492 records we have, and make sure their languages are correct. (Uploaded a source format).

The mappings from last year are wrong here:

CAS API → Slate
Czech → Chinese (Shanghai)
Gujarati → Guarani
Hmong → Vietnamese
Serbo-Croatian → Sinhala
Tagalog → Tshiluba
Tamil → Tagalog

Language prompts have been updated