We provide an official baseline for the TidyLang 2026 Challenge: a complete toolbox for training and evaluating a Language Identification (LID) model. The baseline uses Wav2Vec2 (Layers 17–24) + a simple projection head + ArcFace loss, and achieves strong performance on the TidyLang dataset.
Code, recipes, and documentation:
https://github.com/areffarhadi/TidyLang2026-baseline
Clone the repository for training scripts, evaluation scripts, ArcFace loss implementation, and full setup instructions.
The challenge has two evaluation tasks:
More detail on protocols, trial formats, and submission will be given in the Evaluation Plan.
pip install -r requirements.txt. Dataset and checkpoint paths are set inside the scripts; edit them to match your setup.bash train.sh with default parameters (or override GPU, batch size, epochs, margin, scale, hidden dim as needed).bash eval.sh to evaluate language identification on the validation manifest (flag=2); reports Micro and Macro accuracy.eval_enrollment.sh, then run bash eval_enrollment.sh to compute EER on enrollment-based trials (enrollment IDs with 20–65 s of audio, trial pairs).See the repository README for detailed usage and file formats (manifest, enrollment manifest, trial file).
| Task | Metric | Result |
|---|---|---|
| Language identification (35 seen languages) | Micro Accuracy | 75.76% |
| Language identification (35 seen languages) | Macro Accuracy | 40.25% |
| Unseen language recognition (enrollment-based) | EER | 34.7% |
eval.sh on the validation split (flag=2) of the training manifest (35 languages).eval_enrollment.sh using enrollment-based trials: each enrollment ID has 20–65 s of audio (multiple files); for each trial, the test utterance is compared with the enrollment ID (e.g. average similarity over enrollment utterances); EER is computed over the trial scores.Training uses flag=1 data only. Validation during training is reported on flag=2 (identification accuracy, language-recognition EER) and flag=3 (cross-lingual accuracy). No trials file is used during training; verification (unseen-language EER) is run separately with eval_enrollment.sh.
For help and technical support regarding the baseline, contact: aref.farhadipour@uzh.ch