New benchmark study explores neural network performance on Tajik POS tagging

By PulseAugur Editorial · [2 sources] · 2026-05-06 07:26

This paper introduces the first benchmark for part-of-speech (POS) tagging in the Tajik language and evaluates several neural network architectures on it. The study uses the TajPersParallel corpus and frames the task as context-independent classification of isolated lexical units. The mBERT model fine-tuned with LoRA performed best, though all models struggled with morphological ambiguity in the absence of syntactic context.

IMPACT Establishes a baseline for NLP tasks in Tajik, highlighting challenges in morphological ambiguity for low-resource languages.
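To illustrate why morphological ambiguity limits any tagger that sees only isolated word forms, here is a minimal Python sketch. It is not the paper's method: the function name `context_free_ceiling` and the toy tokens `w1`, `w2`, `w3` are hypothetical placeholders, not data from the TajPersParallel corpus. It computes the best accuracy a context-independent tagger could reach on a labeled word list by always predicting each form's most frequent tag.

```python
from collections import Counter, defaultdict


def context_free_ceiling(pairs):
    """Upper bound on accuracy for a tagger that sees only the word form.

    Such a tagger must give every occurrence of a form the same tag,
    so its best strategy is to predict the most frequent tag per form.
    """
    if not pairs:
        raise ValueError("pairs must be non-empty")
    counts = defaultdict(Counter)
    for word, tag in pairs:
        counts[word][tag] += 1
    # For each form, only the occurrences carrying the majority tag are correct.
    best = sum(c.most_common(1)[0][1] for c in counts.values())
    return best / len(pairs)


# Made-up toy data (assumption): w1 and w2 are ambiguous forms, w3 is not.
toy = [
    ("w1", "NOUN"), ("w1", "NOUN"), ("w1", "VERB"),
    ("w2", "ADJ"), ("w2", "ADV"),
    ("w3", "NOUN"),
]
ceiling = context_free_ceiling(toy)
print(round(ceiling, 3))  # 4 of 6 tokens can be right -> 0.667
```

On data where every form has a single tag the ceiling is 1.0. Each ambiguous form lowers it regardless of model capacity, which is consistent with the summary's observation that all models struggled without syntactic context.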

RANK_REASON This is a research paper presenting a new benchmark and comparative study of neural architectures for a specific NLP task.



AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

arXiv cs.CL TIER_1 English(EN) · Mullosharaf K. Arabov · 2026-05-07 04:00

Benchmarking POS Tagging for the Tajik Language: A Comparative Study of Neural Architectures on the TajPersParallel Corpus

arXiv:2605.04576v1. Abstract: This paper presents the first benchmark for the task of automatic part-of-speech (POS) tagging for the Tajik language. Despite the existence of multilingual language models demonstrating high effectiveness for many of the world's la…
arXiv cs.CL TIER_1 English(EN) · Mullosharaf K. Arabov · 2026-05-06 07:26

Benchmarking POS Tagging for the Tajik Language: A Comparative Study of Neural Architectures on the TajPersParallel Corpus

This paper presents the first benchmark for the task of automatic part-of-speech (POS) tagging for the Tajik language. Despite the existence of multilingual language models demonstrating high effectiveness for many of the world's languages, their capacity for grammatical analysis…

