MODELING COGNITIVE CONSTRAINTS IN SIGNED-TO-SPOKEN TRANSLATION: A MULTIMODAL NEURAL APPROACH
Main Article Content
Аннотация:
In recent years, the field of sign language translation (SLT) has evolved rapidly due to advances in deep learning and multimodal computing. However, despite this progress, translation from signed to spoken languages remains a complex challenge due to fundamental cognitive, linguistic, and modality-specific constraints. This paper investigates how cognitive factors — including working memory limits, multimodal perception, and temporal alignment — affect the process of translating visual-manual languages into spoken forms. The study also explores how multimodal neural networks, especially transformer-based architectures, can model and mitigate these constraints. Using statistical data and comparative analysis from developed countries (the USA, UK, Germany) and emerging contexts, the paper highlights the current state of SLT research, key datasets, and BLEU score benchmarks, providing recommendations for future development of inclusive AI systems that serve deaf and hard-of-hearing communities worldwide.
Article Details
Как цитировать:
Библиографические ссылки:
De Coster, M., Shterionov, D., Van Herreweghe, M., Dambre, J. (2022). Machine Translation from Signed to Spoken Languages: State of the Art and Challenges. arXiv:2202.03086.
Camgöz, N. C. (2020). Neural Sign Language Recognition and Translation. PhD thesis, University of Surrey.
Kita, S. (2023). Gesture links language and cognition for spoken and signed. Frontiers in Psychology.
Mohammed, R., Aljarrah, I., Al-Ayyoub, M., Fadel, A. (2025). Multimodal Multisource Neural Machine Translation. Computation, 13(8):194.
Jiang, Z., Moryossef, A., Müller, M., Ebling, S. (2022). Machine Translation between Spoken and Signed Languages Represented in SignWriting. arXiv:2210.05404.
Mercanoğlu Sincan, Ö., Camgöz, N. C., Bowden, R. (2023). Is Context All You Need? Scaling Neural Sign Language Translation to Large Domains of Discourse. arXiv:2308.09622.
Kan, J., Hu, K., Hagenbuchner, M., Tsoi, A. C., Bennamoun, M., Wang, Z. Y. (2021). Sign Language Translation with Hierarchical Spatio-Temporal Graph Neural Network. arXiv:2111.07258.
