An Overview of Indian Spoken Language Recognition from Machine Learning Perspective

Spandan Dey; Md Sahidullah; Goutam Saha

doi:10.1145/3523179

Journal article, ACM Transactions on Asian and Low-Resource Language Information Processing, Year: 2022

Spandan Dey

Role: Author

Indian Institute of Technology Kharagpur

Md Sahidullah

Role: Author
PersonId : 737397
IdHAL : sahid

Speech Modeling for Facilitating Oral-Based Communication

Goutam Saha

Role: Author
PersonId : 1090408

Indian Institute of Technology Kharagpur

Abstract

Automatic spoken language identification (LID) is a very important research field in the era of multilingual voice-command-based human-computer interaction (HCI). A front-end LID module helps to improve the performance of many speech-based applications in the multilingual scenario. India is a populous country with diverse cultures and languages. The majority of the Indian population needs to use their respective native languages for verbal interaction with machines. Therefore, the development of efficient Indian spoken language recognition systems is useful for adapting smart technologies in every section of Indian society. The field of Indian LID has started gaining momentum in the last two decades, mainly due to the development of several standard multilingual speech corpora for the Indian languages. Even though significant research progress has already been made in this field, to the best of our knowledge, there are not many attempts to analytically review them collectively. In this work, we have conducted one of the very first attempts to present a comprehensive review of the Indian spoken language recognition research field. In-depth analysis has been presented to emphasize the unique challenges of low-resource and mutual influences for developing LID systems in the Indian contexts. Several essential aspects of the Indian LID research, such as the detailed description of the available speech corpora, the major research contributions, including the earlier attempts based on statistical modeling to the recent approaches based on different neural network architectures, and the future research trends are discussed. This review work will help assess the state of the present Indian LID research by any active researcher or any research enthusiasts from related fields.

Keywords

Language resources; Machine learning; Signal processing systems; Low-resourced languages; Indian language identification; language similarity; corpora development; code-switching; acoustic phonetics; discriminative model

Domains

Artificial Intelligence [cs.AI]; Computer Vision and Pattern Recognition [cs.CV]; Human-Computer Interaction [cs.HC]; Signal and Image Processing [eess.SP]; Linguistics; Machine Learning [stat.ML]

Main file

TALLIP_Overview.pdf (6.26 MB)

Origin: Files produced by the author(s)

Contributor: Md Sahidullah

https://inria.hal.science/hal-03616853

Submitted on: Wednesday, March 23, 2022, 10:20:09

Last modified on: Monday, September 11, 2023, 17:41:19

Long-term archiving on: Friday, June 24, 2022, 18:15:30

Dates and versions

hal-03616853, version 1 (23-03-2022)

Identifiers

HAL Id: hal-03616853, version 1
DOI: 10.1145/3523179

Cite

Spandan Dey, Md Sahidullah, Goutam Saha. An Overview of Indian Spoken Language Recognition from Machine Learning Perspective. ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, 21 (6), pp.1-45. ⟨10.1145/3523179⟩. ⟨hal-03616853⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

66 views

231 downloads
