仕訳帳情報
Computer Speech and Language
https://www.sciencedirect.com/journal/computer-speech-and-language
インパクト ・ ファクター:
3.100
出版社:
Elsevier
ISSN:
0885-2308
閲覧:
18977
追跡:
30
論文募集
An official publication of the International Speech Communication Association (ISCA)

Computer Speech & Language publishes reports of original research related to the recognition, understanding, production, coding and mining of speech and language.

The speech and language sciences have a long history, but it is only relatively recently that large-scale implementation of and experimentation with complex models of speech and language processing has become feasible. Such research is often carried out somewhat separately by practitioners of artificial intelligence, computer science, electronic engineering, information retrieval, linguistics, phonetics, or psychology.

The journal provides a focus for this work, and encourages an interdisciplinary approach to speech and language research and technology. Thus contributions from all of the related fields are welcomed in the form of reports of theoretical or experimental studies, tutorials, reviews, and brief correspondence pertaining to models and their implementation, or reports of fundamental research leading to the improvement of such models.

Research Areas Include

    Algorithms and models for speech recognition and synthesis
    Natural language processing for speech understanding and generation
    Statistical computational linguistics
    Computational models of discourse and dialogue
    Information retrieval, extraction and summarization
    Speaker and language recognition
    Computational models of speech production and perception
    Signal processing for speech analysis, enhancement and transformation
    Evaluation of human and computer system performance
最終更新 Dou Sun 2024-07-16
Special Issues
Special Issue on Multi-Speaker, Multi-Microphone, and Multi-Modal Distant Speech Recognition
提出日: 2024-12-02

Automatic speech recognition (ASR) has significantly progressed in the single-speaker scenario, owing to extensive training data, sophisticated deep learning architectures, and abundant computing resources. Building on this success, the research community is now tackling real-world multi-speaker speech recognition, where the number and nature of the sound sources are unknown and changing over time. In this scenario, refining core multi-speaker speech processing technologies such as speech separation, speaker diarization, and robust speech recognition is essential, and the effective integration of these advancements becomes increasingly crucial. In addition, emerging approaches, such as end-to-end neural networks, speech foundation models, and advanced training methods (e.g., semi-supervised, self-supervised, and unsupervised training) incorporating multi-microphone and multi-modal information (such as video and accelerometer data), offer promising avenues to alleviate these challenges. This special issue gathers recent advances in multi-speaker, multi-microphone, and multi-modal speech processing studies to establish real-world conversational speech recognition. Guest editors: Assoc. Prof. Shinji Watanabe (Executive Guest Editor) Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America Email: shinjiw@ieee.org Areas of Expertise: Speech recognition, speech enhancement, and speaker diarization Dr. Michael Mandel Reality Labs, Meta, Menlo Park, California, United States of America Email: mmandel@meta.com Areas of Expertise: Source separation, noise robust ASR, electromyography Dr. Marc Delcroix NTT Corporation, Chiyoda-Ku, Japan Email: marc.delcroix@ieee.org; marc.delcroix@ntt.com Areas of Expertise: Robust speech recognition, speech enhancement, source separation and extraction Dr. Leibny Paola Garcia Perera Johns Hopkins University, Baltimore, Maryland, United States of America Email: lgarci27@jhu.edu Areas of Expertise: Speech recognition, speech enhancement, and speaker diarization, multimodal speech processing Dr. Katerina Zmolikova Meta, Menlo Park, California, United States of America Email: kzmolikova@meta.com Areas of Expertise: Speech separation and extraction, speech enhancement, robust speech recognition Dr. Samuele Cornell Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America Email: scornell@andrew.cmu.edu Areas of Expertise: Robust speech recognition, speech separation and enhancement Special issue information: Relevant research topics include (but are not limited to): Speaker identification and diarization Speaker localization and beamforming Single- or multi-microphone enhancement and source separation Robust features and feature transforms Robust acoustic and language modeling for distant or multi-talker ASR Traditional or end-to-end robust speech recognition Training schemes: data simulation and augmentation, semi-supervised, self-supervised, and unsupervised training for distant or multi-talker speech processing Pre-training and fine-tuning of speech and audio foundation models and their application to distant and multi-talker speech processing Robust speaker and language recognition Robust paralinguistics Cross-environment or cross-dataset performance analysis Environmental background noise modeling Multimodal speech processing Systems, resources, and tools for distant Speech Recognition In addition to traditional research papers, the special issue also hopes to include descriptions of successful conversational speech recognition systems where the contribution is more in the implementation than the techniques themselves, as well as successful applications of conversational speech recognition systems. For example, the recently concluded seventh and eighth CHiME challenges serve as a focus for discussion in this special issue. The challenge considered the problem of conversational speech separation, speech recognition, and speaker diarization in everyday home environments from multi-microphone and multi-modal input. Seventh and eighth CHiME challenges consist of multiple tasks based on 1) distant automatic speech recognition with multiple devices in diverse scenarios, 2) unsupervised domain adaptation for conversational speech enhancement, 3) distant diarization and ASR in natural conferencing environments, and 4) ASR for multimodal conversations in smart glasses. Papers reporting evaluation results on the CHiME-7/8 datasets or other datasets dealing with real-world conversational speech recognition are equally welcome. Manuscript submission information: Tentative Dates: Submission Open Date: August 19, 2024 Manuscript Submission Deadline: December 2, 2024 Editorial Acceptance Deadline: September 1, 2025 Contributed full papers must be submitted via Computer Speech & Language online submission system (Editorial Manager®): https://www.editorialmanager.com/ycsla/default2.aspx. Please select the article type “VSI: Multi-DSR” when submitting the manuscript online. Please refer to the Guide for Authors to prepare your manuscript: https://www.elsevier.com/journals/computer-speech-and-language/0885-2308/guide-for-authors For any further information, the authors may contact the Guest Editors. Keywords: Speech recognition, speech enhancement/separation, speaker diarization, multi-speaker, multi-microphone, multi-modal, Distant Speech Recognition, CHiME challenge
最終更新 Dou Sun 2024-07-16
関連仕訳帳
CCF完全な名前インパクト ・ ファクター出版社ISSN
International Journal of Computer Integrated Manufacturing3.700Taylor & Francis0951-192X
Annals of Mathematics and Artificial Intelligence1.200Springer1012-2443
International Journal of Health Geographics3.000Springer1476-072X
International Journal of Control1.600Taylor & Francis0020-7179
cSoft Computing3.100Springer1432-7643
cNeural Processing Letters2.600Springer1370-4621
IT Professional2.200IEEE1520-9202
Journal of Optimization Theory and Applications1.600Springer0022-3239
Information Security Technical ReportElsevier1363-4127
Language Learning & Technology3.800University of Hawaii Press1094-3501
完全な名前インパクト ・ ファクター出版社
International Journal of Computer Integrated Manufacturing3.700Taylor & Francis
Annals of Mathematics and Artificial Intelligence1.200Springer
International Journal of Health Geographics3.000Springer
International Journal of Control1.600Taylor & Francis
Soft Computing3.100Springer
Neural Processing Letters2.600Springer
IT Professional2.200IEEE
Journal of Optimization Theory and Applications1.600Springer
Information Security Technical ReportElsevier
Language Learning & Technology3.800University of Hawaii Press
関連会議
CCFCOREQUALIS省略名完全な名前提出日通知日会議日
FSPSEInternational Conference on Frontiers of Signal Processing and Software Engineering2022-11-152022-11-202022-11-25
aa*a1SODAACM-SIAM Symposium on Discrete Algorithms2024-07-05 2025-01-12
ICCISInternational Conference on Computational and Information Sciences2014-03-102014-03-202014-05-30
bab1SATInternational Conference on Theory and Applications of Satisfiability Testing2024-03-082024-05-222024-08-21
ICETCAInternational Conference on Electronics Technology and Computer Applications2020-07-152020-07-202020-07-28
AAME'International Conference on Aerospace, Aerodynamics and Mechatronics Engineering2022-06-02 2022-07-22
SaCoNeTInternational Conference on Smart Communications in Network Technologies2018-07-312018-09-052018-10-27
AmIEuropean Conference on Ambient Intelligence2019-07-192019-08-122019-11-13
HPTSInternational Workshop on High Performance Transaction Systems 2011-06-012011-10-23
WebSciACM Web Science2024-11-302025-01-312025-05-20
おすすめ