ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR

Classifications

MinEdu publication type
A4 Article in conference proceedings (peer-reviewed)
Definition
Article
Target group
Scientific
Peer reviewed
Peer-reviewed
Article type
Other article
Host publication type
Conference platform

Authors of the publication

Number of authors
5
Authors
Singh, Vishwanath Pratap; Malato, Federico; Hautamäki, Ville; Sahidullah, Md.; Kinnunen, Tomi

Publication channel information

Title of host publication
Proceedings of Interspeech 2024
Editors of host publication
Lapidot, Itshak; Gannot, Sharon
Name of conference
INTERSPEECH
Title of journal/series
Interspeech
ISSN (print)
2308-457X
ISSN (electronic)
2958-1796
ISSN (linking)
2958-1796
Publisher
International Speech Communication Association (ISCA)
Publication forum ID
91794
Publication forum level
1
Internationality
Yes

Detailed publication information

Publication year
2024
Reporting year
2024
Page numbers
2885-2889
DOI
10.21437/Interspeech.2024-1403
Language of publication
English

Co-publication information

International co-publication
Yes
Co-publication with a company
No

Classification and additional information

MinEdu field of science classification
113 Computer and information sciences