Some Publications

Google Scholar Page

  1. Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
    Bai, Ye, Li, Jie, Han, Wenjing, Ni, Hao, Xu, Kaituo, Zhang, Zhuo, Yi, Cheng, and Wang, Xiaorui
    In Proc. Interspeech 2022 2022
  2. Half-Truth: A Partially Fake Audio Detection Dataset
    Yi, Jiangyan, Bai, Ye, Tao, Jianhua, Ma, Haoxin, Tian, Zhengkun, Wang, Chenglong, Wang, Tao, and Fu, Ruibo
    In Proc. Interspeech 2021 2021
  3. Fast End-to-End Speech Recognition Via Non-Autoregressive Models and Cross-Modal Knowledge Transferring From BERT
    Bai, Ye, Yi, Jiangyan, Tao, Jianhua, Tian, Zhengkun, Wen, Zhengqi, and Zhang, Shuai
    IEEE/ACM Transactions on Audio, Speech, and Language Processing 2021
  4. Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data
    Bai, Ye, Yi, Jiangyan, Tao, Jianhua, Wen, Zhengqi, Tian, Zhengkun, and Zhang, Shuai
    IEEE/ACM Transactions on Audio, Speech, and Language Processing 2021
  5. Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
    Bai, Ye, Yi, Jiangyan, Tao, Jianhua, Tian, Zhengkun, Wen, Zhengqi, and Zhang, Shuai
    In Proc. Interspeech 2020 2020
  6. A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting
    Bai, Ye, Yi, Jiangyan, Tao, Jianhua, Wen, Zhengqi, Tian, Zhengkun, Zhao, Chenghao, and Fan, Cunhang
    In Proc. Interspeech 2019 2019
  7. Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
    Bai, Ye, Yi, Jiangyan, Tao, Jianhua, Tian, Zhengkun, and Wen, Zhengqi
    In Proc. Interspeech 2019 2019
  8. Voice Activity Detection Based on Time-Delay Neural Networks
    Bai, Ye, Yi, Jiangyan, Tao, Jianhua, Wen, Zhengqi, and Liu, Bin
    In 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2019
  9. A Public Chinese Dataset for Language Model Adaptation
    Bai, Ye, Yi, Jiangyan, Tao, Jianhua, Wen, Zhengqi, and Fan, Cunhang
    Journal of Signal Processing Systems 2019
  10. CLMAD: A Chinese Language Model Adaptation Dataset
    Bai, Ye, Tao, Jianhua, Yi, Jiangyan, Wen, Zhengqi, and Fan, Cunhang
    In 2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP) 2018
  11. End-to-end keywords spotting based on connectionist temporal classification for Mandarin
    Bai, Ye, Yi, Jiangyan, Ni, Hao, Wen, Zhengqi, Liu, Bin, Li, Ya, and Tao, Jianhua
    In 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) 2016