Publicaciones en las que colabora con Alfons Juan Císcar (31)

2022

  1. Live Streaming Speech Recognition Using Deep Bidirectional LSTM Acoustic Models and Interpolated Language Models

    IEEE/ACM Transactions on Audio Speech and Language Processing, Vol. 30, pp. 148-161

  2. MLLP-VRAIN Spanish ASR Systems for the Albayzín-RTVE 2020 Speech-to-Text Challenge: Extension

    Applied Sciences (Switzerland), Vol. 12, Núm. 2

  3. MLLP-VRAIN UPV systems for the IWSLT 2022 Simultaneous Speech Translation and Speech-to-Speech Translation tasks

    IWSLT 2022 - 19th International Conference on Spoken Language Translation, Proceedings of the Conference

2021

  1. Europarl-ASR: A large corpus of parliamentary debates for streaming ASR benchmarking and speech data filtering/verbatimization

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

  2. Streaming cascade-based speech translation leveraged by a direct segmentation model

    Neural Networks, Vol. 142, pp. 303-315

  3. Towards cross-lingual voice cloning in higher education

    Engineering Applications of Artificial Intelligence, Vol. 105

2020

  1. Europarl-st: A multilingual corpus for speech translation of parliamentary debates

    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

  2. Improved hybrid streaming ASR with transformer language models

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

  3. LSTM-Based One-Pass Decoder for Low-Latency Streaming

    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

2019

  1. Real-time one-pass decoder for speech recognition using LSTM language models

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

2018

  1. MLLP-UPV and RWTH Aachen Spanish ASR Systems for the IberSpeech-RTVE 2018 Speech-to-Text Transcription Challenge

    4th International Conference, IberSPEECH 2018

  2. Speaker-adapted confidence measures for ASR using deep bidirectional recurrent neural networks

    IEEE/ACM Transactions on Audio Speech and Language Processing, Vol. 26, Núm. 7, pp. 1194-1202

2016

  1. ASR confidence estimation with speaker-adapted recurrent neural networks

    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

2015

  1. Window repositioning for printed Arabic recognition

    Pattern Recognition Letters, Vol. 51, pp. 86-93

2014

  1. Comparison of data selection techniques for the translation of video lectures

    Proceedings of the 11th Conference of the Association for Machine Translation in the Americas, AMTA 2014

  2. Discriminative Bernoulli HMMs for isolated handwritten word recognition

    Pattern Recognition Letters, Vol. 35, Núm. 1, pp. 157-168

  3. Handwriting word recognition using windowed Bernoulli HMMs

    Pattern Recognition Letters, Vol. 35, Núm. 1, pp. 149-156

  4. Interactive handwriting recognition with limited user effort

    International Journal on Document Analysis and Recognition, Vol. 17, Núm. 1, pp. 47-59

  5. Statistical text-to-speech synthesis of spanish subtitles

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 8854, pp. 40-48

  6. The translectures-UPV toolkit

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 8854, pp. 269-278