If I may insert a relevant plug: we (MERL) just put out a paper last week with S... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

jonathanleroux on Jan 14, 2020 | parent | context | favorite | on: Online speech recognition with wav2letter anywhere

If I may insert a relevant plug: we (MERL) just put out a paper last week with SOTA 7.0 % WER on LibriSpeech test-other (vs wav2letter@anywhere's 7.5%) with 590 ms theoretical latency using joint CTC-Transformer with parallel time-delayed LSTM and triggered attention. Check it out: https://arxiv.org/abs/2001.02674

jonathanleroux on Jan 15, 2020 [–]

Correction: no PTDLSTM here but time-restricted self-attention (duh). PTDLSTM was our previous encoder setup published at ASRU.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact