Continuous speech separation (CSS) is a recently proposed framework that aims to separate each speaker from an input mixture signal in a streaming fashion. We present an evaluation study of practical design considerations for a CSS system, addressing important aspects that have been neglected in recent work. In particular, we focus on the trade-off between separation performance, computational requirements, and output latency, showing how an offline separation algorithm can be used to perform CSS with a desired latency. We carry out an extensive analysis of the choice of CSS processing window size and hop size on sparsely overlapped data. We find that the best trade-off between computational burden and performance is obtained for a window of 5 s.
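To make the roles of window size and hop size concrete, the following is a minimal sketch of how a long recording might be cut into overlapping processing windows, each of which could then be passed to an offline separator. It is not the authors' implementation: the function name `css_windows`, the 16 kHz sample rate, the 12 s recording length, and the 2.5 s hop are all assumptions chosen for illustration; only the 5 s window comes from the abstract. Stitching the separated windows back together and keeping speaker order consistent across windows are omitted.

```python
def css_windows(num_samples, window, hop):
    """Return (start, end) sample spans of sliding windows over a signal.

    Hypothetical helper for illustration only. `window` and `hop` are
    given in samples; the last window is truncated at the signal end.
    """
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    if hop > window:
        raise ValueError("a hop larger than the window would leave gaps")
    if num_samples <= 0:
        return []

    spans = []
    start = 0
    while True:
        end = min(start + window, num_samples)
        spans.append((start, end))
        if end >= num_samples:
            break
        start += hop
    return spans


# Assumed values for illustration: 16 kHz audio, 12 s recording, 2.5 s hop.
sample_rate = 16000
num_samples = 12 * sample_rate
window = 5 * sample_rate          # 5 s window, as in the abstract
hop = int(2.5 * sample_rate)      # assumed hop size

spans = css_windows(num_samples, window, hop)
for start, end in spans:
    print(f"{start / sample_rate:.1f} s to {end / sample_rate:.1f} s")
```

In this sketch a smaller hop produces more windows for the same recording, so the separator runs more often; that is one simple way to picture the computational side of the trade-off the abstract describes.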
Author(s): Morrone, Giovanni; Cornell, Samuele; Zovato, Enrico; Brutti, Alessio; Squartini, Stefano
Affiliation: Università Politecnica delle Marche, Ancona, Italy; PerVoice S.p.A., Trento, Italy; Fondazione Bruno Kessler, Trento, Italy
(See document for exact affiliation information.)
AES Convention: 152
Paper Number: 10562
Publication Date: 2022-05-06
Session subject: Television Audio
Permalink: https://aes2.org/publications/elibrary-page/?id=21675
Morrone, Giovanni; Cornell, Samuele; Zovato, Enrico; Brutti, Alessio; Squartini, Stefano; 2022; Conversational Speech Separation: an Evaluation Study for Streaming Applications [PDF]; Università Politecnica delle Marche, Ancona, Italy; PerVoice S.p.A., Trento, Italy; Fondazione Bruno Kessler, Trento, Italy; Paper 10562; Available from: https://aes2.org/publications/elibrary-page/?id=21675
@article{morrone2022conversational,
  author={Morrone, Giovanni and Cornell, Samuele and Zovato, Enrico and Brutti, Alessio and Squartini, Stefano},
  journal={Journal of the Audio Engineering Society},
  title={Conversational Speech Separation: an Evaluation Study for Streaming Applications},
  year={2022},
  number={10562},
  month={may},
}