Towards lifelong human assisted speaker diarization

This paper introduces the resources necessary to develop and evaluate human-assisted lifelong learning speaker diarization systems. It describes the ALLIES corpus and its associated protocols, which are specifically designed for the diarization of a collection of audio recordings across time. The dataset is compared to existing corpora, and the performance of three baseline systems, based on x-vectors, i-vectors and VBxHMM, is reported for reference. These systems are then extended with an active correction process that efficiently guides a human annotator in improving the automatically generated hypotheses. An open-source simulated human expert is provided to ensure that the human-assisted correction process is reproducible and fairly evaluated. An exhaustive evaluation of the human-assisted correction shows the high potential of this approach. The ALLIES corpus, a baseline system including the active correction module, and all evaluation tools are made freely available to the scientific community.

Meysam Shamsi; Anthony Larcher; Loic Barrault; Sylvain Meignier; Yevheni Prokopalo; Marie Tahon; Ambuj Mehrish; Simon Petitrenaud; Olivier Galibert; Samuel Gaist; Andre Anjos; Sebastien Marcel; Marta R Costa-Jussà

2022

Publication year: 2022
Volume title: Computer Speech & Language
DOI: https://dx.doi.org/10.1016/j.csl.2022.101437
Appears in types: 3.1 Article in book

Files in this record:

File	Size	Format
1-s2.0-S0885230822000638-main.pdf (not available; type: publisher's version; license: publisher's copyright)	4.22 MB	Adobe PDF

Documents in ARCA are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10278/5105956
