Published July 18, 2018 | Version 0.2
Video/Audio Open

voiceHome corpus: A corpus dedicated to distant-microphone speech processing in domestic environments

  • 1. IRISA - CNRS UMR 6074, Rennes, France
  • 2. VoiceBox Technologies France
  • 3. Université de Lorraine, LORIA, UMR 7503, Vandoeuvre-lès-Nancy, F-54506, France
  • 4. CRPCC, Université Rennes 2, 35043 Rennes Cedex, France

Description

Purpose:

This corpus contains reverberant, noisy speech signals spoken by native French speakers in a lounge and recorded by an 8-microphone device at various angles and distances and under various noise conditions.

Also provided are room impulse responses and noise-only signals recorded in various real rooms and homes, along with baseline speaker localization and enhancement software.

This corpus stands apart from other corpora in the field by the number of rooms and homes considered and by the fact that it is publicly available at no cost.

 

Other materials:

Documentation (in French):

The corpus documentation is available both inside the archive and as a separate file: voiceHome_corpus_french_documentation_v1.2.pdf.

Terms of use

You may use the corpus for non-commercial scientific purposes provided that you acknowledge it in any written work or software derived from its use. In a published article, paper, or report, the corpus must appear in the bibliographical references.

Consent to distribution of speaker recordings

All participants gave informed, signed consent to the public distribution of their recorded sentences.

New corpus version available: voiceHome-2 corpus

A new version of the corpus is available on the voiceHome-2 corpus web page.

 

Files

voiceHome_corpus v0.2.zip

Files (1.9 GB)

Checksum                                  Size
md5:1a98cace6a3ceda9736e107185baf3a6      1.9 GB
md5:da952914ea614cfe3f4ee08abc434459      702.2 kB
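
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal Python sketch, not part of the corpus distribution. The helper names `md5_of_file` and `matches_checksum` are hypothetical, and pairing the first checksum with the 1.9 GB archive voiceHome_corpus v0.2.zip is an assumption based on the listed sizes.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks
    so that a multi-gigabyte archive does not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 digest with a listed value such as 'md5:1a98...'."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[4:]
    return md5_of_file(path) == expected


if __name__ == "__main__":
    # Assumption: the archive was saved under its listed name in the current
    # directory, and the first listed checksum belongs to it.
    archive = Path("voiceHome_corpus v0.2.zip")
    listed = "md5:1a98cace6a3ceda9736e107185baf3a6"
    if archive.exists():
        print("checksum OK" if matches_checksum(archive, listed) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually indicates an interrupted or corrupted download, in which case the file should be fetched again before extraction.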