UCL - Communications and Remote Sensing Laboratory (TELE)

M2VTS Multimodal Face Database
Release 1.00

ACTS Project 102

Sound files

Sound files (.raw) contain raw sample data with no header. The format is 16-bit unsigned linear, one channel, and the sampling frequency is 48 kHz.

Each image of the voice sequence corresponds to 1920 sound samples (25 images per second at 48 kHz). The first sample of each image is given by:

   Image no  :    1 ...    2 ...    26 ... n
   Sample no :    1 ... 1921 ... 48001 ... 1+1920(n-1)
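
The mapping above can be sketched in a few lines of Python. This is only an illustration: the function names first_sample and byte_range are hypothetical, and the byte offsets assume the headerless, 2-bytes-per-sample, single-channel layout described above.

```python
SAMPLES_PER_IMAGE = 1920   # 48000 samples/s divided by 25 images/s
BYTES_PER_SAMPLE = 2       # 16-bit samples, one channel


def first_sample(image_no):
    """Return the 1-based number of the first sample of image `image_no`."""
    if image_no < 1:
        raise ValueError("image numbers start at 1")
    return 1 + SAMPLES_PER_IMAGE * (image_no - 1)


def byte_range(image_no):
    """Return (start, end) byte offsets in the .raw file for one image.

    `start` is inclusive and `end` is exclusive; offsets are 0-based
    because the file has no header.
    """
    start = (first_sample(image_no) - 1) * BYTES_PER_SAMPLE
    return start, start + SAMPLES_PER_IMAGE * BYTES_PER_SAMPLE
```

For instance, first_sample(26) gives 48001, matching the table, and byte_range(2) gives the slice of the file holding the sound for the second image.
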
The raw format can be used as is or converted to your preferred format with the SoX (Sound eXchange) software. For example, to convert .raw to .au (Sun/NeXT standard), type:

> sox -V -t raw -r 48000.000 -u -w -c 1 input.raw -t au -r 48000.000 -s -w -c 1 output.au
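
If SoX is not at hand, a similar conversion can be sketched with the Python standard library. This is a minimal sketch, not part of the database tools: the function names are hypothetical, it writes WAV rather than .au, and the byte order of the .raw samples is not stated on this page, so it is left as a parameter to be checked against your files.

```python
import array
import sys
import wave

SAMPLE_RATE = 48000  # Hz, as stated for the .raw files


def unsigned16_to_signed16(raw_bytes, byteorder="little"):
    """Convert 16-bit unsigned linear samples to signed 16-bit samples.

    `byteorder` ("little" or "big") is an assumption about the file;
    the page does not specify it.
    """
    samples = array.array("H")          # unsigned 16-bit items
    samples.frombytes(raw_bytes)
    if byteorder != sys.byteorder:
        samples.byteswap()              # bring samples to native order
    # Shift the unsigned range 0..65535 to the signed range -32768..32767.
    return array.array("h", (s - 32768 for s in samples))


def raw_to_wav(raw_path, wav_path, byteorder="little"):
    """Read a headerless .raw file and write a mono 48 kHz WAV file."""
    with open(raw_path, "rb") as f:
        signed = unsigned16_to_signed16(f.read(), byteorder)
    if sys.byteorder == "big":
        signed.byteswap()               # WAV data is little-endian
    with wave.open(wav_path, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(SAMPLE_RATE)
        w.writeframes(signed.tobytes())
```

Calling raw_to_wav("input.raw", "output.wav") would then produce a playable file; if the result sounds like noise, the other byte order is the first thing to try.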


Last modified December 18, 1996.
Author: Stéphane Pigeon