Data Preparation

The following steps are required to prepare an acoustic model:

  • Prepare audio files in the .wav format described.
    Label these files according to Surah number and Ayah number, following the example transcripts.
  • Write a transcript file matching the exact recitation in each audio file
  • Prepare a language model file using the transcript files
  • Set up and run Sphinxtrain as described in the CMUSphinx section.