i6_core.corpus.data_augmentation

class i6_core.corpus.data_augmentation.ChangeCorpusSpeedJob(*args, **kwargs)

Changes the speed of all audio files in the corpus. Both time (duration) and frequency are shifted.

Parameters:
  • bliss_corpus (Path) – Bliss corpus

  • corpus_name (str) – name of the new corpus

  • speed_factor (float) – relative speed factor

  • base_frequency (int) – sampling rate of the audio files

run()
tasks()
Returns:

yields Tasks

Return type:

list[sisyphus.task.Task]
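
The phrase "shifting time AND frequency" can be illustrated with a little arithmetic. The sketch below is not the Job's implementation. It only shows the expected effect on a recording, assuming that a speed_factor above 1 means faster playback. The helper names speed_change_effect and reinterpreted_sample_rate are hypothetical.

```python
def speed_change_effect(duration_s, tone_hz, speed_factor):
    """Return (new_duration_s, new_tone_hz) for audio played speed_factor times faster.

    Assumption: speed_factor > 1 speeds the audio up, < 1 slows it down.
    """
    if speed_factor <= 0:
        raise ValueError("speed_factor must be positive")
    # Faster playback shortens the recording and raises every frequency in it.
    return duration_s / speed_factor, tone_hz * speed_factor


def reinterpreted_sample_rate(base_frequency, speed_factor):
    """Sample rate at which the original samples would have to be played back
    to obtain the speed change (before resampling back to base_frequency)."""
    return int(round(base_frequency * speed_factor))


# A 10 s recording containing a 440 Hz tone, sped up by a factor of 2.
new_duration, new_tone = speed_change_effect(10.0, 440.0, 2.0)
print(new_duration, new_tone)
```

Because both values scale with the same factor, this kind of perturbation changes the perceived pitch as well as the duration, unlike a pure tempo change.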

class i6_core.corpus.data_augmentation.SelfNoiseCorpusJob(*args, **kwargs)

Adds noise to each recording in the corpus. The noise consists of audio data from other recordings in the same corpus, attenuated by the given SNR. Only .wav files are supported.

WARNING: This Job uses /dev/shm for performance reasons; please be cautious.

Parameters:
  • bliss_corpus (Path) – Bliss corpus with wav files

  • snr (float) – signal-to-noise ratio in dB, positive values only

  • corpus_name (str) – name of the new corpus

  • n_noise_tracks (int) – number of random (parallel) utterances to add

  • seed (int) – seed for random utterance selection

run()
tasks()
Returns:

yields Tasks

Return type:

list[sisyphus.task.Task]
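
How the snr, n_noise_tracks and seed parameters could interact is sketched below in plain Python. This is an illustration under stated assumptions, not the Job's actual code: it assumes the noise is attenuated by snr dB in amplitude terms, and that noise tracks are drawn from the other recordings with a seeded random generator. All function names (noise_gain, pick_noise_tracks, mix) are hypothetical.

```python
import random


def noise_gain(snr_db):
    """Linear amplitude factor that attenuates a signal by snr_db decibels."""
    return 10.0 ** (-snr_db / 20.0)


def pick_noise_tracks(recordings, target, n_noise_tracks, seed):
    """Choose n_noise_tracks recordings other than target, reproducibly."""
    rng = random.Random(seed)  # local generator, so global random state is untouched
    candidates = [r for r in recordings if r != target]
    return rng.sample(candidates, n_noise_tracks)


def mix(signal, noises, snr_db):
    """Add each attenuated noise track to the signal, sample by sample."""
    gain = noise_gain(snr_db)
    mixed = list(signal)
    for noise in noises:
        # Only the overlapping part is mixed; the signal length is preserved.
        for i in range(min(len(mixed), len(noise))):
            mixed[i] += gain * noise[i]
    return mixed


recordings = ["rec_a", "rec_b", "rec_c", "rec_d"]
tracks = pick_noise_tracks(recordings, "rec_a", n_noise_tracks=2, seed=42)
print(tracks)
print(mix([1.0, 1.0, 1.0], [[1.0, 1.0]], snr_db=20.0))
```

With a fixed seed the same noise tracks are selected on every run, which keeps the augmented corpus reproducible.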