pre-process script¶

Utilities module for pre-processing audio stimuli

Run on the command line, e.g.:

$ python preprocess.py

Note

This module has dependencies not required by the CAQE web application. To install these dependencies, run pip install -r analysis_requirements.txt.

preprocess.generate_source_separation_anchors(directory=None, file_list=None)[source]¶

Generate the PEASS-style anchors for use in a source separation evaluation.

“The distorted target anchor is created by low-pass filtering the target source signal to a 3.5 kHz cut-off frequency and by randomly setting 20% of the remaining timefrequency coefficients to zero.”

“The artifacts anchor is ... created by randomly setting 99% of the time-frequency coefficients of the target to zero and by adjusting the loudness of the resulting signal to that of the target.” - Note that we simply used RMS instead of the ISO 352B loudness model as discussed in the paper.

Parameters:	directory (str) – Input directory of audio files to process. Either this or file_list must be defined. Default is None. file_list (list of str) – List of audio files to process. Either this or directory must be defined. Default is None.
Returns:
Return type:	None

References

[1]	Emiya, V., et al. Subjective and Objective Quality Assessment of Audio Source Separation. IEEE Transactions on Audio, Speech, and Language Processing, 19(7): 2046-2057, 2011.

preprocess.rms_normalize(directory=None, file_list=None, suffix=None, target_rms=None)[source]¶

This utility performs rms normalization on a directory or list of files. Note files must be WAV files.

Parameters:

directory (str) – Input directory of audio files to process. Either this or file_list must be defined. Default is None.
file_list (list of str) – List of audio files to process. Either this or directory must be defined. Default is None.
suffix (str) – The suffix to append to the output filenames. If None, then the input files will be overwritten. Default is None.
target_rms (float) – The target RMS to which we normalize. If None, then calculate the minimum RMS of the peak normalized files and normalize to that.

Returns:

output_file_list (list of str)
pre_norm_values (list of float)
post_norm_values (list of float)