Top | Calendar | Links | Slides | Readings |
This course covers machine extraction of structure from audio files, including source separation (unmixing audio recordings into individual component sounds), sound object recognition (labeling sounds), melody tracking, beat tracking, and perceptual mapping of audio to machine-quantifiable measures.
This course is approved for the Breadth Interfaces & project requirement in the CS curriculum.
You need prior programming experience sufficient to do the laboratory assignments in Python, implementing algorithms and using libraries without being taught how (there is no language instruction on Python). Having taken EECS 211 and 214 would demonstrate this experience.
Fundamentals of Music Processing
Lecture: Monday, Wednesday, 3:30 - 4:50pm CST in Tech M128
TBD
Please use CampusWire for class-related questions.
You will be graded on a 100 point scale (e.g. 93 to 100 = A, 90-92 = A-, 87-89 = B+, 83-86 = B, 80-82 = B-…and so on).
Homework and reading assignments are solo assignments and must be original work.
Final projects are group assignments and all members of a group will share a grade for all parts of the assignment.
Assignments must be submitted on the due date by the time specified on Canvas. If you are worried you can’t finish on time, upload a safety submission an hour early with what you have. I will grade the most recent item submitted before the deadline. Late submissions will not be graded.
Students can earn a MAXIMUM TOTAL of 10 extra-credit points (a full letter grade):
Paper reviews: You can earn extra credit by submitting reviews of up to 5 extra-credit papers in the field. Each paper review is worth 2 points, for a maximum of 10 paper review points. NOTE: These reviews will be graded very strictly. If I have even a suspicion that a review was not written by you (without any aid, without any generative language model), you will get 0 points.
Week | Date | Topic | Assignment | Points |
---|---|---|---|---|
1 | Tue Apr 1 | Course intro, Recording basics | | |
1 | Wed Apr 2 | How we hear, Frequency & Pitch | | |
2 | Mon Apr 7 | Loudness & Amplitude | | |
2 | Wed Apr 9 | The Fourier Series & Spectrogram | | |
3 | Mon Apr 14 | The Fourier Series & Spectrogram | | |
3 | Wed Apr 16 | Convolution, Reverb | HW 1 | 20 |
4 | Mon Apr 21 | Correlation and Reverb | | |
4 | Wed Apr 23 | Convolution and Filtering | | |
5 | Mon Apr 28 | Time-frequency masking | | |
5 | Wed Apr 30 | Audio Similarity & KNN | HW 2 | 20 |
6 | Mon May 5 | Labeling Sound Events | | |
6 | Wed May 7 | Audio Fingerprinting (Shazam) | | |
7 | Mon May 12 | Deep Learning (briefly) | | |
7 | Wed May 14 | Deep Source Separation | HW 3 | 20 |
8 | Mon May 19 | Deep Embeddings | | |
8 | Wed May 21 | Pitch tracking | | |
9 | Mon May 26 | Deep Models for Audio | HW 4 | 20 |
9 | Wed May 28 | Text2FX | | |
10 | Mon Jun 2 | Gesture Tracking for music control | | |
10 | Wed Jun 4 | Gesture Tracking for music control | HW 5 | 20 |
11 | Wed Jun 11 | Extra credit assignment due | Extra Credit | 10 |
The Fourier Transform and the Spectrogram
Fundamentals of Music Processing, Chapter 1
Fundamentals of Music Processing, Chapter 2 & Section 3.1
Fundamentals of Music Processing, Chapter 4
Fundamentals of Music Processing, Chapter 6
Fundamentals of Music Processing, Chapter 7
* REPET for Background/Foreground Separation in Audio
Chapter 4 of Machine Learning: This is Tom Mitchell’s book. Historical overview + explanation of backprop of error. It’s a good starting point for actually understanding deep nets. IT’S WORTH 2 EXTRA-CREDIT READINGS. THE CATCH IS THAT YOU HAVE TO WRITE 2 PAGES TO GET THE 2 POINTS.
The dummy’s guide to MFCC - an easy, high-level read. Start with this.
From Frequency to Quefrency: A History of the Cepstrum - a historical analysis of the uses of cepstrums
Recovering sound sources from embedded repetition - This is a paper on how humans actually listen to and parse audio based on repetition. Read any time.
Paper coming…as soon as I find one I like.
Audealize: Crowdsourced Audio Production Tools - This describes a new way to make audio FX easy to use. Also, try the demo app.
Deep clustering: Discriminative embeddings for segmentation and separation - Don’t try to read this till you know something about deep learning.
Cerberus - A system that separates sounds in a musical mix and also transcribes them. Wait till you’ve gotten some deep learning education. Also, watch the related video
Lessons learned building a large music recommender system (This one is a video) - This is a talk by the chief researcher on music recommendation on Pandora. Watch any time.
An Industrial-Strength Audio Search Algorithm (Shazam) - Describes how the popular Shazam app for music audio ID works.
OtoMechanic: Auditory Automobile Diagnostics via Query-by-Example - A deep-learning based sound ID system. Also, watch the related video.
A Human-in-the-Loop System for Sound Event Detection and Annotation - An interactive sound-event labeler. Also, watch the related video.
Yin: a fundamental frequency estimator for speech and music - This is, perhaps, the most popular pitch tracker.
Crepe: A Convolutional Representation for Pitch Estimation - A deep learning pitch tracker that improves on Yin.
EECS 352 Final projects from 2017 and 2015
Facebook’s Universal Music Translation
A Coursera course on pitch tracking
U of Iowa’s Music Instrument Samples Dataset
The SocialFX data set of word descriptors for audio
VocalSketch: thousands of vocal imitations of a large set of diverse sounds
Bach10: audio recordings of each part and the ensemble of ten pieces of four-part J.S. Bach chorales
Python Utilities for Detection and Classification of Acoustic Scenes
Librosa audio and music processing in Python
Essentia: an open source music analysis toolkit includes a bunch of feature extractors and pre-trained models for extracting e.g. beats per minute, mood, genre, etc.
Yaafe - audio features extraction toolbox
The Northwestern University Source Separation Library (nussl)
Sonic Visualiser music viz software
LilyPond, open source music notation software
SoundSlice guitar tab and notation website