CS281r
Topics in AI: Representation and Learning of Music
MW 1-2:30
Maxwell Dworkin 221


Course Description

Generating and understanding music presents a major challenge to artificial intelligence.  This class will focus on the problems of representing musical knowledge, and of learning how to understand and reproduce a style.  This interdisciplinary class is aimed both at computer science students who wish to delve into issues of knowledge representation and machine learning, and at music students who wish to learn computational techniques that can be applied to music.  Projects will be done in groups containing both musicians and computer scientists.

Prerequisites

This is an interdisciplinary course, and both music and computer science students will be able to contribute.  Therefore there are separate prerequisites for music students and computer science students.  For computer science students, the prerequisites are CS181 or CS182 and the ability to read music; for music students, Music 154, Music 157x or equivalent.  The prerequisites are flexible, and may be waived in particular cases by permission of instructor.

Enrollment in the course is limited; if necessary, preference will be given to students with more background.

Teaching Staff

Professor: Avi Pfeffer
Office: Maxwell Dworkin 251
Office Hours: Thursday 4-5

TF: Emir Kapanci
Office: Maxwell Dworkin 217
OH: TBA

TF: Chris Thorpe
Office Maxwell Dworkin 238
OH:  TBA


Calendar Notes

Wednesday, September 7th: class will meet in Maxwell Dworkin 323
Monday, October 6th: no class due to Yom Kippur
Friday, October 10th: make-up class for Yom Kippur
Monday, October 13th: no class due to Columbus Day
Friday, October 24th: make-up class for ISMIR
Monday, October 27th: no class due to ISMIR conference
Wednesday, October 29th: no class due to ISMIR conference
Friday, October 31st: make-up class for ISMIR
Wednesday, November 26th: no class due to Thanksgiving


Requirements

The course will have three kinds of requirements: reading and participation, accounting for 30% of the grade, four homeworks (40%) and a final project (30%).  

A question will be assigned on the readings for each lecture.  The questions will be designed to lead you into thinking about the readings. A short, one paragraph answer will be expected, to be handed in at the beginning of the lecture in which the readings are discussed.  The reading and participation grade will be based on completing the reading and answering the question, and on participating in class discussion.

The homeworks will be done in large teams, consisting of both musicians and computer scientists.  It is expected that everyone will have something to contribute to the team.  Each team will make a presentation to the class about their approach to the problem, and also provide a written report including a statement of work detailing the contributions of each group member.

The final project will consist of an investigation by an individual or small group into a topic on the interface between AI and music.  Milestones will be set throughout the course.  The project will be due at the end of reading period.


Syllabus (subject to change)

M 9/15: introduction (Hiller&Isaacson)
W 9/17: ontologies of music (Selfridge-Field)
M 9/22: music file formats (Csele, Hewlett&Selfridge-Field, Bainbridge, Cahill, Recordare)
W 9/24: music corpora (Huron, Schaffrath)
M 9/29: optical music recognition (Bainbridge&Wijaya, Bainbridge&Carter)
W 10/1: HW1 presentations
W 10/8: pitch recognition (Bores,  Cheveigne&Kawahra)
F 10/10: note segmentation (Duxbury et al., Viitaniemi et al.)
W 10/15: rhythm tracking 1 (Dixon)
M 10/20: rhythm tracking 2 (Cemgil & Kappen)
W 10/22: polyphonic pitch recognition (Marolt, Sterein et al)
F 10/24: HW2 presentations
F 10/31: harmonic analysis (Winograd)
M 11/3: pattern in music (Simon&Sumner, Nevill-Manning&Witten)
W 11/5: hierarchical structure (Lerdahl&Jackendoff, Bod)
M 11/10: melodic structure (Narmour)
W 11/12: HW3 presentations
M 11/17: Melodic similarity (O'Maidin, Crawford et al)
W 11/19: Harmonization (Hornel&Menzel, Thorpe)
M 11/24: Composition and Improvisation (Cope, Thom)
M 12/1: Accompaniment (Raphael)
W 12/3: HW4 presentations
M 12/8: Performance (Arcos et al., Widmer)
W 12/10: Project presentations
M 12/15: Project presentations

Readings

Sources
MMoM: Machine Models of Music, Stephan M. Schwanauer and David A. Levitt editors, MIT Press, 1993.
BM: Beyond MIDI, Eleanor Selfridge-Field editor, MIT Press, 1997.

tPoM: The Psychology of Music, Diana Deutsch editor, 2nd edition, Elsevier 1999.
MS: Melodic Similarity: Concepts, Procedures and Applications, Walter B. Hewlett and Eleanor Selfridge-Field editors, MIT Press, 1998.

[Hiller & Isaacson] Lejaren Hiller and Leonard Isaacson, "Musical Composition with a High-Speed Digital Computer", MMoM.

[Selfridge-Field] Eleanor Selfridge-Field, "Describing Musical Information", BM.
[Csele] Mark Csele, "The WAV File Format", web page, http://www.technology.niagarac.on.ca/courses/comp630/WavFileFormat.html.
[Hewlett & Selfridge-Field] Walter B. Hewlett and Eleanor Selfridge-Field, with David Cooper, Brent A. Field, Kia-Chuan Ng and Peer Sitter, "MIDI", BM.
[Bainbridge] David Bainbridge, "Csound", BM.
[Cahill] Margaret Cahill, "The Translation of Finale's Enigma File Format for CPNView", MSc thesis, University of Limerick, 1998.
[Recordare] Recordare, "The MusicXML Tutorial", web page, http://www.musicxml.org/xml/musicxml-tutorial.pdf.
[Huron] David Huron, "Humdrum and Kern: Selective Feature Encoding", BM.
[Shaffrath] Helmut Schaffrath, "The Essen Associative Code: A Code for Folksong Analysis", BM.
[Bainbridge & Wijaya] David Bainbridge and K Wijaya, "Bulk Processing of Optically Scanned Music", Seventh International Conference on Image Processing and its Applications, 1999.
[Bainbridge & Carter] David Bainbridge and N. Carter, "Automatic Reading of Music Notation", in Handbook of Character Recognition and Document Image Analysis, World Scientific, 1997.
[Bores] Bores Signal Processing, "Introduction to DSP", web-based tutorial, http://www.bores.com/courses/intro.
[Cheveigne & Kawahra] Alain de Cheveigne and Hideki Kawahra, "YIN: A Fundamental Frequency Estimator for Speech Recognition and Thythm Quantization", Journal of the Acoustical Society of America, 111(4), 1917-1930, 2002.
[Duxbury et al.] C. Duxbury, M. Sandler and M.E. Davies, "A Hybrid Approach to Musical Note Onset Detection", 5th International Conference on Digital Audio Effects, 2002.
[Viitaniemi et al.] T. Viitaniemi, Klapuri, A. Eronen, "A Probabilistic Model for the Transcription of Single-Voice Melodies", Finnish Signal Processing Symposium, 2003.
[Dixon] Simon Dixon, "Automatic Extraction of Tempo and Beat from Expressive Performances", Journal of New Music Research, 30(1), 39-58, 2001.
[Cemgil & Kappen] A.T. Cemgil and B. Kappen, "Monte Carlo Methods for Tempo Tracking and Rhythm Quantization", Journal of Artificial Intelligence Research, 18, 45-81.
[Marolt] M. Marolt, "SONIC: Transcription of Polyphonic Piano Music with Neural Networks", Workshop on Current Research Directions in Computer Music, Barecelona, 2001.
[Sterian et al.] Andrew Sterian, Mary H. Simoni and Gregory H. Wakefield, "Model-Based Music Transcription", International Computer Music Conference, 1999.
[Winograd] Terry Winograd, "Linguistics and the Computer Analysis of Tonal Harmony", MMoM.
[Simon&Sumner] Herbert A. Simon and Richard K. Sumner, "Pattern in Music", MMoM.
[Nevill-Manning&Witten] C.G. Nevill-Manning and I.H. Witten, "Identifying Hierarchical Structure in Sequences: A Linear-Time Algorithm", Jounral of Artificial Intelligence Research, 7, 67-82, 1997.
[Lerdahl&Jackendoff] Fred Lerdahl and Ray Jackendoff, "An Overview of Hierarchical Structure in Music", MMoM.
[Bod] Rens Bod, "A Unified Model of Structural Organization in Language and Music", Journal of Artificial Intelligence Research, 17, 289-308, 2002.
[Narmour] Eugene Narmour, "Hierarchical Expectation and Musical Style", tPoM.
[O'Maidin] Donncha O'Maidin, "A Geometrical Algorithm for Melodic Difference", MS.
[Crawford et al.] Tim Crawford, Costas S. Iliopoulos and Rajeev Raman, "String Matching Techniques for Musical Similarity and Melodic Recognition", MS.
[Hornel&Menzel] Dominik Hornel and Wolfram Menzel, "Learning Musical Structure and Style with Neural Networks", Computer Music Journal 22(4), 44-62, 1998.
[Thorpe] Christopher A. Thorpe, "Application of Markov Models to Bach-style Chorale Harmonization", in progress.
[Cope] David Cope, "A Computer Model of Music Composition", MMoM.
[Thom] Belinda Thom, "Interactive Improvisational Music Companionship: A User-Modeling Approach", User-Modeling and User-Adapted Interaction Journal, in print.
[Raphael] Chris Raphael, "A Probabilistic Expert System for Automatic Musical Accompaniment", Journal of Computer and Graphical Statistics, 10(3), 487-512, 2001.
[Arcos et al.] Josep Lluis Arcos, Ramon Lopez de Mantaras and Xavier Serra, "Saxex: A Case-Based Reasoning System for Generating Expressive Musical Performances", Journal of New Music Research, 27(3), 194-210, 1998.
[Widmer] Gerhard Widmer, "Learning Expressive Performance: The Structure-Level Approach", Journal of New Music Research, 25, 179-205, 1996.