Hello, and welcome to the Paper of the Day (Po'D): A Survey of Evaluation in Music Genre Recognition. Today's paper is B. L. Sturm, "A Survey of Evaluation in Music Genre Recognition", Proc. Adaptive Multimedia Retrieval, Copenhagen, Denmark, Oct. 2012.

This paper is best summarized by a particularly riveting line of section 2.2:

The numbers just sort of roll off the tongue. I think I might approach the presentation of this paper like at a humanities conference, where I read it. Aloud. With no slides. It is really only 7 pages of text, and 14 pages of references. I can skip the references.

And in the style of Harvard author name and date referencing, here is the first line of my paper:

A misguided study?

H. Jennings, P. Ivanov, A. Martins, P. da Silva, and G. Viswanathan, "Variance fluctuations in nonstationary time series: a comparative study of music genres," Physica A: Statistical and Theoretical Physics, vol. 336, pp. 585-594, May 2004.

In classic physics style, this work essentially reduces the music signal to an amplitude envelope, and then claims truths about entire genres based on correlations.

The Medium Shapes the Message

Here is a nice article by David Byrne on how the "venue" shapes his sound, as well as that of many other artists, including birds. It reminds me of this fantastic course I took in graduate school about how minimalism (in music) arose and developed in part from the dawn of the LP and hi-fidelity.
The paper, R. B. Dannenberg, B. Thom, and D. Watson, "A machine learning approach to musical style recognition," in Proc. International Computer Music Conf., Thessaloniki, Greece, Sep. 1997, is regarded as the first to explore something like recognizing the genre of a musical signal. It proposes a system to determine the playing style of a musician. However, I have just discovered the following fascinating paper: K.-P. Han, Y.-S. Park, S.-G. Jeon, G.-C. Lee, and Y.-H. Ha, "Genre classification system of TV sound signals based on a spectrogram analysis," IEEE Transactions on Consumer Electronics, vol. 44, pp. 33-42, Feb. 1998. In that paper, they look at discriminating between speech and music, and among the Jazz, Classical, and Popular genres. Not only do they simulate the algorithm, they actually implement the system using circuits and show the results. They also list the musical pieces they put in each genre dataset. Was Kansas Popular in 1998?

Music genre flowchart

flow.png From: T. Zhang, "Semi-automatic approach for music classification," in Proc. SPIE Conf. on Internet Multimedia Management Systems, 2003.

The authors put together a flowchart for automatic classification. I was curious about "detect features of symphony", especially when one only has a 30-second clip: "Since a symphony is composed of multiple movements and repetitions, there is an alternation between relatively high volume audio signal (e.g. performance of the whole orchestra) and low volume audio signal (e.g. performance of single instrument or a few instruments of the orchestra) along the music piece. ... Thus, by checking the existence of alternation between high volume and low volume intervals (with each interval longer than a certain threshold) and/or repetition(s) in the whole music piece, symphonies will be distinguished [from other genres]."

Props to the authors for attempting the impossible, but any flowchart for assigning music genre must be broken from the very first decision. Genres are not uniquely specified by characteristics that mutually exclude others.
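
To see how little the quoted "symphony" test actually asks of a signal, here is a minimal sketch of the volume-alternation check. This is my own reading of the quoted description, not the code from the paper: the frame-wise RMS measure, the function names, and every threshold value are assumptions made for illustration.

```python
import math

def frame_rms(samples, frame_len):
    """Root-mean-square level of each non-overlapping frame."""
    levels = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        levels.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return levels

def long_runs(levels, threshold, min_frames):
    """Label each frame "high" or "low", then keep only the runs
    lasting at least min_frames frames (the "certain threshold")."""
    runs = []
    for level in levels:
        label = "high" if level >= threshold else "low"
        if runs and runs[-1][0] == label:
            runs[-1][1] += 1
        else:
            runs.append([label, 1])
    return [label for label, count in runs if count >= min_frames]

def count_alternations(labels):
    """Number of switches between high and low among consecutive long runs."""
    return sum(1 for a, b in zip(labels, labels[1:]) if a != b)

def looks_like_symphony(samples, frame_len, threshold, min_frames, min_alternations):
    """Hypothetical decision rule: enough long high/low alternations."""
    labels = long_runs(frame_rms(samples, frame_len), threshold, min_frames)
    return count_alternations(labels) >= min_alternations

# Toy signals (made up): constant-amplitude stand-ins for tutti and solo passages.
loud, quiet = [0.8] * 400, [0.05] * 400
alternating = loud + quiet + loud + quiet
steady = [0.8] * 1600

alternating_result = looks_like_symphony(alternating, 100, 0.3, 3, 2)
steady_result = looks_like_symphony(steady, 100, 0.3, 3, 2)
```

With these made-up settings the alternating toy signal passes and the steady one does not. So would anything else with a few long loud and soft stretches, which is rather the point.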

Music genre taxonomy

genretax.png From: J. G. A. Barbedo and A. Lopes, "Automatic genre classification of musical signals," EURASIP Journal on Advances in Signal Processing, 2007.

The authors specify the meaning of each of these labels. For instance, "Dance" music has "strong percussive elements and very marked beating." Stemming from "Dance" there is "Jazz", "characterized by the predominance of instruments like piano and saxophone. Electric guitars and drums can also be present; vocals, when present, are very characteristic." And stemming from "Dance," stemming from "Jazz," there is "Cool", a "jazz style [that is] light and introspective, with a very slow rhythm." The genres "Techno" and "Disco" --- which both emphasize the importance of listening with your body and feet --- do not stem from "Dance," but instead from "Pop/Rock," "the largest class, including a wide variety of songs."

Props to the authors for attempting the impossible, but any taxonomy of music genre must be broken from the very first stem. Genres are not like species, and cannot be arranged like so. (On the plus side, it appears that to differentiate introspective music from non-introspective music requires only four spectral features computed over 21.3 ms windows.)
From 1976, "Daddy Cool" by Boney M. is a certifiable classic Disco tune. Below are Boney M. singing and dancing "Daddy Cool." And I think Bobby Farrell's dancing might be a perfect reflection of cocaine use.

To me this is nearly perfect Disco: a square four on the floor with that typical open hi-hat between beats (this time with a flange!), simple yet memorable figures for strings and saxes, bass, female voices, and don't forget the sexual content! The only thing really wrong with it is that the track is no longer than 4 minutes. (And I would like a funkier bass-line.)

Now consider this 1993 remake by a pop group in Hungary.

Although note for note they are just about the same, to me from the get-go the latter is a sequenced and sterile version missing the essential hi-hats of the original. But is it so far away that I would not classify it as Disco?
Here is Faron Young in 1956 covering Don Gibson's "Sweet Dreams" for him and his eyebrows to get a ticket for the Checkerboard Showboat to continue "following those girls."

Now, here are the Pioneers covering the song 12 years later in a recording released in 1968 (is that a ukulele I hear?).

When I listen to this version, my own eyebrows rise as if to reach across space and time some 44 years to bring the vocals into key. It is precisely because of this, in our autotune-saturated world, that I really like this recording.

Here is Jim Reeves in 1959 singing "He'll Have To Go", this time without the pressure of enduring the Checkerboard Showboat.

Now, here is David Isaacs covering the song 10 years later in a recording released in 1969.

Those backup singers, with their bizarre harmonies intentional or not, are precisely why I can't stand to listen to this recording at a low volume. My wife rolls her eyes when I play it loud, just as I want to always hear it.

Now, from his 1974 masterful record "Rhapsody in White," here is "Love's Theme" by Barry White performed by The Love Unlimited Orchestra.

Aside from the rich orchestration combining two rhythm guitars, roiling piano, lush strings, sweeping harp, and the drums and bass I could listen to for hours alone, I love this particular recording for a few reasons. First, popular music these days that combines classical elements, like strings, is essentially boring. I am looking at you, The Verve and Guns N' Roses. Second, around 1m45, when the horns take in the bridge, there is a wonderful maybe-flub by one of the players. Then from 3m07 to 3m11 the piano loses it, before nearly everything is taken away by a quite artificial but delicious rapid fade out at 3m16, leaving naked the rhythm guitars shivering alone with the bass and drums.


Disco in Bulgaria

Disco --- the music, the dress, the lifestyle --- was a phenomenon that had a clear beginning, peak, and denouement, at least in the USA and the UK. Contrary to what the hundreds of "Now that is what I call Disco" compilations available might suggest, Disco made inroads into many other places in the world --- places other than Western Europe and Scandinavia (ABBA).

I have been communicating with a colleague (NN) who is an expert in Bulgarian popular music, and he has graciously given me permission to quote our conversation. I indent his notes below.
This is one of the best mash-ups I have seen. We need so much more of this.

If you are wondering, that mad piano-playing dancing man is Neil Sedaka singing "Bad Blood":

The second singer is Teddy Pendergrass singing "Close the door":

The man in the beginning is Bob McGrath from Sesame Street. Seeing him takes me back to my childhood when I was an avid watcher. :)

