Skip to Content
This site provides my research, software, and creative output.

Automated classification of music genre, sound objects, and speech by machine learning.

icmpc.png
Authors: 
Leigh M. Smith, Stephen T. Pope, Jay Leboeuf and Steve Tjoa

Proceedings of the 12th International Conference on Music Perception and Cognition, page 943, Thessaloniki, Greece, July 2012. ICMPC/ESCOM. (abstract).

Abstract: 

A software system, MediaMined, is described for the efficient analysis and classification of auditory signals. This system has been applied to the tasks of musical instrument identification, classifying musical genre, distinguishing between music and speech, and detection of the gender of human speakers. For each of these tasks, the same algorithm is applied, consisting of low-level signal analysis, statistical processing and perceptual modeling for feature extraction, and then supervised learning of sound classes. Given a ground truth dataset of audio examples, textual descriptive classification labels are then produced. Such labels are suitable for use in automating content interpretation (auditioning) and content retrieval, mixing and signal processing. A multidimensional feature vector is calculated from statistical and perceptual processing of low level signal analysis in the spectral and temporal domains. Machine learning techniques such as support vector machines are applied to produce classification labels given a selected taxonomy. The system is evaluated on large annotated ground truth datasets (n > 30000) and demonstrates success rates (F-measures) greater than 70% correct retrieval, depending on the task. Issues arising from labeling and balancing training sets are discussed. The performance of classification of audio using machine learning methods demonstrates the relative contribution of bottom-up signal derived features and data oriented classification processes to human cognition. Such demonstrations then sharpen the question as to the contribution of top-down, expectation based processes in human auditory cognition.

Retro bike light

I'm a big fan of urban biking, having had the privilege of living in Amsterdam and Paris which are both very bike friendly cities for daily commuting. Returning back to NYC, I got a cast off commuter bike from a neighbour, that after considerable work, I got in working order. Given drivers in NYC are nowhere near as aware of cyclists as European drivers, lights are essential. I fitted the crappy old "Fairway Flyer" with a Dutch magneto (aka "dynamo" although that term is strictly incorrect) generator and 6VAC old school chrome light. In practice, the drag on wheel isn't enough to notice given the entire bike is hardly built for speed.

iZotope hiring MediaMined interns!

If you are a budding music or audio engineering undergraduate student, iZotope is hiring paid interns. The work mostly consists of auditioning our systems.

iZotope hiring MediaMined developers!

The MediaMined team at iZotope is expanding! We're hiring software developers with a background in web application development. If you are a LAMP rockstar and care deeply about music, we'd love to hear from you!

The Pleasure Programmes at the Theatre for the New City

(Folks That Live On The Hill, by Jerome Kern)

iZotope acquires Imagine Research

The company I have been working with for about 18 months now, Imagine Research has now been acquired by iZotope Inc. based in Boston. This is a great opportunity for us to continue to develop our MediaMined technology, to broaden it's reach and incorporate it into iZotope's future products. For my local friends, I'll continue to be based in NYC.

Jill's video interview with Peter

Jill produced a beautiful interview that we made with my Dad, Peter, first shot after his diagnosis, and Jill rapidly edited it ready for playing at his wake. It was a deeply moving way for Dad to be there for all of the people that loved him.

Mediamined

Mediamined is a project I've been working on together with the great folks at Imagine Research for about a year now. With some help from the U.S. National Science Foundation, we're now making public some of our technology.

My Tribute to Dad

This is the tribute I presented with my sister Gaye at my father, Peter John Smith's funeral:

I'd like to talk about Dad's life through some of our personal experiences and recollections of him.

The passing of Peter Smith (8/4/1943 - 29/9/2011)

My father Peter Smith passed away this morning at 10:37am at Kalamunda hospital. Thankfully I was able to be with him, my sister Gaye, and mother Dorothy at the time of his passing, which was peaceful. His condition had deteriorated very rapidly and thankfully this kept his suffering to a minimum. Thank you all for your kind wishes.

Syndicate content
EU Copyright