The Musical Avatar - A visualization of musical preferences by means of audio content description

Haro, G.; Anna Xambó; Ferdinand Fuhrmann; Dmitry Bogdanov; Emilia Gómez; Perfecto Herrera

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

The Musical Avatar - A visualization of musical preferences by means of audio content description

Title	The Musical Avatar - A visualization of musical preferences by means of audio content description
Publication Type	Conference Paper
Year of Publication	2010
Conference Name	5th Audio Mostly Conference: A Conference on Interaction with Sound
Authors	Haro, G. G. , Xambó A. , Fuhrmann F. , Bogdanov D. , Gómez E. , & Herrera P.
Conference Start Date	15/09/2010
Publisher	ACM New York, NY, USA
Conference Location	Pitea, Sweden
Abstract	The music we like (i.e. our musical preferences) encodes and communicates key information about ourselves. Depicting such preferences in a condensed and easily understandable way is very appealing, especially considering the current trends in social networked communications. In this paper we propose a method to automatically generate, given a provided set of preferred music tracks, an iconic representation of a user’s musical preferences – the Musical Avatar . Starting from the raw audio signal we first compute over 60 low-level audio features. Then, by applying pattern recognition methods, we infer a set of semantic descriptors for each track in the collection. Next, we summarize these track-level semantic descriptors, obtaining a user profile. Finally, we map this collection-wise description to the visual domain by creating a humanoid cartoony character that represents the user’s musical preferences. We performed a proof-of-concept evaluation of the proposed method on 11 subjects with promising results. The analysis of the users’ evaluations shows a clear preference for avatars generated by the proposed semantic descriptors over avatars derived from neutral or randomly generated values. We also found a general agreement on the representativeness of the users’ musical preferences via the proposed visualization strategy.
preprint/postprint document	files/publications/avatar_AM2010.pdf
Final publication	http://doi.acm.org/10.1145/1859799.1859813

avatar_AM2010.pdf