Author: | Ramaseshan, Ajay |
Title: | Application of multiway methods for dimensionality reduction to music |
Publication type: | Master's thesis |
Publication year: | 2013 |
Pages: | 86 pp. + 5 pp. of appendices |
Language: | eng |
Department/School: | School of Science (Perustieteiden korkeakoulu) |
Main subject: | Computer Networks (Tietokoneverkot, T-110) |
Supervisor: | Simula, Olli |
Instructor: | Corona, Francesco ; Miche, Yoan |
Electronic version URL: | http://urn.fi/URN:NBN:fi:aalto-201407052298 |
OEVS: | Electronic archive copy is available via Aalto Thesis Database.
|
Location: | P1 Ark Aalto 8728 | Archive |
Keywords: | Mel spectrogram MDS MLSCA MPCA music collection MIR PCA |
Abstract (eng): | This thesis belongs to the broader field of Music Information Retrieval (MIR). MIR refers to a large set of strategies, software and tools through which computers can analyse audio data and predict interesting patterns in it. It is a diverse, multidisciplinary field that draws on signal processing, machine learning, musicology and music theory, among others. Dimensionality reduction methods are widely used in data mining and machine learning. They reduce the complexity of the classification and clustering algorithms used to process the data, and they help in studying useful statistical properties of the dataset. In this Master's thesis, audio features are extracted from the songs of a personalized music collection using the Mel spectrogram, and a music tensor is built from these features. Then, two approaches to unfolding the tensor into a 2-way data matrix are studied. After unfolding, dimensionality reduction techniques such as Principal Component Analysis (PCA) and classical metric Multidimensional Scaling (MDS) are applied. Unfolding the tensor and performing either MDS or PCA is equivalent to performing Multiway Principal Component Analysis (MPCA). A third method, Multilevel Simultaneous Component Analysis (MLSCA), which builds a composite model for each song, is also applied. The number of components to retain is obtained by hold-out validation. The fit of each of these models was evaluated with the T² and Q statistics, and the models were compared with each other. The aim of this thesis is to produce a dimensionality reduction that can be used for further MIR tasks, such as better clustering of the data with respect to, e.g., artists or genres. |
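Illustrative note: the unfold-then-reduce pipeline described in the abstract can be sketched roughly as follows. This is a minimal sketch, not the thesis's implementation. The tensor layout (songs x Mel bands x time frames), the random stand-in data, the choice of k = 3 components and all variable names are assumptions made for illustration. Only one possible unfolding is shown, and MLSCA and the hold-out validation step are not covered.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical music tensor: songs x Mel bands x time frames.
# Random numbers stand in for real Mel-spectrogram features.
n_songs, n_bands, n_frames = 12, 8, 5
tensor = rng.normal(size=(n_songs, n_bands, n_frames))

# Unfold the 3-way tensor into a 2-way matrix with one row per song,
# then centre each column.
X = tensor.reshape(n_songs, n_bands * n_frames)
Xc = X - X.mean(axis=0)

# PCA via the singular value decomposition of the centred matrix.
k = 3  # assumed number of retained components
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
scores_pca = U[:, :k] * s[:k]   # song coordinates in the reduced space
loadings = Vt[:k].T             # feature loadings

# Classical metric MDS on squared Euclidean distances between songs.
sq_norms = (Xc ** 2).sum(axis=1)
D2 = sq_norms[:, None] + sq_norms[None, :] - 2.0 * Xc @ Xc.T
J = np.eye(n_songs) - np.ones((n_songs, n_songs)) / n_songs
B = -0.5 * J @ D2 @ J           # double-centred Gram matrix
eigvals, eigvecs = np.linalg.eigh(B)
order = np.argsort(eigvals)[::-1][:k]
scores_mds = eigvecs[:, order] * np.sqrt(eigvals[order])

# With Euclidean distances the two embeddings agree up to column signs.
same_up_to_sign = np.allclose(np.abs(scores_pca), np.abs(scores_mds), atol=1e-6)

# Hotelling's T^2: variance-scaled distance of each song within the model.
score_var = s[:k] ** 2 / (n_songs - 1)
t2 = (scores_pca ** 2 / score_var).sum(axis=1)

# Q statistic: squared reconstruction residual of each song.
residual = Xc - scores_pca @ loadings.T
q = (residual ** 2).sum(axis=1)

print("PCA and MDS scores agree up to sign:", same_up_to_sign)
print("T^2 per song:", np.round(t2, 2))
print("Q per song:", np.round(q, 2))
```

In this sketch T² measures how far a song lies from the centre of the reduced space, while Q measures how much of the song's feature vector the retained components fail to reconstruct.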
ED: | 2014-01-07 |
INSSI record number: 48301