haku: @keyword natural language processing / yhteensä: 16
viite: 3 / 16
Tekijä:Westrup, Clemens
Työn nimi:An Exploration of Representation Learning and Sequential Modeling Approaches for Supervised Topic Classification in Job Advertisements
Julkaisutyyppi:Diplomityö
Julkaisuvuosi:2016
Sivut:94 s. + liitt. 9      Kieli:   eng
Koulu/Laitos/Osasto:Perustieteiden korkeakoulu
Oppiaine:Machine Learning and Data Mining   (SCI3015)
Valvoja:Gionis, Aristides
Ohjaaja:Mathioudakis, Michael
Elektroninen julkaisu: http://urn.fi/URN:NBN:fi:aalto-201611025476
Sijainti:P1 Ark Aalto  5799   | Arkisto
Avainsanat:natural language processing
computational linguistics
representation learning
sequential text modeling
text classification
job advertisements
Tiivistelmä (eng):This thesis applies the explorative double diamond design process borrowed to iteratively frame a research problem applicable in the context of a recruitment web service and then find the best approach to solve it.

Thereby the problem focus is laid on multi-class classification, in particular the task of labelling sentences in job advertisements with one of six topics which were found to be covered in every typical job description.
A dataset is obtained for evaluation and conventional N-Gram Vector Space models are compared with Representation Learning approaches, notably continuous distributed representations, and Sequential Modeling techniques using Recurrent Neural Networks.

Results of the experiments show that the Representation Learning and Sequential Modeling approaches perform on par or better than traditional feature engineering methods and show a promising direction in and beyond research in Computational Linguistics and Natural Language Processing.
ED:2016-11-13
INSSI tietueen numero: 55007
+ lisää koriin
INSSI