haku: @keyword deep learning / yhteensä: 9
viite: 5 / 9
Tekijä:Luketina, Jelena
Työn nimi:Hyperparameter Optimization for Machine Learning
Julkaisutyyppi:Diplomityö
Julkaisuvuosi:2016
Sivut:73      Kieli:   eng
Koulu/Laitos/Osasto:Perustieteiden korkeakoulu
Oppiaine:Applied and Engineering Mathematics   (SCI3016)
Valvoja:Hollanti, Camilla ; Dubashi, Devdatt
Ohjaaja:Raiko, Tapani
Elektroninen julkaisu: http://urn.fi/URN:NBN:fi:aalto-201606292835
Sijainti:P1 Ark Aalto  4207   | Arkisto
Avainsanat:hyperparameter
machine learning
deep learning
optimization
gradient-based
Tiivistelmä (eng):In the recent years, there have been significant developments in the field of machine learning, with the modern methods like deep learning, significantly overpassing previous state-of-the-art results on a variety of tasks.
These modern methods however, come at the cost of increased complexity and require careful tuning of multiple hyperparameters which specify the model.

The common practice still is manual tuning of the hyperparameters, making the use of deep learning methods, more of an art than a science.
In this thesis, we will explore some of the methods for automated hyperparameter optimization, focusing on the gradient-based approach.
The goal is to provide a gentle introduction to this topic, by first providing a solid overview of essential concepts from both optimization and machine learning.
ED:2016-07-17
INSSI tietueen numero: 54109
+ lisää koriin
INSSI