Inssi

Helecon

Vocabulary

Tenttu

search query: @keyword bayesilainen mallinvalinta / total: 1

reference: 1 / 1

« previous | next »

Author:	Piironen, Juho
Title:	Comparison of Bayesian predictive methods for variable selection
	Bayesilaisten prediktiivisten muuttujavalintamenetelmien vertailu
Publication type:	Master's thesis
Publication year:	2014
Pages:	vi + 60 Language: eng
Department/School:	Lääketieteellisen tekniikan ja laskennallisen tieteen laitos
Main subject:	Laskennallinen tiede ja tekniikka (F3009)
Supervisor:	Lampinen, Jouko
Instructor:	Vehtari, Aki
Electronic version URL:	http://urn.fi/URN:NBN:fi:aalto-201408292542
OEVS:	Electronic archive copy is available via Aalto Thesis Database. Instructions close Reading digital theses in the closed network of the Aalto University Harald Herlin Learning Centre In the closed network of Learning Centre you can read digital and digitized theses not available in the open network. The Learning Centre contact details and opening hours: https://learningcentre.aalto.fi/en/harald-herlin-learning-centre/ You can read theses on the Learning Centre customer computers, which are available on all floors. Logging on to the customer computers Aalto University staff members log on to the customer computer using the Aalto username and password. Other customers log on using a shared username and password. Opening a thesis On the desktop of the customer computers, you will find an icon titled: Aalto Thesis Database Click on the icon to search for and open the thesis you are looking for from Aaltodoc database. You can find the thesis file by clicking the link on the OEV or OEVS field. Reading the thesis You can either print the thesis or read it on the customer computer screen. You cannot save the thesis file on a flash drive or email it. You cannot copy text or images from the file. You cannot edit the file. Printing the thesis You can print the thesis for your personal study or research use. Aalto University students and staff members may print black-and-white prints on the PrintingPoint devices when using the computer with personal Aalto username and password. Color printing is possible using the printer u90203-psc3, which is located near the customer service. Color printing is subject to a charge to Aalto University students and staff members. Other customers can use the printer u90203-psc3. All printing is subject to a charge to non-University members.
Location:	P1 Ark Aalto 1664 \| Archive
Keywords:	Bayesian model selection variable selection selection induced bias cross validation information criteria reference model projection bayesilainen mallinvalinta muuttujavalinta valintaharha ristiinvalidointi informaatiokriteerit referenssimalli projektio
Abstract (eng):	To date, several methods for Bayesian model selection have been proposed. Although there are many studies discussing the theoretical properties of these methods for model assessment, an extensive quantitative comparison between the methods for model selection for finite data seems to be lacking. This thesis reviews the most commonly used methods in the literature and compares their performance in practical variable selection problems, especially in situations where the data is scarce. The study also discusses the selection induced bias in detail and underlines its relevance for variable selection. Although the focus of the study is on variable selection, the presented ideas are generalizable to other model selection problems as well. The numerical results consist of simulated experiments and one real world problem. The results suggest that even though there are nearly unbiased methods for assessing the performance of a given model, the high variance in the performance estimation may lead to considerable selection induced bias and selection of an overfitted model. The results also suggest that the reference predictive and projection methods are least sensitive to the selection induced bias and are therefore more robust for searching promising models than the alternative methods, such as cross validation and information criteria. However, due to the selection bias, also for these methods the estimated divergence between the reference and candidate models may be an unreliable indicator of the performance of the selected models. For this reason, the performance estimation of the found models should be done for example using cross validation outside the selection process.
Abstract (fin):	Kirjallisuudessa on esitetty useita erilaisia menetelmiä bayesilaiseen mallin valintaan. Vaikka näiden menetelmien teoreettisia ominaisuuksia erityisesti mallin suorituskyvyn mittaamiseen on tutkittu runsaasti, kattavaa tutkimusta eri menetelmien eroista mallin valintaan äärelliselle aineistolle ei näytä olevan tehty. Tässä työssä käsitellään yleisimmin käytettyjä mallinvalintamenetelmiä ja vertaillaan näiden käyttäytymistä käytännön muuttujavalintaongelmissa, erityisesti tilanteissa joissa dataa on niukasti. Työn tarkoituksena on käsitellä myös valintaharhaksi kutsuttua ilmiötä ja korostaa sen merkitystä muuttujavalintaongelmissa. Vaikka työ käsittelee pääosin muuttujavalintaa, työssä esitetyt johtopäätökset ovat yleistettävissä myös muihin mallinvalintaongelmiin. Numeeriset esimerkit koostuvat simuloiduista testeistä sekä yhdestä reaalimaailman ongelmasta. Tulosten perusteella näyttää siltä, että vaikka yksittäisten mallien suorituskykyä voidaan arvioida harhattomasti, valintaharha voi vaikeuttaa mallinvalintaa huomattavasti ja johtaa ylisovittuneen mallin valintaan. Näyttää myös siltä, että referenssiprediktiiviset ja projektiomenetelmät ovat vähiten herkkiä valinnan aiheuttamalle harhalle ja kykenevät näin ollen löytämään parempia malleja kuin vaihtoehtoiset menetelmät kuten ristiinvalidointi ja informaatiokriteerit. Valintaharhasta johtuen kuitenkin myös näille menetelmille estimoitu eroavuus referenssimallin ja kandidaattimallien välillä voi antaa epäluotettavan kuvan valittujen mallien suorituskyvystä. Tästä syystä lopullinen valittujen mallien suorituskyvyn arviointi tulisi tehdä käyttäen esimerkiksi valintaprosessin ulkopuolista ristiinvalidointia.
ED:	2014-08-31

INSSI record number: 49670

« previous | next »

INSSI