Talk by Iñigo Urteaga, Columbia University

February 15 @ 12:00 pm - 1:00 pm

Place: AGORA – Espai Polivalent, mòdul B3, Campus Nord UPC
https://www.upc.edu/campusnord/ca/altres/planols-i-transports-al-campus-nord

Bayesian models and inference for reinforcement learning: the multi-armed bandit case.

Speaker: Iñigo Urteaga, Columbia University

https://iurteaga.github.io/

 

Abstract:

The most celebrated corners of machine learning over the past decades are those successful at prediction, such as spam classification, medical diagnosis, or recognizing cat faces. However, a wide variety of applied problems are prescriptive rather than predictive: decisions must be made in order to maximize a reward. Such problems are common in health, commerce, and engineering. One setting for optimizing interactions with an unknown world is the multi-armed bandit, a sequential decision process that is a particular instance of reinforcement learning.

In this talk, I will show how Bayesian models and inference methods from the statistics and machine learning community, particularly variational and Monte Carlo methods, can be used to extend multi-armed bandit models, improve learning in complex scenarios, and make informed decisions.

 

Contact:

Ricard Gavaldà, gavalda-at-cs.upc.edu
Talk partially sponsored by TIN2017-89244-R (MACDA project).
