Contextual Bandits and Reinforcement Learning with Function Approximation

모드선택 :              
세미나 신청은 모드에서 세미나실 사용여부를 먼저 확인하세요

Contextual Bandits and Reinforcement Learning with Function Approximation

김수현 0 5358
구분 수학강연회
일정 2025-09-11(목) 16:00~17:00
세미나실 129동 101호
강연자 이다빈 (서울대학교)
담당교수 권재훈
기타

In this talk, we discuss contextual bandits and reinforcement learning problems based on function approximation frameworks. For the first part, we consider neural logistic bandits, where the main task is to learn an unknown reward function within a logistic link function using a neural network. For the second part, we explain algorithms for learning Markov decision processes whose transition is governed by a multinomial logit model. 

    정원 :
    부속시설 :
세미나명