Contextual Bandits and Reinforcement Learning with Function Approximation

모드선택 :
세미나 신청은 모드에서 세미나실 사용여부를 먼저 확인하세요

Contextual Bandits and Reinforcement Learning with Function Approximation

김수현 129동 101호 0 5358 2025.09.04 10:16

구분	수학강연회
일정	2025-09-11(목) 16:00~17:00
세미나실	129동 101호
강연자	이다빈 (서울대학교)
담당교수	권재훈
기타

In this talk, we discuss contextual bandits and reinforcement learning problems based on function approximation frameworks. For the first part, we consider neural logistic bandits, where the main task is to learn an unknown reward function within a logistic link function using a neural network. For the second part, we explain algorithms for learning Markov decision processes whose transition is governed by a multinomial logit model.

수학강연회

$프린트$

정원 :
부속시설 :

구분 강연일 시간 세미나명 강연자 초청자 신청자

기타

2026-05-22

10:00

연구회의

천정희

김선우
확률론

2026-06-25

15:00

27동 116호 Replica Symmetry Breaking in the Multi-Species SK Model with Centered Gaussian External Field.
확률론 김희준 20260625 15:00 서인석

김희준

서인석

민동준
표현론

2026-06-11

10:00

129동 309호 Lecture Series on Representation Theory
표현론 Satoshi Naito 20260611 10:00 권재훈

Satoshi Naito

권재훈

정세화(QSMS)
Geometry Physics and Symmetry

2026-06-11

10:00

129동 406호 Quantum Topology from Dynamics 3
Geometry Physics and Symmetry 박성혁 20260611 10:00 유필상

박성혁

유필상

유필상
표현론

2026-06-11

15:00

129동 309호 Lecture Series on Representation Theory
표현론 Jaehyun Hong 20260611 15:00 권재훈

Jaehyun Hong

권재훈

정세화(QSMS)
표현론

2026-06-10

10:00

129동 309호 Lecture Series on Representation Theory
표현론 Satoshi Naito 20260610 10:00 권재훈

Satoshi Naito

권재훈

정세화(QSMS)
표현론

2026-06-10

15:00

129동 309호 Lecture Series on Representation Theory
표현론 Jaehyun Hong 20260610 15:00 권재훈

Jaehyun Hong

권재훈

정세화(QSMS)
HYKE,기타

2026-06-10

15:30

27동 325호 HYKE 세미나
HYKE,기타 Joseph S. Kwon 20260610 15:30 하승열

Joseph S. Kwon

하승열

이재문(2025-28998)
표현론

2026-06-09

10:00

129동 309호 Lecture Series on Representation Theory
표현론 Satoshi Naito 20260609 10:00 권재훈

Satoshi Naito

권재훈

정세화(QSMS)
Geometry Physics and Symmetry

2026-06-09

10:00

129동 406호 Quantum Topology from Dynamics 1
Geometry Physics and Symmetry 박성혁 20260609 10:00 유필상

박성혁

유필상

유필상
Geometry Physics and Symmetry

2026-06-09

14:00

129동 406호 Quantum Topology from Dynamics 2
Geometry Physics and Symmetry 박성혁 20260609 14:00 유필상

박성혁

유필상

유필상
표현론

2026-06-08

10:00

129동 309호 Lecture Series on Representation Theory
표현론 Satoshi Naito 20260608 10:00 권재훈

Satoshi Naito

권재훈

정세화(QSMS)
표현론

2026-06-08

15:00

129동 309호 Lecture Series on Representation Theory
표현론 Jaehyun Hong 20260608 15:00 권재훈

Jaehyun Hong

권재훈

정세화(QSMS)
작용소이론

2026-06-05

15:00

27동 116호 TBA
작용소이론 강동오 20260605 15:00 기타

강동오

기타

이우영
수학강연회

2026-06-04

16:00

129동 101호 Geometry, Analysis, and Probability of Conformal Fields
수학강연회 강남규 20260604 16:00 이계선

강남규

이계선

김수현
HYKE,기타

2026-06-01

15:30

27동 325호 Trajectory Inference via Multi-marginal Schrödinger Bridges
HYKE,기타 김영헌 20260601 15:30 하승열

김영헌

하승열

이재문(2025-28998)

Contextual Bandits and Reinforcement Learning with Function Approximation

공지/학술

세미나

Contextual Bandits and Reinforcement Learning with Function Approximation

정원 : 부속시설 :

정원 :
부속시설 :