Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Approved

Classifications

MinEdu publication type

A4 Article in conference proceedings (peer-reviewed)

Definition

Article

Host publication type

Conference platform

Target group

Scientific

Article type

Other article

Peer reviewed

Peer-reviewed

Authors of the publication

Number of authors

34

Authors

Milani, Stephanie; Kanervisto, Anssi; Ramanauskas, Karolis; Schulhoff, Sander; Houghton, Brandon; Mohanty, Sharada; Galbraith, Byron; Chen, Ke; Song, Yan; Zhou, Tianze; Yu, Bingquan; Liu, He; Guan, Kai; Hu, Yujing; Lv, Tangjie; Malato, Federico; Leopold, Florian; Raut, Amogh; Hautamäki, Ville; Melnik, Andrew; Ishida, Shu; Henriques, João F.; Klassert, Robert; Laurito, Walter; Cazzonelli, Lucas; Kulbach, Cedric; Popovic, Nicholas; Schweizer, Marvin; Novoseller, Ellen; Goecks, Vinicius G.; Waytowich, Nicholas; Watkins, David; Miller, Josh; Shah, Rohin

Publication channel information

Title of host publication

Proceedings of Machine Learning Research

Editors of host publication

Ciccone, Marco; Stolovitzky, Gustavo; Albrecht, Jacob

Name of conference

Annual Conference on Neural Information Processing Systems

Title of journal/series

Proceedings of Machine Learning Research

Publication forum ID

87561

Publication forum level

1

Internationality

Yes

Detailed publication information

Publication year

2023

Reporting year

2023

Journal/series volume number

220

Page numbers

171-188

Language of publication

English

Co-publication information

International co-publication

Yes

Co-publication with a company

Yes

Classification and additional information

MinEdu field of science classification

113 Computer and information sciences

Subject headings

fine-tuning; imitation learning; learning from humans; preference learning; reinforcement learning from human feedback; reward modeling

Source database ID

Scopus ID

2-s2.0-85179129024