Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts

Document detail

ID

oai:arXiv.org:2402.15505

Topic

Computer Science - Machine Learnin... Computer Science - Artificial Inte... Computer Science - Computer Vision...

Author

Liu, Yuejiang Alahi, Alexandre

Year

2024

listing date

2/28/2024

Keywords

weak-to-strong computer

Metrics

Abstract

Steering the behavior of a strong model pre-trained on internet-scale data can be difficult due to the scarcity of competent supervisors.

Recent studies reveal that, despite supervisory noises, a strong student model may surpass its weak teacher when fine-tuned on specific objectives.

Yet, the effectiveness of such weak-to-strong generalization remains limited, especially in the presence of large capability gaps.

In this paper, we propose to address this challenge by harnessing a diverse set of specialized teachers, instead of a single generalist one, that collectively supervises the strong student.

Our approach resembles the classical hierarchical mixture of experts, with two components tailored for co-supervision: (i) we progressively alternate student training and teacher assignment, leveraging the growth of the strong student to identify plausible supervisions; (ii) we conservatively enforce teacher-student and local-global consistency, leveraging their dependencies to reject potential annotation noises.

We validate the proposed method through visual recognition tasks on the OpenAI weak-to-strong benchmark and additional multi-domain datasets.

Our code is available at \url{https://github.com/yuejiangliu/csl}.

;Comment: Preprint

Liu, Yuejiang,Alahi, Alexandre, 2024, Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts

Document

Open

Source

Articles recommended by ES/IODE AI

Computer Science

TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish

evaluate provide turkishmmlu questions language

Archives of Clinical Neurops...

The Measurement of Acculturation in Neuropsychological Evaluations of Hispanic/Latino Individuals across the Lifespan: A Scoping Review of the Literature

evidence review lifespan various validation acculturation measures

BMC Cancer

Systematic druggable genome-wide Mendelian randomization identifies therapeutic targets for lung cancer

agphd1 subtypes replication hykk squamous cell gene carcinoma causal targets mendelian randomization cancer analysis