Documentdetail
ID kaart

oai:arXiv.org:2410.10736

Onderwerp
Computer Science - Machine Learnin... Statistics - Machine Learning
Auteur
Shah, Vrund Chaudhari, Tejas Manwani, Naresh
Categorie

Computer Science

Jaar

2024

vermelding datum

16-10-2024

Trefwoorden
loss robust machine calibrated calibration learning shifted surrogates
Metriek

Beschrijving

Robustness towards adversarial attacks is a vital property for classifiers in several applications such as autonomous driving, medical diagnosis, etc.

Also, in such scenarios, where the cost of misclassification is very high, knowing when to abstain from prediction becomes crucial.

A natural question is which surrogates can be used to ensure learning in scenarios where the input points are adversarially perturbed and the classifier can abstain from prediction?

This paper aims to characterize and design surrogates calibrated in "Adversarial Robust Reject Option" setting.

First, we propose an adversarial robust reject option loss $\ell_{d}^{\gamma}$ and analyze it for the hypothesis set of linear classifiers ($\mathcal{H}_{\textrm{lin}}$).

Next, we provide a complete characterization result for any surrogate to be $(\ell_{d}^{\gamma},\mathcal{H}_{\textrm{lin}})$- calibrated.

To demonstrate the difficulty in designing surrogates to $\ell_{d}^{\gamma}$, we show negative calibration results for convex surrogates and quasi-concave conditional risk cases (these gave positive calibration in adversarial setting without reject option).

We also empirically argue that Shifted Double Ramp Loss (DRL) and Shifted Double Sigmoid Loss (DSL) satisfy the calibration conditions.

Finally, we demonstrate the robustness of shifted DRL and shifted DSL against adversarial perturbations on a synthetically generated dataset.

;Comment: Accepted at Asian Conference on Machine Learning (ACML) , 2024

Shah, Vrund,Chaudhari, Tejas,Manwani, Naresh, 2024, Towards Calibrated Losses for Adversarial Robust Reject Option Classification

Document

Openen

Delen

Bron

Artikelen aanbevolen door ES/IODE AI

Choice Between Partial Trajectories: Disentangling Goals from Beliefs
agents models aligned based bootstrapped learning reward function model return choice choices partial