Documentdetail
ID kaart

oai:arXiv.org:2406.08824

Onderwerp
Computer Science - Robotics Computer Science - Artificial Inte... Computer Science - Computation and... Computer Science - Computers and S...
Auteur
Azeem, Rumaisa Hundt, Andrew Mansouri, Masoumeh Brandão, Martim
Categorie

Computer Science

Jaar

2024

vermelding datum

19-06-2024

Trefwoorden
outcomes people llms computer science
Metriek

Beschrijving

Members of the Human-Robot Interaction (HRI) and Artificial Intelligence (AI) communities have proposed Large Language Models (LLMs) as a promising resource for robotics tasks such as natural language interactions, doing household and workplace tasks, approximating `common sense reasoning', and modeling humans.

However, recent research has raised concerns about the potential for LLMs to produce discriminatory outcomes and unsafe behaviors in real-world robot experiments and applications.

To address these concerns, we conduct an HRI-based evaluation of discrimination and safety criteria on several highly-rated LLMs.

Our evaluation reveals that LLMs currently lack robustness when encountering people across a diverse range of protected identity characteristics (e.g., race, gender, disability status, nationality, religion, and their intersections), producing biased outputs consistent with directly discriminatory outcomes -- e.g. `gypsy' and `mute' people are labeled untrustworthy, but not `european' or `able-bodied' people.

Furthermore, we test models in settings with unconstrained natural language (open vocabulary) inputs, and find they fail to act safely, generating responses that accept dangerous, violent, or unlawful instructions -- such as incident-causing misstatements, taking people's mobility aids, and sexual predation.

Our results underscore the urgent need for systematic, routine, and comprehensive risk assessments and assurances to improve outcomes and ensure LLMs only operate on robots when it is safe, effective, and just to do so.

Data and code will be made available.

;Comment: 40 pages (52 with references), 21 Figures, 6 Tables

Azeem, Rumaisa,Hundt, Andrew,Mansouri, Masoumeh,Brandão, Martim, 2024, LLM-Driven Robots Risk Enacting Discrimination, Violence, and Unlawful Actions

Document

Openen

Delen

Bron

Artikelen aanbevolen door ES/IODE AI

A rare case of localized peliosis hepatis during adjuvant chemotherapy including oxaliplatin mimicking a liver metastasis of colon cancer
peliosis hepatis metastatic liver tumor oxaliplatin oxaliplatin associated cancer metastatic tumor liver hepatis peliosis