Détail du document
Identifiant

oai:arXiv.org:2407.01529

Sujet
Computer Science - Cryptography an... Computer Science - Machine Learnin...
Auteur
Koch, Luke Oesch, Sean Chaulagain, Amul Dixon, Jared Dixon, Matthew Huettal, Mike Sadovnik, Amir Watson, Cory Weber, Brian Hartman, Jacob Patulski, Richard
Catégorie

Computer Science

Année

2024

Date de référencement

03/07/2024

Mots clés
file wild attack files polyglot detection
Métrique

Résumé

A polyglot is a file that is valid in two or more formats.

Polyglot files pose a problem for malware detection systems that route files to format-specific detectors/signatures, as well as file upload and sanitization tools.

In this work we found that existing file-format and embedded-file detection tools, even those developed specifically for polyglot files, fail to reliably detect polyglot files used in the wild, leaving organizations vulnerable to attack.

To address this issue, we studied the use of polyglot files by malicious actors in the wild, finding $30$ polyglot samples and $15$ attack chains that leveraged polyglot files.

In this report, we highlight two well-known APTs whose cyber attack chains relied on polyglot files to bypass detection mechanisms.

Using knowledge from our survey of polyglot usage in the wild -- the first of its kind -- we created a novel data set based on adversary techniques.

We then trained a machine learning detection solution, PolyConv, using this data set.

PolyConv achieves a precision-recall area-under-curve score of $0.999$ with an F1 score of $99.20$% for polyglot detection and $99.47$% for file-format identification, significantly outperforming all other tools tested.

We developed a content disarmament and reconstruction tool, ImSan, that successfully sanitized $100$% of the tested image-based polyglots, which were the most common type found via the survey.

Our work provides concrete tools and suggestions to enable defenders to better defend themselves against polyglot files, as well as directions for future work to create more robust file specifications and methods of disarmament.

;Comment: 18 pages, 11 figures

Koch, Luke,Oesch, Sean,Chaulagain, Amul,Dixon, Jared,Dixon, Matthew,Huettal, Mike,Sadovnik, Amir,Watson, Cory,Weber, Brian,Hartman, Jacob,Patulski, Richard, 2024, On the Abuse and Detection of Polyglot Files

Document

Ouvrir

Partager

Source

Articles recommandés par ES/IODE IA

Diabetes and obesity: the role of stress in the development of cancer
stress diabetes mellitus obesity cancer non-communicable chronic disease stress diabetes obesity patients cause cancer