Abstract
The performance of state-of-the-art Deep Learning models heavily depends on the availability of well-curated training and testing datasets that sufficiently capture the operational domain. Data augmentation is an effective technique for alleviating data scarcity, reducing the need for time-consuming and expensive data collection and labelling. Despite their potential, existing data augmentation techniques primarily focus on simple geometric and colour space transformations, such as noise, flipping and resizing, producing datasets with limited diversity. When such an augmented dataset is used to test Deep Learning models, the results are typically uninformative about the robustness of the models. We address this gap by introducing GENFUZZER, a novel coverage-guided data augmentation fuzzing technique for Deep Learning models underpinned by generative AI. We demonstrate our approach using widely-adopted datasets and models employed for image classification, illustrating its effectiveness in generating informative datasets that lead to an increase of up to 26% in widely-used coverage criteria.
| Original language | English |
|---|---|
| Title of host publication | 2023 38th IEEE/ACM International Conference on Automated Software Engineering |
| Subtitle of host publication | Proceedings |
| Publisher | IEEE |
| Number of pages | 5 |
| ISBN (Electronic) | 979-8-3503-2996-4 |
| ISBN (Print) | 979-8-3503-2997-1 |
| DOIs | |
| Publication status | Published - 15 Sept 2023 |
| Event | the 38th IEEE/ACM International Conference on Automated Software Engineering - Kirchberg, Luxembourg. Duration: 11 Sept 2023 → 15 Sept 2023. https://conf.researchr.org/home/ase-2023 |
Publication series
| Name | 2023 38th IEEE/ACM International Conference on Automated Software Engineering (ASE) |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 1938-4300 |
| ISSN (Electronic) | 2643-1572 |
Conference
| Conference | the 38th IEEE/ACM International Conference on Automated Software Engineering |
|---|---|
| Abbreviated title | ASE 2023 |
| Country/Territory | Luxembourg |
| City | Kirchberg |
| Period | 11/09/23 → 15/09/23 |
| Internet address | |
Bibliographical note
This is an author-produced version of the published paper. Uploaded in accordance with the University’s Research Publications and Open Access policy.
Keywords
- Generative AI, Deep Learning Testing, Coverage Guided Fuzzing, Data Augmentation, Safe AI