Table of Contents

Title: Data augmentation: A comprehensive survey of modern approaches

Authors: Alhassan Mumuni, Fuseini Mumuni

Published: Dec 2022

Link: https://doi.org/10.1016/j.array.2022.100258

Summary (Generated by Microsoft Copilot):

Introduction:

  • The paper reviews modern data augmentation techniques in computer vision to improve machine learning model performance.

Challenges:

  • Obtaining large, quality annotated data is time-consuming and resource-intensive.
  • Small and poorly representative datasets lead to poor model performance.

Methods:

  • Data transformation and data synthesis techniques.
  • Meta-learning to enhance data augmentation processes.

Novelties:

  • New taxonomy categorizing methods into data transformation and data synthesis.
  • Extensive coverage of modern techniques like neural rendering and generative adversarial networks (GANs).

Results:

  • Quantitative performance comparisons of various methods on benchmark datasets like CIFAR-10, CIFAR-100, and ImageNet.

Performances:

  • Significant performance improvements with advanced data augmentation methods over baseline models.

Limitations:

  • Some methods may introduce artificial, out-of-distribution samples.
  • Geometric transformations may harm performance in controlled settings.

Discussion:

  • Emphasizes the importance of task-relevant augmentations.
  • Highlights future research directions and challenges in data augmentation.