
Auxiliary Fourier Augmentation

This repository contains the code for the paper "Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification" accepted at CVPR 2024.

Introduction

We propose Auxiliary Fourier-basis Augmentation (AFA), a complementary technique that targets augmentation in the frequency domain and fills the robustness gap left by visual augmentations. We demonstrate the utility of augmentation via Fourier-basis additive noise in a straightforward and efficient adversarial setting. Our results show that AFA improves robustness to common corruptions, OOD generalization, and consistency of performance under increasing perturbation strength, with a negligible deficit to standard performance across various benchmarks and resolutions. It can be seamlessly integrated with other augmentation techniques to further boost performance.

For more details see our CVPR 2024 paper: Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification
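
As a rough illustration of the core idea (not the implementation used in this repository), the sketch below adds a single Fourier-basis plane wave of a chosen frequency, phase, and strength to an image tensor; the function names and parameter choices here are our own and purely illustrative.

import math
import torch

def fourier_basis_noise(height, width, freq_x, freq_y, phase=0.0):
    # Build one 2D Fourier-basis (plane-wave) pattern, normalised to unit L2 norm.
    ys = torch.arange(height).float().unsqueeze(1)  # (H, 1)
    xs = torch.arange(width).float().unsqueeze(0)   # (1, W)
    wave = torch.cos(2 * math.pi * (freq_x * xs / width + freq_y * ys / height) + phase)
    return wave / wave.norm()

def augment_with_fourier_noise(image, strength=10.0):
    # Add a randomly sampled Fourier-basis wave to a (C, H, W) image in [0, 1].
    # 'strength' scales the unit-norm wave; this is only a hedged sketch of
    # Fourier-basis additive noise, not the repository's AFA attack.
    _, h, w = image.shape
    fx = torch.randint(1, w // 2 + 1, (1,)).item()   # skip the DC component
    fy = torch.randint(0, h // 2 + 1, (1,)).item()
    phase = torch.rand(1).item() * 2 * math.pi
    wave = fourier_basis_noise(h, w, fx, fy, phase)
    return (image + strength * wave.unsqueeze(0)).clamp(0.0, 1.0)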

Schema (overview figure of the AFA method)

Contents

This directory includes a reference implementation in PyTorch of the augmentation method used in AFA.

We also include PyTorch re-implementations of AFA for both CIFAR-10/100 and ImageNet, both of which support training and evaluation on CIFAR-10/100-C and ImageNet-C.

Experiment Setups

The following snippet is an example of how to use the ConfigBuilder to create a config object:

experiments = [
    # Experimental setup for training on ImageNet with the ResNet50-Dubin model.
    # Training uses the AFA attack with mean strength 10 and minimum strength 0.
    # JSD is disabled, so the ACE loss is used by default since an attack is specified.
    # No mixing augmentations such as CutMix or MixUp are used.
    # No other augmentations are used.
    {
        'ds': 'in', 'm': 'rn50dubin', 'use_jsd': False,
        'use_prime': False, 'use_augmix': False, 'in_mix': False, 'use_mix': False,
        'use_fourier': False, 'use_apr': False, 'attack': 'afa', 'min_str': 0., 'mean_str': 10.,
    },

    # Experimental setup for training on ImageNet with the CCT model.
    # Training uses the AFA attack with mean strength 10 and minimum strength 0.
    # JSD is disabled, so the ACE loss is used by default since an attack is specified.
    # Mixing augmentations such as CutMix or MixUp are used.
    # AugMix is used in addition to the AFA augmentation.
    {
        'ds': 'in', 'm': 'cct_14_7x2_224', 'use_jsd': False,
        'use_prime': False, 'use_augmix': True, 'in_mix': False, 'use_mix': True,
        'use_fourier': False, 'use_apr': False, 'attack': 'afa', 'min_str': 0., 'mean_str': 10.,
    },
]

The experiments variable is a list of dictionaries; each dictionary represents one experimental setup. Specify the experiment list in the main.py file and run the file to start the experiments.

Look at config_utils.py for more details on the ConfigBuilder class and experimental setups.
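
As a purely hypothetical sketch (the actual interface is defined by the ConfigBuilder class in config_utils.py and may look different), a driver loop in main.py could consume the experiments list roughly like this; the constructor call and run_all below are placeholder usage, not the repository's real API.

from config_utils import ConfigBuilder  # real class in this repo; usage below is assumed

def run_all(experiments):
    for setup in experiments:
        # Assumption: the builder turns one experiment dict into a config object.
        config = ConfigBuilder(**setup)
        # Placeholder for whatever main.py actually does with the config,
        # e.g. constructing the model, dataloaders, and trainer.
        print(f"Running {setup['m']} on {setup['ds']} with attack={setup['attack']}")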

Running the Experiments

First install the requirements using the following command:

pip install -r requirements.txt

Then, construct the config object using the ConfigBuilder class and specify the experiments in the main.py file. This is shown above.

To run the experiments, use the following command:

python main.py

Requirements

  • PyTorch
  • Numpy
  • Matplotlib
  • einops, opt_einsum
  • tqdm
  • ml-collections
  • torchvision
  • pytorch_lightning
  • wandb
  • torchmetrics
  • thop

Pretrained Models

Pretrained models for ImageNet can be loaded with the load_weights.py script. Pretrained weights can be downloaded as described below:

For ImageNet, all models have been moved to Zenodo and can be downloaded here. For CIFAR-10, all model weights are available here.
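
As a hedged sketch (the repository's own loading logic lives in load_weights.py and may differ), a downloaded checkpoint could be loaded into a standard torchvision ResNet-50 roughly as follows; the checkpoint filename and the 'state_dict' key layout are assumptions.

import torch
from torchvision.models import resnet50

# Hypothetical filename; use the checkpoint downloaded from Zenodo.
checkpoint = torch.load("afa_resnet50_imagenet.pth", map_location="cpu")

# Some checkpoints wrap the weights under a 'state_dict' key (assumption).
state_dict = checkpoint.get("state_dict", checkpoint)

model = resnet50(num_classes=1000)
missing, unexpected = model.load_state_dict(state_dict, strict=False)
print("missing keys:", missing)
print("unexpected keys:", unexpected)
model.eval()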

Evaluations

We refer to CorruptionBenchCV for the corruption benchmarks on ImageNet-C, ImageNet-C̄, ImageNet-3DCC, and ImageNet-P, and to ImageNet-v2 and ImageNet-R for the OOD tests.

The Fourier heatmaps of models are plotted using this repository.
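
For a rough idea of what the corruption evaluation computes (we defer to CorruptionBenchCV for the actual benchmark code), the sketch below measures top-1 accuracy on one corruption/severity split, assuming the standard ImageNet-C directory layout root/corruption/severity/class/image and standard ImageNet preprocessing.

import torch
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

@torch.no_grad()
def corruption_accuracy(model, root, corruption, severity, device="cuda"):
    # Top-1 accuracy on one ImageNet-C corruption/severity split (layout assumed).
    tf = transforms.Compose([
        transforms.Resize(256),
        transforms.CenterCrop(224),
        transforms.ToTensor(),
        transforms.Normalize(mean=[0.485, 0.456, 0.406],
                             std=[0.229, 0.224, 0.225]),
    ])
    ds = datasets.ImageFolder(f"{root}/{corruption}/{severity}", transform=tf)
    loader = DataLoader(ds, batch_size=64, num_workers=4)
    model.eval().to(device)
    correct = total = 0
    for images, targets in loader:
        preds = model(images.to(device)).argmax(dim=1).cpu()
        correct += (preds == targets).sum().item()
        total += targets.numel()
    return correct / total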

Citation

If you find this repository useful, please consider citing our paper:

@inproceedings{afa,
  title={Fourier-basis functions to bridge augmentation gap: Rethinking frequency augmentation in image classification},
  author={Vaish, Puru and Wang, Shunxin and Strisciuglio, Nicola},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={17763--17772},
  year={2024}
}

License

This repository is released under the Apache 2.0 license. See LICENSE for more details.
