This is the official repository for the paper "Optimized Vision Transformer Training using GPU and Multi-threading," published at the IEEE Conference on Artificial Intelligence 2024 (IEEE CAI 2024). This repository contains optimized implementations of Convolutional Neural Network (CNN), Transformer, and Vision Transformer (ViT) models.

Authors:
- Jonathan Ledet (@jonledet)
- Ashok Kumar
- Dominick Rizk
- Rodrigue Rizk
- KC Santosh
This project focuses on optimizing Vision Transformer training using GPU acceleration and multi-threading techniques. It provides implementations of popular deep learning models, including Convolutional Neural Networks (CNN), Transformer, and a customized version of Vision Transformer (ViT) tailored for improved performance.
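The two optimizations named above — running the model on the GPU and loading data with multiple workers — can be sketched in PyTorch. This is an illustrative sketch only: the stand-in linear classifier, batch size, and worker count are assumptions, not taken from the repository's scripts.

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# GPU acceleration: move computation to the GPU when available, else fall back to CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Multi-threaded data loading: num_workers > 0 spawns workers that prepare
# batches in parallel with training; pin_memory speeds host-to-device copies.
data = TensorDataset(torch.randn(256, 3 * 32 * 32), torch.randint(0, 10, (256,)))
loader = DataLoader(data, batch_size=64, shuffle=True, num_workers=2, pin_memory=True)

# A stand-in classifier (the actual models live in cnn.py / transformer.py / vit.py).
model = nn.Linear(3 * 32 * 32, 10).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

for x, y in loader:
    # non_blocking=True overlaps copies with compute when pin_memory=True
    x, y = x.to(device, non_blocking=True), y.to(device, non_blocking=True)
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimizer.step()
```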
- CNN: Implementation of Convolutional Neural Networks.
- Transformer: Implementation of the Transformer model.
- ViT: Customized version of the Vision Transformer (ViT) model, based on the vision-transformers-cifar10 repository.

Requirements:
- Python (>=3.6)
- Anaconda 3
- PyTorch
- CUDA-enabled GPU (for GPU acceleration)

Installation:
- Clone this repository:

  ```shell
  git clone https://github.com/jonledet/vision-transformer.git
  ```
- Create and activate a new Anaconda environment:

  ```shell
  conda create --name your-env-name python=3.6
  conda activate your-env-name
  ```
- Install the dependencies:

  ```shell
  pip install -r requirements.txt
  ```
- To run a model, execute the corresponding Python script:

  ```shell
  python cnn.py
  python transformer.py
  python vit.py
  ```
- The Vision Transformer (ViT) model is based on the work from the vision-transformers-cifar10 repository.
This project is licensed under the MIT License.