The goal of this repository is to explore and implement cutting-edge models and papers from scratch. This allows me to understand the nuances of each model, while also giving others an opportunity to follow along and learn. Each implementation includes detailed comments, explanations, and test scripts to demonstrate the models' performance.
Here’s a list of the models and papers implemented so far:
- Vision Transformer (ViT)
For all of these experiments I have trained / tested the models using Nvidia RTX 4090 GPU. The logs of the training and/or testing are omitted from this repository.
To run any of the implemented models, follow the instructions in each model's respective folder. Here's a basic workflow:
- Clone the repository:
git clone https://github.com/your-username/your-repo-name.git cd your-repo-name
- You will find the main.py which you can run directly after seting up your conda/venv environment