This project is a PyTorch implementation of the paper "ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-scale Stages". ECViT is a hybrid architecture that effectively ...