This project demonstrates the use of a Vision Transformer (ViT) for real-time hand gesture recognition using a webcam. The model is trained to recognize hand gestures representing digits (1-10) from the ...