A complete deep-learning-based action recognition system with a ResNet101 encoder, an LSTM decoder with spatial attention, a FastAPI backend, and a web-based frontend. The system recognizes 40 different human ...
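
To make "LSTM decoder with spatial attention" concrete, here is a minimal sketch of one soft-attention step in plain Python. It is not the project's actual code: it assumes additive (tanh-based) scoring, and the names `spatial_attention`, `w_feat`, `w_hid`, and `v` are hypothetical. In a real model, `regions` would be the encoder's per-location feature vectors and `hidden` the current LSTM hidden state; the toy values below are illustrative only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def spatial_attention(regions, hidden, w_feat, w_hid, v):
    """One soft-attention step (assumed additive scoring, hypothetical names).

    regions: L feature vectors of size D (one per spatial location)
    hidden:  decoder hidden state of size H
    w_feat:  A x D projection, w_hid: A x H projection, v: vector of size A
    Returns (context vector of size D, attention weights of size L).
    """
    hid_proj = matvec(w_hid, hidden)
    scores = []
    for feat in regions:
        feat_proj = matvec(w_feat, feat)
        act = [math.tanh(a + b) for a, b in zip(feat_proj, hid_proj)]
        scores.append(sum(vi * ai for vi, ai in zip(v, act)))
    weights = softmax(scores)  # one weight per location, summing to 1
    dim = len(regions[0])
    # Context vector: attention-weighted average of the region features.
    context = [
        sum(w * feat[d] for w, feat in zip(weights, regions))
        for d in range(dim)
    ]
    return context, weights


# Toy example: three spatial locations with 2-dimensional features.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
hidden = [0.5, -0.5]
w_feat = [[1.0, 0.0], [0.0, 1.0]]
w_hid = [[0.1, 0.0], [0.0, 0.1]]
v = [1.0, 1.0]
context, weights = spatial_attention(regions, hidden, w_feat, w_hid, v)
```

At each decoding step, the context vector would typically be fed to the LSTM together with the previous state, so the decoder can focus on different image regions over time.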