Tested with Python 3.10.13, PyTorch 2.5.1, CUDA 12.6. Dependencies are captured in two files at the repo root: environment.yml (conda — canonical) and requirements.txt (pip-only pins).
HOI-DETR is a transformer-based framework for detecting hands, hand-held objects, and their interactions in images and video. Built on the Co-DETR architecture, it adds a lightweight interaction ...