egocentric video, Video Understanding, Long-Form Video Understanding, multimodal AI, few-shot learning
Long-Form Video Understanding, Temporal Action Detection, Computer Vision