Abstract: Dense video captioning requires localization and description of multiple events in long videos. Prior works detect events in videos solely relying on the visual content and completely ignore ...
Abstract: Temporal action localization aims at detecting the temporal intervals of human actions in untrimmed videos. Most previous methods rely on locating and matching the start and end times of ...
D2-Net: Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao. "D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised ...