Python Byte Pair Encoding

A prompt tuning method for few-shot action recognition

Abstract: Vision-language pre-training models learn visual concepts from image-text or video-text pairs, which can be adopted for visual-textual tasks. In this paper, we adopt these concepts as prior ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

A prompt tuning method for few-shot action recognition

Trending now