Datasets

What Happens in Crowd Scenes: A New Dataset about Crowd Scenes for Image Captioning

CrowdCaption is a new challenging image captioning dataset for complex real-world crowd scene understanding, which towards to describe crowd scene.

RefCrowd: Grounding the Target in Crowd with Referring Expressions

RefCrowd is a new challenging referring comprehension dataset for complex real-world crowd grounding, which towards looking for the target person in crowd with referring expressions.