InstructSAM is a training-free framework for Instruction-Oriented Object Counting, Detection, and Segmentation (InstructCDS). We construct EarthInstruct, an InstructCDS benchmark for remote sensing.
DMLViT: Dynamic Multi-Scale Local Vision Transformer for Object Counting in Congested Traffic Scenes
Abstract: Object counting in congested traffic scenes is an important component of traffic perception, facilitating urban traffic management and public transportation capacity optimization. Vision ...
Abstract: This study presents TET-Count, a novel category-agnostic model for object counting from natural language prompts, addressing limitations in existing methods requiring extensive annotated ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results