We propose MaskCut approach to generate pseudo-masks for multiple objects in an image. CutLER can learn unsupervised object detectors and instance segmentors solely on ImageNet-1K. CutLER exhibits ...
We propose an encoder-decoder for open-vocabulary semantic segmentation comprising a hierarchical encoder-based cost map generation and a gradual fusion decoder. We introduce a category early ...