GroupViT is a framework for learning semantic segmentation purely from text captions without using any mask supervision. It learns to perform bottom-up heirarchical spatial grouping of ...
Since Github doesn't provide a great way for you to learn about new releases and features, don't just star the repo, join the mailing list. dsq will likely work on other platforms that Go is ported to ...