Discussion about this post

User's avatar
Andrew Walsh's avatar

I am amazed this works with such a small set. One way of reducing the tedium of manual annotation is to use a better object detection algorithm to do it for you. Grounding DINO is one of these. You can get this on hugging face spaces eg. https://huggingface.co/spaces/merve/Grounding_DINO_demo

It is slow to use on hugging face but can be downloaded and used on your PC with GPU or on runpod/vast.ai/COLAB.

I just tried it with "chess piece" and it did a pretty good job. It is not smart enough to know what a black king is though. So you would need to manually change annotation class for Ronan's sample chess piece demo. This is an example of a zero shot detector so it can work with objects outside of those in standard classes like COCO. With IDE's like cursor or (my fave) cline you can also generate python code to quickly cycle through a dataset and change/add/delete bounding boxes and class names without too much fuss. Grounding DINO is by no means perfect so it will make errors but can dramatically reduce the amount of manual annotation needed.

Expand full comment
2 more comments...

No posts