PyTorch-based library for ranking predicted bounding boxes using a user's text or image prompts
Object detection models are usually trained to detect common object classes such as “car”, “person”, “cup”, and “bottle”. Sometimes, however, we need to detect more specific targets such as “lady in the red dress”, “bottle of whiskey”, or “where is my red cup” instead of “person”, “bottle”, and “cup”, respectively. One way to solve this problem is to train more complex detectors that can detect more complex classes, but we […]