Image Classification#

Imagenet-1K#

Note

For Vgg and Googlenet, there's a big gap in performance of pre-trained networks. The difference arises after the adaptive-pooling, which implies the networks can still be used as feature extractors (see results here).
For RegNets, the pretrained weights correspond to torchvision's IMAGENET1K_V2.
Swin_v2 pretrained is not supported.
ViT only supports DINO pretrained weights.