Using an attention model for fashion image category classification

To classify fashion images into categories without using their attributes, can we use an attention mechanism together with a CNN (VGG16)? If so, what kinds of attention can be used? Or can attention mechanisms only be used for multi-label classification? Can anyone please help me get an idea about this?
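
For concreteness, one way attention is commonly combined with a CNN for single-label classification is attention-weighted pooling: each spatial location of the convolutional feature map gets a score, the scores are normalized with a softmax, and the weighted sum of location features is passed to an ordinary softmax classifier. Below is a minimal, framework-free sketch of that idea. It is an assumption about the kind of setup meant here, not a recommended architecture: random numbers stand in for VGG16 features, nothing is trained, and every name and size (`attention_pool`, `score_vec`, `class_weights`, `C`, `N`, `K`) is hypothetical.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attention_pool(features, score_vec):
    """Attention-weighted pooling over spatial locations.

    features:  list of N location vectors, each of length C
               (the flattened H*W grid of a conv feature map).
    score_vec: length-C vector that scores each location
               (it would be learned in a real model).
    Returns the pooled length-C vector and the N attention weights.
    """
    scores = [dot(f, score_vec) for f in features]
    weights = softmax(scores)  # one weight per location, summing to 1
    channels = len(features[0])
    pooled = [
        sum(w * f[c] for w, f in zip(weights, features))
        for c in range(channels)
    ]
    return pooled, weights


def classify(pooled, class_weights):
    """Single-label head: one logit per category, softmax over categories."""
    logits = [dot(pooled, w) for w in class_weights]
    return softmax(logits)


random.seed(0)
# Hypothetical toy sizes. For a 224x224 input, VGG16's output after its
# final pooling layer has 512 channels on a 7x7 grid, i.e. C=512, N=49.
C, N, K = 8, 49, 5
features = [[random.gauss(0, 1) for _ in range(C)] for _ in range(N)]
score_vec = [random.gauss(0, 1) for _ in range(C)]
class_weights = [[random.gauss(0, 1) for _ in range(C)] for _ in range(K)]

pooled, attn = attention_pool(features, score_vec)
probs = classify(pooled, class_weights)
predicted = max(range(K), key=lambda k: probs[k])
```

The final softmax over `K` categories is what makes this a single-label setup; nothing in the pooling step itself depends on whether the labels are single or multiple.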

