Fig. 6From: Object detection using convolutional neural networks and transformer-based models: a reviewThe detailed architecture of vision transformer [48]Back to article page