2021
CoAtNet
Zihang Dai, Hanxiao Liu, Quoc V. Le, Mingxing Tan
CoAtNets are hybrid networks that unify depthwise convolution and self-attention through relative attention. Convolution layers and attention layers are stacked vertically in a principled way, which the authors report improves generalization, capacity, and efficiency.
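To make the idea of relative attention concrete, here is a minimal sketch in plain Python. It assumes a simplified setting that is not spelled out in the summary above: a single head, a 1D sequence of tokens, no learned query/key/value projections, and a hypothetical bias table `w` indexed by the offset i - j. The bias is added to the dot-product logit before the softmax, so the attention weight combines an input-dependent term (as in self-attention) with a translation-invariant term (as in a depthwise convolution kernel). The function name and argument layout are illustrative only.

```python
import math


def relative_attention(x, w):
    """Single-head relative attention over a 1D sequence (illustrative sketch).

    x: list of L token vectors (each a list of floats of equal length).
    w: list of 2L - 1 scalars; w[(i - j) + L - 1] is the bias for offset i - j.
       This layout is an assumption made for the example.
    Returns a list of L output vectors.
    """
    L = len(x)
    dim = len(x[0])
    assert len(w) == 2 * L - 1, "need one bias per relative offset"

    out = []
    for i in range(L):
        # Logit = content term (dot product) + position term (relative bias).
        logits = [
            sum(a * b for a, b in zip(x[i], x[j])) + w[(i - j) + L - 1]
            for j in range(L)
        ]
        # Numerically stable softmax over j.
        m = max(logits)
        exps = [math.exp(v - m) for v in logits]
        z = sum(exps)
        weights = [e / z for e in exps]
        # Output is the attention-weighted average of the input tokens.
        out.append(
            [sum(weights[j] * x[j][d] for j in range(L)) for d in range(dim)]
        )
    return out


if __name__ == "__main__":
    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    zero_bias = [0.0] * 5  # with all-zero bias this reduces to plain attention
    for row in relative_attention(tokens, zero_bias):
        print(row)
```

With an all-zero `w` the sketch reduces to ordinary dot-product attention without projections; with the dot-product term removed it would behave like a normalized, input-independent kernel over offsets, which is the convolution-like half of the combination.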