1 论文解读
paper:VIT
1.1 VIT模型架构如下图所示:
图片原始输入维度 H * W * C在H和W按像素P切分,则H 、W可分割为 NPP, NHW/(PP),N为输入transform序列的长度。 x ∈ R H ∗ W ∗ C > x ∈ R N ∗ P 2 ∗ C x \in R^{H*W…
eReaders are best if all you want to do is have something as close to a paper book as possible. However, if you need anything more than that, a tablet makes more sense as a general-purpose device that can also read ebooks. 如果您只想拥有尽可能接近纸质书的东…