|
+ 1 = + 1 = 55. Since the kernel output is the same length as width, its area is 55×55.)]]<noinclude> |
|
] and ] convolution, pooling, and dense layers<br>(AlexNet image size should be 227×227×3, instead of 224×224×3, so the math will come out right. The original paper said different numbers, but Andrej Karpathy, the former head of computer vision at Tesla, said it should be 227×227×3 (he said Alex didn't describe why he put 224×224×3). The next convolution should be 11×11 with stride 4: 55×55×96 (instead of 54×54×96). It would be calculated, for example, as: + 1 = + 1 = 55. Since the kernel output is the same length as width, its area is 55×55.)]]<noinclude> |