[Deep Learning Paper Notes][Attention] Spatial Transformer Networks
Jaderberg, Max, Karen Simonyan, Andrew Zisserman, and Koray Kavukcuoglu. "Spatial Transformer Networks." Advances in Neural Information Processing Systems. 2015. (Citations: 116).
1 Motivation
SAT restricts attention to a fixed grid structure; the goal here is a model that can focus on any part of the image, not just on predefined grid cells.
Pooling makes a network somewhat spatially invariant to the position of features, and this property holds even though max-pooling typically has a small spatial support. However, spatial invariance is only realized over a deep hierarchy of max-pooling and convolutional layers, and the intermediate feature maps of a CNN are not actually invariant to large transformations of the input.
The paper introduces a spatial transformer module that learns to select key features (attention) and to apply scaling, cropping, rotation, and non-rigid deformations to them.
2 Spatial Transformers
We want a differentiable module that applies a spatial transformation to a feature map in a single forward pass. For each output pixel coordinate (x_t, y_t), it computes the corresponding source coordinate (x_s, y_s) in the input feature map.
The coordinates x_s and y_s are normalized to the interval [-1, 1]. This parameterization supports cropping, translation by offsets s_x and s_y, clockwise rotation by θ, scaling by a factor α about a reference point (x_0, y_0), and shearing along both axes.
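In the affine case, the paper parameterizes this pointwise transformation as a 2×3 matrix A_θ acting on the target coordinates in homogeneous form:

$$
\begin{pmatrix} x_s \\ y_s \end{pmatrix}
= A_\theta \begin{pmatrix} x_t \\ y_t \\ 1 \end{pmatrix}
= \begin{bmatrix} \theta_{11} & \theta_{12} & \theta_{13} \\ \theta_{21} & \theta_{22} & \theta_{23} \end{bmatrix}
\begin{pmatrix} x_t \\ y_t \\ 1 \end{pmatrix}
$$

An attention-style crop with isotropic scale s and translation (t_x, t_y) is the special case A_θ = [[s, 0, t_x], [0, s, t_y]].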

For multi-channel inputs, each channel undergoes the same warping. Computing a source coordinate for every output pixel yields a sampling grid, from which the output is computed efficiently using bilinear interpolation.
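As a concrete illustration, here is a minimal NumPy sketch of bilinear sampling on a normalized grid; the function and variable names are my own, not from the paper:

```python
import numpy as np

def bilinear_sample(feature, grid_x, grid_y):
    # feature: (H, W) array for one channel; every channel reuses the grid.
    # grid_x, grid_y: source coordinates in normalized [-1, 1] space,
    # one entry per output pixel.
    H, W = feature.shape
    # Map normalized coordinates to pixel space.
    x = np.clip((grid_x + 1.0) * (W - 1) / 2.0, 0, W - 1)
    y = np.clip((grid_y + 1.0) * (H - 1) / 2.0, 0, H - 1)
    # Indices of the four neighbouring pixels.
    x0 = np.clip(np.floor(x).astype(int), 0, W - 2)
    y0 = np.clip(np.floor(y).astype(int), 0, H - 2)
    x1, y1 = x0 + 1, y0 + 1
    # Interpolation weights from the fractional parts.
    wx, wy = x - x0, y - y0
    top = (1 - wx) * feature[y0, x0] + wx * feature[y0, x1]
    bottom = (1 - wx) * feature[y1, x0] + wx * feature[y1, x1]
    return (1 - wy) * top + wy * bottom
```

Because the output is a piecewise-linear function of the sampling coordinates, (sub-)gradients flow back through both the feature values and the sampling locations, which is what makes the module trainable end to end.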
3 Architecture
See the figure. One can also use multiple spatial transformers in parallel; this is useful when a feature map contains several objects or parts of interest that should be focused on individually. A limitation of this architecture in a purely feed-forward network is that the number of parallel spatial transformers bounds the number of objects the network can model.
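The module itself consists of three parts: a localization network that regresses the transformation parameters, a grid generator, and a sampler. Below is a minimal PyTorch sketch of the affine case; the layer sizes in the localization network are illustrative assumptions, not the paper's exact architecture:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialTransformer(nn.Module):
    # Localization network -> grid generator -> sampler (affine case).
    def __init__(self, in_channels):
        super().__init__()
        # Localization network: regresses the 6 affine parameters theta.
        self.loc = nn.Sequential(
            nn.Conv2d(in_channels, 8, kernel_size=7), nn.MaxPool2d(2), nn.ReLU(),
            nn.Conv2d(8, 10, kernel_size=5), nn.MaxPool2d(2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(10, 6),
        )

    def forward(self, x):
        theta = self.loc(x).view(-1, 2, 3)
        # Grid generator: normalized sampling coordinates in [-1, 1].
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        # Sampler: bilinear interpolation at the grid points.
        return F.grid_sample(x, grid, align_corners=False)
```

Several such modules can be inserted at different depths of a network, or run in parallel on the same feature map as noted above.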

4 Training Details
For training, we initialize the module to the identity transformation: the final layer of the localization network is initialized so that it outputs the identity transform. This makes the spatial transformer's initial output equal to its input.
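Continuing the hypothetical SpatialTransformer sketch from Section 3, identity initialization amounts to zeroing the weights of the final regression layer and setting its bias to the flattened 2×3 identity transform:

```python
import torch
import torch.nn as nn

stn = SpatialTransformer(in_channels=1)  # hypothetical module from Section 3
last = stn.loc[-1]                       # final nn.Linear(10, 6) regressor
nn.init.zeros_(last.weight)              # theta then depends only on the bias
with torch.no_grad():
    # Bias = flattened 2x3 identity: sampling grid equals the input grid.
    last.bias.copy_(torch.tensor([1.0, 0.0, 0.0, 0.0, 1.0, 0.0]))
```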
5 Results
As shown in the figure, inserting spatial transformers into a classification network enables the network to learn how to attend to and transform its input.

