Paper Notes: Evolving Losses for Unsupervised Video Representation Learning
Distillation
Background on knowledge distillation (summarized from a Zhihu post): distill knowledge from a teacher model Net-T into a student model Net-S.

Purpose: obtain a compact model that is easy to deploy.
L=\alpha L_{s o f t}+\beta L_{h a r d}
L_{s o f t}=-\sum_{j}^{N} p_{j}^{T} \log \left(q_{j}^{T}\right), \text { where } p_{i}^{T}=\frac{\exp \left(v_{i} / T\right)}{\sum_{k}^{N} \exp \left(v_{k} / T\right)}, \quad q_{i}^{T}=\frac{\exp \left(z_{i} / T\right)}{\sum_{k}^{N} \exp \left(z_{k} / T\right)}
Here $v_i$ are the teacher's logits and $z_i$ are the student's logits.
L_{h a r d}=-\sum_{j}^{N} c_{j} \log \left(q_{j}^{1}\right), \text { where } q_{i}^{1}=\frac{\exp \left(z_{i}\right)}{\sum_{j}^{N} \exp \left(z_{j}\right)}
The first term learns from the teacher model; the second term learns from the ground truth.
The temperature controls how much attention Net-S pays to the negative labels during training: at low temperature, the negative labels, especially those significantly below the average, receive little attention; at high temperature, the values associated with the negative labels are relatively amplified, and Net-S attends to them comparatively more.
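As a sanity check, here is a minimal PyTorch sketch of this combined loss. The function name `distillation_loss` and the hyperparameter values `T` and `alpha` are illustrative choices, not from the paper; the KL divergence below differs from the cross-entropy $L_{soft}$ only by the (constant) teacher entropy, so the gradients agree.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    """Combined distillation loss L = alpha * L_soft + (1 - alpha) * L_hard."""
    # L_soft: match the teacher's softened distribution at temperature T.
    # KL divergence equals the cross-entropy -sum p log q up to the
    # (constant) teacher entropy, so the gradients are identical.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradient magnitude stays comparable across T
    # L_hard: ordinary cross-entropy against the ground-truth labels (T = 1).
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```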
Main idea: Multiple modalities to multiple tasks

Loss Function
\mathcal{L}=\sum_{m} \sum_{t} \lambda_{m, t} \mathcal{L}_{m, t}+\sum_{d} \lambda_{d} \mathcal{L}_{d}
where
\lambda_{m,t} and \lambda_{d} are the loss weights
\mathcal{L}_{m,t} is the loss of task t applied to modality m
\mathcal{L}_{d} is the L_2 distance between a layer M_i of the main network and the corresponding layer L_i of another modality network:
\mathcal{L}_{d}\left(L_{i}, M_{i}\right)=\left\|L_{i}-M_{i}\right\|_{2}
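A minimal sketch of how this combined objective might be assembled, assuming the per-task losses and the layer activations $(L_i, M_i)$ have already been computed; all names here are hypothetical.

```python
import torch

def total_loss(task_losses, lambda_mt, distill_pairs, lambda_d):
    """task_losses: dict mapping (modality, task) -> scalar loss tensor.
    distill_pairs: list of (L_i, M_i) activation tensors to align."""
    # Weighted sum over all (modality, task) losses.
    loss = sum(lambda_mt[key] * task_losses[key] for key in task_losses)
    # Add the L2 distillation distances between corresponding layers.
    for lam, (L_i, M_i) in zip(lambda_d, distill_pairs):
        loss = loss + lam * torch.norm(L_i - M_i, p=2)
    return loss
```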
Evolution Algorithm
A genetic algorithm is used to determine the weights \lambda.
Each λ_{m,t} and λ_d is constrained to lie in [0, 1].
Unsupervised loss function
Zipf Distribution matching (ELo)
Given cluster centroids \left\{c_{1}, c_{2}, \ldots, c_{k}\right\}, \text { where } c_{i} \in \mathcal{R}^{D}, and naively assuming all clusters share the same variance with 2 \sigma^{2}=1, we can compute the probability of a feature vector x \in \mathcal{R}^{D} belonging to a cluster c_i as
p\left(x \mid c_{i}\right)=\frac{1}{\sqrt{2 \sigma^{2} \pi}} \exp \left(-\frac{\left(x-c_{i}\right)^{2}}{2 \sigma^{2}}\right)
By Bayes' rule:
\begin{aligned} p\left(c_{i} \mid x\right) &=\frac{p\left(c_{i}\right) p\left(x \mid c_{i}\right)}{\sum_{j=1}^{k} p\left(c_{j}\right) p\left(x \mid c_{j}\right)}=\frac{\exp \left(-\frac{\left(x-c_{i}\right)^{2}}{2 \sigma^{2}}\right)}{\sum_{j=1}^{k} \exp \left(-\frac{\left(x-c_{j}\right)^{2}}{2 \sigma^{2}}\right)} \\ &=\frac{\exp \left(-\left(x-c_{i}\right)^{2}\right)}{\sum_{j=1}^{k} \exp \left(-\left(x-c_{j}\right)^{2}\right)} \end{aligned}
which is the standard softmax function (assuming a uniform prior p(c_i) over clusters, so the priors cancel).
Given the above probability of each video belonging to each cluster, we define a Zipf prior over the classes as q\left(c_{i}\right)=\frac{1 / i^{s}}{H_{k, s}}, where H_{k,s} is the k-th generalized harmonic number and s is a real constant.
p\left(c_{i}\right)=\frac{1}{N} \sum_{x \in V} p\left(c_{i} \mid x\right) is the empirical cluster distribution, averaged over all N videos in the set V.
The KL divergence between the empirical distribution p and the Zipf prior q is
K L(p \| q)=\sum_{i=1}^{k} p\left(c_{i}\right) \log \left(\frac{p\left(c_{i}\right)}{q\left(c_{i}\right)}\right)
This KL divergence serves as the fitness function: it imposes a prior constraint that the distribution of the (learned) video representations over clusters should follow the Zipf distribution.
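A rough NumPy sketch of this fitness computation under the assumptions above (shared variance, 2σ² = 1). The function name `elo_fitness` and the defaults `k` and `s` are illustrative, and sorting p so that the largest cluster is compared against Zipf rank 1 is my assumption, not a detail confirmed by the notes.

```python
import numpy as np
from sklearn.cluster import KMeans

def elo_fitness(features, k=10, s=1.0):
    """KL(p || q) between the empirical cluster distribution p and a Zipf
    prior q. `features` is an (N, D) array of video embeddings."""
    centroids = KMeans(n_clusters=k, n_init=10).fit(features).cluster_centers_
    # Soft assignment: softmax over negative squared distances (2*sigma^2 = 1).
    d2 = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(-1)  # (N, k)
    logits = -d2
    logits -= logits.max(axis=1, keepdims=True)          # numerical stability
    p_given_x = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
    p = p_given_x.mean(axis=0)                           # marginal p(c_i)
    # Zipf prior q(c_i) = (1/i^s) / H_{k,s}; q.sum() is the harmonic number.
    q = 1.0 / np.arange(1, k + 1) ** s
    q /= q.sum()
    # Sort p descending so the largest cluster matches rank 1 (assumption).
    p = np.sort(p)[::-1]
    return float(np.sum(p * np.log(p / q)))
```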
Loss Evolution
The search combines tournament selection with CMA-ES to evolve the loss weights.
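A toy sketch of tournament selection over weight vectors, keeping each λ in [0, 1]. The actual method additionally uses CMA-ES and evaluates fitness by training a network per candidate, both omitted here; `evolve_lambdas` and all hyperparameters are illustrative.

```python
import numpy as np

def evolve_lambdas(fitness_fn, n_weights, pop_size=50, generations=200,
                   tournament_k=5, sigma=0.1):
    """Evolve loss-weight vectors in [0, 1]^n; lower fitness is better."""
    pop = np.random.rand(pop_size, n_weights)
    fit = np.array([fitness_fn(w) for w in pop])
    for _ in range(generations):
        # Tournament selection: fittest of k random candidates is the parent.
        idx = np.random.choice(pop_size, tournament_k, replace=False)
        parent = pop[idx[np.argmin(fit[idx])]]
        # Gaussian mutation, clipped to keep every lambda in [0, 1].
        child = np.clip(parent + sigma * np.random.randn(n_weights), 0.0, 1.0)
        child_fit = fitness_fn(child)
        # Replace the worst individual if the child improves on it.
        worst = np.argmax(fit)
        if child_fit < fit[worst]:
            pop[worst], fit[worst] = child, child_fit
    return pop[np.argmin(fit)]
```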
