
RWTH-PHOENIX Weather Dataset: Description and Download


About the RWTH-PHOENIX Weather 2014 T dataset:

Between 2009 and 2011, the German public TV station PHOENIX broadcast daily news and weather forecasts accompanied by sign language interpretation, and 386 editions of the weather forecast were annotated using sign-language gloss notation.

In addition, the original spoken German was transcribed using automatic speech recognition combined with manual post-processing. Together, the gloss annotations and transcriptions support building sign language translation systems that go from sign language video input to spoken-language output.
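The annotations therefore pair gloss sequences with German sentences. As a minimal sketch of how one might inspect them, assuming the '|'-delimited CSV layout and the column names `orth` (gloss) and `translation` (German) of the v3 release (verify against your copy):

```python
# Sketch: print the gloss sequence and German translation of the first
# training sample. Path and column names are assumptions from the v3 release.
import csv

path = ("PHOENIX-2014-T-release-v3/PHOENIX-2014-T/annotations/manual/"
        "PHOENIX-2014-T.train.corpus.csv")

with open(path, newline="", encoding="utf-8") as f:
    for row in csv.DictReader(f, delimiter="|"):
        print(row["name"], "|", row["orth"], "->", row["translation"])
        break  # just the first sample
```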

This post briefly explains how to use the dataset and provides a direct download link.

Directory structure and notes:

The files of the RWTH-PHOENIX Weather 2014 T dataset are organized as follows:
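The listing below is a sketch of the v3 archive layout as I recall it; exact folder and file names may differ, so verify against your extracted copy:

```
PHOENIX-2014-T-release-v3/
└── PHOENIX-2014-T/
    ├── annotations/
    │   └── manual/
    │       ├── PHOENIX-2014-T.train.corpus.csv
    │       ├── PHOENIX-2014-T.dev.corpus.csv
    │       └── PHOENIX-2014-T.test.corpus.csv
    ├── evaluation/
    ├── features/
    │   └── fullFrame-210x260px/
    │       ├── train/
    │       ├── dev/
    │       └── test/
    └── models/
```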

If you use the dataset in your research, please cite: Necati Cihan Camgöz et al., "Neural Sign Language Translation", IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, 2018.

The sign language interpretation was recorded with a stationary color camera placed in front of the interpreter, who wears dark clothing in front of an artificial background. All videos were recorded at 25 frames per second, each frame measures 210 by 260 pixels, and each frame shows only the interpreter window.
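Given that format (25 fps, 210x260 frames), here is a minimal sketch for loading one extracted frame sequence; the `fullFrame-210x260px` sub-path, the PNG frame format, and the `<sequence_name>` placeholder are assumptions based on the v3 layout:

```python
# Sketch: load all frames of one sequence as PIL images.
from glob import glob
from PIL import Image

frames_dir = ("PHOENIX-2014-T-release-v3/PHOENIX-2014-T/features/"
              "fullFrame-210x260px/train/<sequence_name>")

frames = [Image.open(p) for p in sorted(glob(frames_dir + "/*.png"))]
print(len(frames), frames[0].size)  # frame count and (width, height)
```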

Dataset download: https://www-i6.informatik.rwth-aachen.de/ftp/pub/rwth-phoenix/2016/phoenix-2014-T.v3.tar.gz
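A minimal sketch for fetching and unpacking the archive; the file is large, so a resumable downloader may be preferable in practice:

```python
# Sketch: download the tarball and extract it into data/.
import tarfile
import urllib.request

URL = ("https://www-i6.informatik.rwth-aachen.de/ftp/pub/rwth-phoenix/"
       "2016/phoenix-2014-T.v3.tar.gz")

urllib.request.urlretrieve(URL, "phoenix-2014-T.v3.tar.gz")
with tarfile.open("phoenix-2014-T.v3.tar.gz", "r:gz") as tar:
    tar.extractall("data/")
```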

Below are several papers that use this dataset, for reference:

Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation

Existing studies on Sign Language Translation have shown that incorporating a mid-level sign gloss representation significantly improves translation performance; notably, the current state of the art in translation requires gloss-level tokenization in order to work. We introduce a novel transformer-based architecture that jointly learns Continuous Sign Language Recognition and Translation in an end-to-end manner. This is achieved with a Connectionist Temporal Classification (CTC) loss that binds the recognition and translation problems into a single unified architecture. The joint approach does not require any ground-truth timing information, simultaneously solves two co-dependent sequence-to-sequence learning problems, and leads to significant performance gains. We evaluate the recognition and translation performance of our approach on the RWTH-PHOENIX-Weather-2014T dataset and report state-of-the-art results on both tasks. Our translation networks outperform both sign-video-to-spoken-language and gloss-to-spoken-language translation models, in some cases more than doubling the performance (9.58 vs. 21.80 BLEU-4 score). We also share new baseline results using transformer networks for several other text-to-text sign language translation tasks.
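As a conceptual illustration (not the authors' code), the joint objective can be sketched as a CTC loss over gloss sequences for recognition plus a cross-entropy loss over spoken-language tokens for translation; all function names, shapes, and weights below are illustrative assumptions:

```python
# Sketch of a joint recognition + translation objective in PyTorch.
import torch
import torch.nn as nn

ctc_loss = nn.CTCLoss(blank=0, zero_infinity=True)
xent_loss = nn.CrossEntropyLoss(ignore_index=0)  # 0 = padding id (assumption)

def joint_loss(gloss_logits, gloss_targets, input_lens, target_lens,
               word_logits, word_targets,
               recognition_weight=1.0, translation_weight=1.0):
    # gloss_logits: (time, batch, gloss_vocab); CTC expects log-probs.
    recognition = ctc_loss(gloss_logits.log_softmax(-1), gloss_targets,
                           input_lens, target_lens)
    # word_logits: (batch, seq, vocab) from the autoregressive decoder.
    translation = xent_loss(word_logits.flatten(0, 1), word_targets.flatten())
    return recognition_weight * recognition + translation_weight * translation
```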

Paper: https://arxiv.org/pdf/2003.13830v1.pdf

Gloss Attention for Gloss-free Sign Language Translation

Most current sign language translation (SLT) approaches rely on gloss annotations to provide extra supervision, but obtaining gloss labels is not easy. To address this, we first analyze existing models to investigate how gloss annotations make SLT easier. We find that they provide two kinds of information to the model: 1) they help it implicitly learn the location of semantic boundaries in continuous sign language videos, and 2) they help it understand the sign language video globally. We then propose gloss attention, which enables the model to keep its attention within video segments that share the same local semantics, just as gloss annotations help existing models. In addition, we transfer knowledge of sentence-to-sentence similarity from natural language models into our gloss attention SLT network (GASLT) to help it understand sign language videos at the sentence level. Experimental results on multiple large-scale sign language datasets show that our proposed GASLT model significantly outperforms existing methods. Code is available at https://github.com/YinAoXiong/GASLT.
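The core locality idea can be illustrated with a simplified window-masked attention; the actual gloss attention mechanism in GASLT is more involved, and `window_size` here is purely an illustrative parameter:

```python
# Sketch: each query attends only to keys within a fixed temporal window,
# mimicking the segment-level focus that gloss supervision provides.
import torch

def local_attention(q, k, v, window_size=7):
    # q, k, v: (batch, seq_len, dim)
    seq_len = q.size(1)
    scores = q @ k.transpose(-2, -1) / q.size(-1) ** 0.5
    idx = torch.arange(seq_len)
    outside = (idx[None, :] - idx[:, None]).abs() > window_size // 2
    scores = scores.masked_fill(outside, float("-inf"))
    return scores.softmax(-1) @ v
```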

Paper: https://arxiv.org/pdf/2307.07361v1.pdf

Temporal Lift Pooling for Continuous Sign Language Recognition

In modern neural networks, pooling methods are essential for enlarging receptive fields and reducing computational cost; this paper derives a pooling operator from the lifting scheme of signal processing.
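As a rough sketch of the lifting idea (split the signal into even/odd samples, predict one from the other, update with the residual), not the paper's exact implementation; the conv-based predictor and updater are illustrative assumptions:

```python
# Sketch: lifting-style temporal pooling that halves the time dimension.
import torch
import torch.nn as nn

class LiftPool1d(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.predict = nn.Conv1d(channels, channels, 3, padding=1)
        self.update = nn.Conv1d(channels, channels, 3, padding=1)

    def forward(self, x):                      # x: (batch, channels, time)
        even, odd = x[..., ::2], x[..., 1::2]  # assumes even temporal length
        detail = odd - self.predict(even)      # high-frequency residual
        approx = even + self.update(detail)    # smoothed, downsampled signal
        return approx                          # time halved, like stride-2 pooling
```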

Paper: https://arxiv.org/pdf/2207.08734v1.pdf
