网站介绍
SEW-D-tiny
SEW-D by ASAPP Research
The base model pretrained on 16kHz sampled speech audio. When using the model make sure that your speech input is also sampled at 16Khz. Note that this model should be fine-tuned on a downstream task, like Automatic Speech Recognition, Speaker Identification, Intent Classification, Emotion Recognition, etc…
Paper: Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition
Authors: Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Han, Kilian Q. Weinberger, Yoav Artzi
Abstract
This paper is a study of performance-efficiency trade-offs in pre-trained models for automatic speech recognition (ASR). We focus on wav2vec 2.0, and formalize several architecture designs that influence both the model performance and its efficiency. Putting together all our observations, we introduce SEW (Squeezed and Efficient Wav2vec), a pre-trained model architecture with significant improvements along both performance and efficiency dimensions across a variety of training setups. For example, under the 100h-960h semi-supervised setup on LibriSpeech, SEW achieves a 1.9x inference speedup compared to wav2vec 2.0, with a 13.5% relative reduction in word error rate. With a similar inference time, SEW reduces word error rate by 25-50% across different model sizes.
The original model can be found under https://github.com/asappresearch/sew#model-checkpoints .
Usage
See this blog for more information on how to fine-tune the model. Note that the class Wav2Vec2ForCTC
has to be replaced by SEWDForCTC
.
本站Ai工具导航提供的“asapp/sew-d-tiny-100k”来源于网络,不保证外部链接的准确性和完整性,同时,对于该外部链接的指向,不由“Ai工具导航”实际控制,在“2025-10-05 21:03:38”收录时,该网页上的内容,都属于合规合法,后期网页的内容如出现违规,可以直接联系网站管理员进行删除,“Ai工具导航”不承担任何责任。
流量统计
- 7天
- 30天
- 90天
- 365天
猜你喜欢
Iconoir
940+ 高质量开源图标库High quality open source icon library...Boxicons
高质量 1,400+ 开源图标库High quality 1400 icons 43; open sou...谷歌字体
近千款免费字体库/WebFontNearly a thousand free font libraries / WebF...方正字库
免费和商业字体库Free and commercial font library...造字工坊
高质量原创字体设计高质量原创字体设计...iFontCloud 文鼎雲字庫
台湾著名文鼎字体The famous Wending font in Taiwan...Fonts In Use
设计中用到的字体(可搜公司)设计中用到的字体(可搜公司)...Kukla Kit
3D 风格可组合人物物件插画3D style can combine characters and objects fo...AVATARZ
3D 风格人物插画3D style figure illustration...Handz
3D 风格手型插画3D style hand illustration...Getillustrations
插画素材包可在线调色The illustration material package can adjust color...Doodle Ipsum
免费可组合插画占位图Free and combinable illustrator placeholder...
- 关注我们
-
扫一扫二维码关注我们的微信公众号
- 网址推荐
- 热门标签
-
- 游戏(4562)
- 街机游戏合集(4329)
- 街机游戏(4329)
- 在线游戏集合(4329)
- 小霸王游戏(4329)
- 街机在线(4329)
- nes合集游戏(4328)
- 在线小游戏网站(4328)
- 游戏榜(4328)
- 红白机游戏盒(4328)
- GBA(1796)
- 街机(555)
- 动作冒险(400)
- 青檬花园(374)
- 角色扮演(354)
- 小游戏(346)
- 动作(341)
- 汉化(332)
- SFC(328)
- 运动比赛(321)
- 深度导航(309)
- 免费(294)
- 射击(292)
- AIGC导航(277)
- 创意(265)
- 国内精选服务商(255)
- 中文(247)
- 冒险(240)
- 工具达人(239)
- AI写作工具(232)
- 探索发现(221)
- 有趣网站(220)
- 平台(219)
- 摸鱼网站(219)
- 网络创意(219)
- 脑洞网站(219)
- 格斗(212)
- 人工智能(199)
- 视频(198)
- 翻译(187)
- 动漫(161)
- 的(153)
- Video(152)
- 数字人(151)
- 数据分析(145)
- 在线工具(139)
- ppt(138)
- 文生图(134)
- logo(134)
- 网页游戏(130)