语音识别 -- 概述

时间：2020-11-20 15:08:19 阅读：65 评论：0 收藏：0 [点我收藏+]

1. 语音合成
zhrtvc：https://github.com/KuangDD/zhrtvc

2.离线语音识别

mozilla deepspeech：https://github.com/mozilla/DeepSpeech

PaddlePaddle deepspeech:https://github.com/PaddlePaddle/DeepSpeech

deepspeech2：

技术分享图片

kaldi：https://github.com/kaldi-asr/kaldi

介绍：Kaldi是一个C++实现的语音识别工具，它使用Apache v2.0开源协议。其主要目标用户为语音识别的研究者，由Dan Povey博士和捷克的BUT大学联合开发。

优点：

缺点：

athena：https://github.com/didi/athena https://github.com/athena-team/athena

vosk api： https://github.com/alphacep/vosk-api
传统vs深度学习

技术分享图片

深度学习--> 端到端

技术分享图片

3. 相关中文数据集

thchs30：http://www.openslr.org/

技术分享图片

Aishell：http://www.aishelltech.com/kysjcp

　　Aishell开源178小时的中文语音语料及基本训练脚本， 400个人讲，其中训练集340个人，测试解20个人，验证集40个人

原文：https://www.cnblogs.com/Towerb/p/14009846.html

踩

(0)

评论一句话评论（0）

分享档案

更多>