AIAS/3_audio_sdks/README.md
2024-10-20 16:27:54 +08:00

72 lines
1.9 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

#### 项目清单:
- 3_audio_sdks - [语音处理 SDK]
```text
1). 工具箱系列音素工具箱librosajava soundjavacv ffmpeg, fft, vad工具箱等。
2). 声音克隆
3). 语音合成
4). 声纹识别
5). 语音识别
...
```
<div align="center">
<table>
<tr>
<td>
<div align="left">
<p>语音识别ASR【短语音】 - asr_whisper_sdk</p>
中文语音识别。
</div>
</td>
<td>
<div align="center">
<img src="https://aias-home.oss-cn-beijing.aliyuncs.com/AIAS/voice_sdks/asr.jpeg" width = "400px"/>
</div>
</td>
</tr>
<tr>
<td>
<div align="left">
<p>语音识别ASR【长语音】 - asr_whisper_long_sdk</p>
中文语音识别。
</div>
</td>
<td>
<div align="center">
<img src="https://aias-home.oss-cn-beijing.aliyuncs.com/AIAS/voice_sdks/asr.jpeg" width = "400px"/>
</div>
</td>
</tr>
<tr>
<td style="width:220px">
<div align="left">
<p>语音处理包Librosa- librosa_sdk</p>
python语音处理库librosa的java实现。
</div>
</td>
<td>
<div align="center">
<img src="https://aias-home.oss-cn-beijing.aliyuncs.com/AIAS/voice_sdks/phoneme.jpeg" width = "400px"/>
</div>
</td>
</tr>
<tr>
<td style="width:220px">
<div align="left">
<p>TTS 文本转为语音 - tts_sdk</p>
TTS 文本转为语音。
</div>
</td>
<td>
<div align="center">
<img src="https://aias-home.oss-cn-beijing.aliyuncs.com/AIAS/voice_sdks/SV2TTS.png" width = "400px"/>
</div>
</td>
</tr>
</table>
</div>