SAM-Translator
SAM-Translator is a proposed system designed to convert text from a source language directly into speech in a target language.
This demo shows samples from the CVSS and CoVoST2 datasets. You can explore the translations by selecting different languages from the dropdown.
- Speech Translation: Speech is synthesized by our proposed SAM-Translator, and the text is transcribed using a pretrained ASR (Whisper) model.
- Speech Ground Truth: Speech and text are taken from the ground-truth dataset for reference.
CVSS Dataset
Loading CVSS data...
| Index |
Input Text |
Speech Translation (SAM-Translator 10k) |
Speech Ground Truth |
CoVoST2 Dataset
Loading CoVoST2 data...
| Index |
Input Text |
Speech Translation (SAM-Translator) |
Speech Ground Truth |