speech

A collection of 20 posts

speech

Sooftware Speech - Korean Tacotron2

Korean Tacotron2 This post covers how to build a Korean TTS system with the Tacotron2 architecture. Tacotron2 Tacotron2 was introduced by Google in December 2017 in NATURAL TTS SYNTHESIS BY…

speech, nlp, paper

Sooftware NLP - Textless NLP

Textless NLP: Generating expressive speech from raw audio paper / code / pre-train model / blog Name: Generative Spoken Language Model (GSLM…

speech, paper

Sooftware Speech - Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition Paper Review

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition Yu Zhang et al., 2020 Google Research, Brain Team Reference…

speech, toolkit, record

PORORO Text-To-Speech (TTS)

PORORO Text-To-Speech (TTS) In the PORORO: Platform Of neuRal mOdels for natuRal language prOcessing library that our team recently released, the TTS I put a lot of work into building…

speech, paper

Sooftware Speech - EMNLP Paper Review: Speech

EMNLP Paper Review: Speech Adaptive Feature Selection for End-to-End Speech Translation (Biao Zhang et al) Incremental Text-to-Speech…

speech, tts, paper

Sooftware Speech - One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech Paper Review

One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech Tomáš Nekvinda, Ondřej Dušek Charles University INTERSPEECH, 202…

speech, paper

Sooftware Speech - Wav2vec 2.0 : A Framework for Self-Supervised Learning of Speech Representations

wav2vec 2.0 : A Framework for Self-Supervised Learning of Speech Representations Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael…

speech, paper

Sooftware Speech - Conformer Paper Review

Conformer: Convolution-augmented Transformer for Speech Recognition Anmol Gulati et al. Google Inc. INTERSPEECH, 2020 Reference Conformer…

speech

Sooftware Speech - AI & Speech Processing: Application-2

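AI & Speech Processing: Application-2 This post was written after attending lectures by Professor Park Ho-jong (박호종) of the Department of Electronic Engineering at Kwangwoon University. Speaker Verification and Identification Verification…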

speech

Sooftware Speech - AI & Speech Processing: Application-1

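AI & Speech Processing: Application-1 This post was written after attending lectures by Professor Park Ho-jong (박호종) of the Department of Electronic Engineering at Kwangwoon University. Speech/audio/sound…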

speech, dsp

Sooftware Speech - AI & Speech Processing: DSP for Audio

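AI & Speech Signal Processing Lecture : DSP for Audio This post was written after attending lectures by Professor Park Ho-jong (박호종) of the Department of Electronic Engineering at Kwangwoon University. Now let's move on to DSP specialized for audio. Short-Time…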

speech, dsp

Sooftware Speech - AI & Speech Processing: DSP-2

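AI & Speech Processing: DSP-2 This post was written after attending lectures by Professor Park Ho-jong (박호종) of the Department of Electronic Engineering at Kwangwoon University. DFT (Discrete Fourier Transform) For digital processing, time and frequency…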

speech, dsp

Sooftware Speech - AI & Speech Processing: DSP-1

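AI & Speech Processing: DSP-1 This post was written after attending lectures by Professor Park Ho-jong (박호종) of the Department of Electronic Engineering at Kwangwoon University. DSP Review Time-to-Frequency transform Continuous-Time Fourier…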

speech, paper

Sooftware Speech - ClovaCall Paper Review

ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers Paper link 2020-04-2…

speech, paper

Sooftware Speech - STATE-OF-THE-ART SPEECH RECOGNITION WITH SEQUENCE-TO-SEQUENCE MODEL Paper Review

「STATE-OF-THE-ART SPEECH RECOGNITION WITH SEQUENCE-TO-SEQUENCE MODEL」 Review title https://arxiv.org/abs/1712.0176…

speech, paper

Sooftware Speech - Attention-Based Models for Speech Recognition Paper Review

Attention-Based Models for Speech Recognition Paper Review title http://papers.nips.cc/paper/5847-attention-based-models-for-speech…

speech, paper

Sooftware Speech - SpecAugment Paper Review

SpecAugment: 「A Simple Data Augmentation Method for Automatic Speech Recognition」 Review title https://arxiv.org/abs/1904.08779 Abstract…

speech, paper

Sooftware Speech - DeepSpeech Paper Review

Deep Speech: Scaling up end-to-end speech recognition title https://arxiv.org/pdf/1412.5567.pdf (Awni Hannun et al. 2014) Abstract…

speech, paper

Sooftware Speech - Listen, Attend and Spell Paper Review

「Listen, Attend and Spell」 Review title https://arxiv.org/abs/1508.01211 (William Chan et al. 2015) Introduction Attention-based Seq2seq…

dsp, speech

Sooftware Speech - MFCC (Mel-Frequency Cepstral Coefficient)

MFCC (Mel-Frequency Cepstral Coefficient) Based on the paper ‘Voice Recognition Using MFCC Algorithm’. What is MFCC? In speech recognition, MFCC, Mel-Spectrogram…