Page 3 of 4 for Jisoo’s Blog

[Concept Summary] What Is the Internet of Things? Which Technologies Are Used in the Fourth Industrial Revolution?

Jun 29, 2023 •
This post describes the technologies used in the Fourth Industrial Revolution.
[Troubleshooting] Installing PyTorch and CUDA, and Wrestling with Geometric

Jun 24, 2023 •
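This guide assumes the NVIDIA drivers are already installed. It is Docker-based, but it should apply to a conda environment without trouble. (Note) The Geometric maintainers recommend a virtual environment over a local one. If you would rather skip the details, just set up python=3.7.16 and CUDA 11.7 and run the requirements.txt provided below. Download a publicly available image from Docker Hub with CUDA 11.7 and Ubuntu 20.04.
[Paper Review] A Unified System for Voice Cloning and Voice Conversion through Diffusion Probabilistic Modeling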

Jun 14, 2023 •
Source: Sadekova, Tasnima, et al. “A Unified System for Voice Cloning and Voice Conversion through Diffusion Probabilistic Modeling.” Proc. Interspeech 2022 (2022): 3003-3007.
[Paper Review] Squeezeformer: An efficient transformer for automatic speech recognition.

Jun 12, 2023 •
Source: Kim, Sehoon, et al. “Squeezeformer: An efficient transformer for automatic speech recognition.” arXiv preprint arXiv:2206.00888 (2022).
[Paper Review] Robust disentangled variational speech representation learning for zero-shot voice conversion

Jun 12, 2023 •
Lian, Jiachen, Chunlei Zhang, and Dong Yu. “Robust disentangled variational speech representation learning for zero-shot voice conversion.” ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. Copyright of figures and other materials in the paper belongs to original authors.
[Paper Review] Nvc-net: End-to-end adversarial voice conversion

Jun 12, 2023 •
Source: Nguyen, Bac, and Fabien Cardinaux. “Nvc-net: End-to-end adversarial voice conversion.” ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022.
[Paper Review] FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS.

Jun 12, 2023 •
Source: Kim, Changhwan, et al. “FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS.” Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Vol. 2022. 2022.
[Paper Review] Adversarial Multi-Task Learning for Disentangling Timbre and Pitch in Singing Voice Synthesis

Jun 12, 2023 •
Source
[Paper Review] Contrastive Siamese Network for Semi-Supervised Speech Recognition

Jun 12, 2023 •
Source: Khorram, Soheil, et al. “Contrastive Siamese Network for Semi-Supervised Speech Recognition.” ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. Copyright of figures and other materials in the paper belongs to original authors.
[Paper and Code Review] Conformer-based hybrid ASR system for Switchboard dataset

Jun 4, 2023 •
1 Paper Review