Jisoo's Blog
  • About
  • Years
  • Categories
  • Tags
  • More
    FAQ Docs
강지수 / Jisoo Kang
  • 강지수 / Jisoo Kang
  • ji_soo_o@korea.ac.kr
  • korea
  • [개념정리] 사물 인터넷이란? 4차 산업혁명에 사용되는 기술들은?

    Jun 29, 2023 •

    본 포스트에서는 4차 산업 혁명에 사용되는 기술들에 관하여 서술함. </br>
  • [오류해결] Pytorch, Cuda설치 및 Geometric과 씨름하기

    Jun 24, 2023 •

    기본 설정은 nvidia drivers는 설치된 상태로 간주함. Docker기반이긴 하지만 conda 환경이면 무리없이 적용될 것. (참고)geometric 배포자들은 local환경보다 가상환경을 추천하고 있음. 귀찮은 사람들은 그냥 python=3.7.16 , cuda 11.7으로 세팅해놓고 아래 배포해놓은 requirements.txt 실행 ㄱ docker hub에 공개되어 있는 cuda11.7, ubuntu 20.04인 이미지를 다운받음
  • [논문리뷰] A Unified System for Voice Cloning and Voice Conversion through Diffusion Probabilistic Modeling

    Jun 14, 2023 •

    출처 Sadekova, Tasnima, et al. “A Unified System for Voice Cloning and Voice Conversion through Diffusion Probabilistic Modeling}}.” Proc. Interspeech 2022 (2022): 3003-3007.
  • [논문리뷰] Squeezeformer: An efficient transformer for automatic speech recognition.

    Jun 12, 2023 •

    출처 Kim, Sehoon, et al. “Squeezeformer: An efficient transformer for automatic speech recognition.” arXiv preprint arXiv:2206.00888 (2022).
  • [논문리뷰] Robust disentangled variational speech representation learning for zero-shot voice conversion

    Jun 12, 2023 •

    Lian, Jiachen, Chunlei Zhang, and Dong Yu. “Robust disentangled variational speech representation learning for zero-shot voice conversion.” ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. Copyright of figures and other materials in the paper belongs to original authors.
  • [논문리뷰] Nvc-net: End-to-end adversarial voice conversion

    Jun 12, 2023 •

    출처 Nguyen, Bac, and Fabien Cardinaux. “Nvc-net: End-to-end adversarial voice conversion.” ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022.APA
  • [논문리뷰] FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS.

    Jun 12, 2023 •

    출처 Kim, Changhwan, et al. “FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS.” Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Vol. 2022. 2022.
  • [논문리뷰] Adversarial Multi-Task Learning for Disentangling Timbre and Pitch in Singing Voice Synthesis

    Jun 12, 2023 •

    출처
  • [논문리뷰] Contrastive Siamese Network for Semi-Supervised Speech Recognition

    Jun 12, 2023 •

    출처 Khorram, Soheil, et al. “Contrastive Siamese Network for Semi-Supervised Speech Recognition.” ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022. Copyright of figures and other materials in the paper belongs to original authors.
  • [논문 및 코드 리뷰] Conformer-based hybrid ASR system for Switchboard dataset

    Jun 4, 2023 •

    1 논문 리뷰
  • 2
  • 3
  • 4

Copyright © 2023 - 2025 강지수 / Jisoo Kang; All rights reserved.

Powered by Jekyll & Hamilton

Daily Review blog