2023
Journal Papers
- Phuc Duc Nguyen, Yoshifumi Shiraki, Kenji Ishikawa, Jun Muramatsu, Noboru Harada & Takehiro Moriya (2023). Distribution Matching for Dimming Control in Visible-Light Region-of-Interest Signaling. IEEE Photonics Journal, 15 (1), 1-14.
- Denny Hermawanto, Kenji Ishikawa, Kohei Yatabe & Yasuhiro Oikawa (2023). Determination of Microphone Acoustic Center from Sound Field Projection Measured by Optical Interferometry. The Journal of the Acoustical Society of America, -.
- Shogo Seki, Hirokazu Kameoka, Takuhiro Kaneko & Kou Tanaka (2023). Non-parallel Whisper-to-Normal Speaking Style Conversion Using Auxiliary Classifier Variational Autoencoder. IEEE Access, 11, 44590-44599.
- Samuel A. Verburg, Kenji Ishikawa, Efren Fernandez-Grande & Yasuhiro Oikawa (2023). A Century of Acousto-Optics: From Early Discoveries to Modern Sensing of Sound with Light. Acoustics Today, 19 (3), 54-62.
- Ryosuke Sugiura, Yutaka Kamamoto & Takehiro Moriya (2023). General form of almost instantaneous fixed-to-variable-length codes and optimal code tree construction. IEEE Transactions on Information Theory, 69 (12).
- Kenji Ishikawa, Yoshifumi Shiraki, Takehiro Moriya, Atsushi Ishizawa, Kenichi Hitachi & Katsuya Oguri (2023). Comprehensive Noise Analysis for Acousto-optic Measurement of Airborne Sound. IEEE Transactions on Instrumentation and Measurement, 73 (7000309).
Peer-reviewed Conference Papers
- Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko & Shogo Seki (2023). Distilling sequence-to-sequence voice conversion models for streaming conversion applications. Proc. IEEE Spoken Language Technology Workshop (SLT). Doha, Qatar.
- Shogo Seki, Hirokazu Kameoka, Kou Tanaka & Takuhiro Kaneko (2023). JSV-VC: Jointly Trained Speaker Verification and Voice Conversion Models. Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Rhodes Island, Greece.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2023). Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input. Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Rhodes Island, Greece.
- Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka & Shogo Seki (2023). Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis. Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Rhodes Island, Greece.
- Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka & Shogo Seki (2023). iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN. Proc. Interspeech. Dublin, Ireland.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2023). Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation. Proc. Interspeech. Dublin, Ireland.
- Kou Tanaka, Takuhiro Kaneko, Hirokazu Kameoka & Shogo Seki (2023). CFVC: Conditional Filtering for Controllable Voice Conversion. Proc. Interspeech. Dublin, Ireland.
- Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi & Masahiro Yasuda (2023). First-Shot Anomaly Sound Detection for Machine Condition Monitoring: A Domain Generalization Baseline. Proc. European Signal Processing Conference (EUSIPCO). Helsinki, Finland.
- Shogo Seki, Kanami Imamura, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka & Noboru Harada (2023). W2N-AVSC: Audiovisual Extension for Whisper-to-Normal Speech Conversion. Proc. European Signal Processing Conference (EUSIPCO). Helsinki, Finland.
- Kou Tanaka, Hirokazu Kameoka & Takuhiro Kaneko (2023). PRVAE-VC: Non-Parallel Many-to-Many Voice Conversion with Perturbation-Resistant Variational Autoencoder. Proc. ISCA Speech Synthesis Workshop (SSW). Grenoble, France.
- Boxin Liu, Shiqi Zhang, Daiki Takeuchi, Daisuke Niizumi, Noboru Harada & Shoji Makino (2023). Masked modeling duo vision transformer with multi-layer feature fusion on respiratory sound classification. Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop. Tampere, Finland.
- Chihiro Watanabe & Hirokazu Kameoka (2023). DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion. Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC). Taipei, Taiwan.
- Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Ryo Tanabe, Takashi Endo & Yohei Kawaguchi (2023). Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring. Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop. Tampere, Finland.
- Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada & Kunio Kashino (2023). Similarity-discrepancy disentanglement for audio difference captioning. Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop. Tampere, Finland.
- Noboru Harada, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi & Masahiro Yasuda (2023). ToyADMOS2+: New ToyADMOS Data and Benchmark Results of the First-Shot Anomalous Sound Event Detection Baseline. Proc. Detection and Classification of Acoustic Scenes and Events (DCASE) Workshop. Tampere, Finland.
- Keisuke Takazawa, Hirokazu Kameoka & Masahiro Yukawa (2023). Multiple Sound Source Tracking Based on Generative Modeling and Recursive Bayesian Filtering of Spatial Gradient Spectra. Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC). Taipei, Taiwan.
- Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi & Masahiro Yasuda (2023). First-shot anomaly sound detection for machine condition monitoring: A Domain Generalization baseline. Proc. Asia-Pacific Signal and Information Processing Association (APSIPA) Annual Summit and Conference (ASC). Helsinki, Finland.
- Haruka Nozawa, Mayuko Imanishi, Yasuhiro Oikawa & Kenji Ishikawa (2023). Physical-model-based reconstruction of three-dimensional sound field from multi-directional measurement by parallel phase-shift interferometry. Proc. The Australian Acoustical Society (Acoustics2023). Sydney, Australia.