2022
Journal Papers
- Ken Mano, Hideki Sakurada & Yasuyuki Tsukada (2022). Quality and quantity pair as trust metric. IEICE Transactions on Information and Systems.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2022). Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations. EEE/ACM Transactions on Audio, Speech and Language Processing (TASLP).
- Wangyou Zhang, Xuankai Chang, Christoph Boeddeker, Tomohiro Nakatani, Shinji Watanabe & Yanmin Qian (2022). End-to-end dereverberation, beamforming, and speech recognition in a cocktail party. IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 30, 3173-3188.
- Marc Delcroix, Jorge Bennasar Vázquez, Tsubasa Ochiai, Keisuke Kinoshita, Yasunori Ohishi & Shoko Araki (2022). Soundbeam: target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP).
- Kenji Ishikawa, Kohei Yatabe, Yasuhiro Oikawa, Yoshifumi Shiraki & Takehiro Moriya (2022). Speckle holographic imaging of sound field using fresnel lens. Optics Letters.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2022). BYOL for audio: Exploring pre-trained general-purpose audio representations. IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP).
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2022). Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations. Proceedings of Machine Learning Research (PMLR).
- Li Li, Kohei Yatabe, Hirokazu Kameoka & Shoji Makino (2022). FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP).
- X. Wu, Y. Sun, A. Kimura, and K. Kashino, "Contrast enhancement based on reflectance-oriented probabilistic equalization," Signal Processing, vol. 194, 2022.
Peer-reviewed Conference Papers
- Masato Wakayama (2022). Quantum Interaction and number theory, representation theory - modular forms a bit beyond, infinite symmetric group, Fuchsian ODE. Painlevé Seminar.
- Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada & Kunio Kashino (2022). ConceptBeam: Concept driven target speech extraction. Proc. ACM International Conference on Multimedia(ACMMM). Lisbon, Portugal.
- Seiya Matsuda, Akisato Kimura & Seiichi Uchida (2022). Font generation with missing impression labels. in Proc. International Conference on Pattern Recognition (ICPR). Montreal Quebec, Canada.
- Kana Goto, Tetsuya Ueda, Li Li, Takeshi Yamada & Shoji Makino (2022). Geometrically constrained independent vector analysis with auxiliary function approach and iterative source steering. in Proc. European Signal Processing Conference (EUSIPCO). Belgrade, Serbia.
- Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada & Kunio Kashino (2022). Composing general audio representation by fusing multi-layer features of pre-trained model. in Proc. European Signal Processing Conference (EUSIPCO). Belgrade, Serbia.
- Natsuki Ueno & Hirokazu Kameoka (2022). Multiple sound source localization based on stochastic modeling of spatial gradient spectra. in Proc. European Signal Processing Conference (EUSIPCO). Belgrade, Serbia.
- Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka & Shogo Seki (2022). MISRNet: Lightweight neural vocoder using multi-input single shared residual blocks. in Proc. Interspeech. Incheon, Korea.
- Hirokazu Kameoka, Takuhiro Kaneko, Shogo Seki & Kou Tanaka (2022). CAUSE: Crossmodal action unit sequence estimation from speech. in Proc. Interspeech. Incheon, Korea.
- Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada & Kunio Kashino (2022). Introducing auxiliary text query-modifier to content-based audio retrieval. in Proc. Interspeech. Incheon, Korea.
- Takashi Shibata, Masatoshi Okutomi & Masayuki Tanaka (2022). Robustizing object detection networks using augmented feature pooling. in Proc. Asian Conference on Computer Vision (ACCV). Macau SAR, China.
- Yu Moriyasu, Takashi Shibata, Masayuki Tanaka & Masatoshi Okutomi (2022). Top-K ensemble for semantic segmentation robust against unexpected degradation. Proc. IEEE International Conference on Consumer Electronics(ICCE). Bordeaux,France.
- Yasuhiro Fujiwara, Masahiro Nakano, Atsutoshi Kumagai, Yasutoshi Ida, Akisato Kimura & Naonori Ueda (2022). Fast binary network hashing via graph clustering. Proc. IEEE BigData. Osaka, Japan.
- Denny Hermawanto, Kenji Ishikawa, Kohei Yatabe & Yasuhiro Oikawa (2022). Visualization of microphone's acoustic center using phase-shifting interferometry. Proc. International Congress on Acoustics (ICA). Gyeongju,Korea.
- M. Nakano, R. Nishikimi, Y. Fujiwara, A. Kimura, T. Yamada, and N. Ueda, "Nonparametric relational models with superrectangulation," in Proc. International Conference on Artificial Intelligence and Statistics (AISTATS), 2022.
- G. Irie, T. Shibata, and A. Kimura, "Co-attention-guided bilinear model for echo-based depth estimation," in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- T. Kaneko, K. Tanaka, H. Kameoka, and S. Seki, "Fastening and lightening convolutional mel-spectrogram vocoder using inverse short-time fourier transform," in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- S. Seki, H. Kameoka, and L. Li, "Exploring and improving multichannel variational autoencoder for underdetermined source separation," in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- L. Li, H. Kameoka, and S. Seki, "HBP: An efficient block permutation solver using hungarian algorithm and spectrogram inpainting for multichannel audio source separation," in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- H. Kameoka, S. Seki, L. Li, and C. Watanabe, "AttentionPIT: Soft permutation invariant training for audio source separation with attention mechanism," in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- T. Kaneko, "AR-NeRF: Unsupervised learning of depth and defocus effects from natural images with aperture rendering neural radiance fields," in Proc. Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- S. Yoneda, G. Irie, T. Shibata, M. Nishiyama, and I. Yoshio, "Deep segmentation network without mask image supervision for 2D image registration," in Proc. International Workshop on Frontiers of Computer Vision (IW-FCV), 2022.
- M. Ueda, A. Kimura, and S. Uchida, "Font shape-to-impression translation," in Proc. International Workshop on Document Analysis Systems (DAS), 2022.
- C. Kabore, M. Tsuchida, I. Suzuki, S. Sugaya, A. Kimura, and N. Harada, "Prototyping of low-cost color enhancement lighting using multicolor LEDs," in Proc. International Symposium on Electronic Imaging (EI), 2022.