Publications (2012/01-2012/12)
20032004200520062007200820092010201120122013201420152016201720182019202020212022
Journal Papers
- Online speaker clustering using incremental learning of an ergodic hidden Markov model
- IEICE TRANS. INF. & SYST, Vol. E95-D, No. 10, pp. 2469-2478, Oct., 2012
- Active Learning Using Phone-Error Distribution for Speech Modeling
- IEICE TRANS. INF. & SYST, Vol. E95-D, No. 10, pp. 2486-2494, Oct., 2012
- A Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors
- IEEE Transactions on Multimedia, vol. 14, Issue: 4 Part 2, pp. 1196-1205, Aug., 2012
- Robust Gait-Based Person Identification against Walking Speed Variations
- IEICE Trans. Inf. & Syst, Vol. E95-D, No. 2, pp. 668-676, Feb. 1, 2012
Conference Proceedings (peer reviewed)
- Acoustic Model Training Using Committee-Based Active and Semi-Supervised Learning for Speech Recognition
- APSIPA ASC 2012, Dec. 4, 2012
- Efficient model training for HMM-based person identification by gait
- Proceedings of 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, Dec., 2012
- q-Gaussian Mixture Models Based on Non-Extensive Statistics for Image And Video Semantic Indexing
- ACCV2012, Nov. 5, 2012
- MULTIMEDIA EVENT DETECTION USING GMM SUPERVECTORS AND SVMS
- ICIP 2012, pp. 3089-3092, Oct. 3, 2012
- Overlapped Speech Detection in Meeting Using Cross-Channel Spectral Subtraction and Spectrum Similarity
- InterSpeech2012, Sep. 12, 2012
- Q-Gaussian based spectral subtraction for robust speech recognition
- InterSpeech2012, Sep. 11, 2012
- Non-extensive Statistics for Feature Normalization in Speech Recognition
- Proc. International Workshop on Statistical Machine Learning for Speech Processing (IWSML) 2012, Mar., 2012
Conference Proceedings (non-refereed)
- Tokyo Tech Speaker Recognition
- NIST SRE 2012, Dec. 11, 2012
- TokyoTechCanon at TRECVID 2012
- TRECVID 2012, Nov. 26, 2012
Domestic Conference Proceedings
- Video Semantic Indexing Using GMM-Supervectors
- Greater Tokyo Area Multimedia/Vision Workshop, Aug. 30, 2012
- A video watermarking method to objects robust against various attacks
- IEICE Technical Report, Vol. 112, No. 190, pp. 43-48, Aug. 27, 2012
- Multimodal Interface for Error Correction in Speech Recognition
- Microsoft Research Asia IJARC CORE7 Project Summary Booklet, pp. 15-16, Jun. 29, 2012
- Speaker Adaptation for Dialog Act Recognition
- 2012 Spring Meeting ASJ, p. 111, Mar. 21, 2012
- MAP Adaptation Using Multiple Priors for Speaker Verication
- 2012 Spring Meeting ASJ, pp. 79-82, Mar. 19, 2012
- A Compensation Technique Using q-Logarithm for Noisy Speech Recognition
- 2012 Spring Meeting ASJ, pp. 19-20, Mar. 19, 2012
- Spectral Subtraction Based on q-Gaussian Assumption for Noise Robust Speech Recognition
- 2012 Spring Meeting ASJ, pp. 21-22, Mar. 19, 2012
- Recognition of Indonesian Code-Switching Speech
- 2012 Spring Meeting ASJ, pp. 75-76, Mar., 2012
- Language Model for Efficient Error Correction in Speech Recognition
- 2012 Spring Meeting ASJ, pp. 89-90, Mar., 2012
- Subject adaptation and adaptive training for gait-based person identification
- IEICE Technical Report, No. PRMU2011-199, pp. 77-82, Feb., 2012
- Two-pass approach for recognizing code-switching speech
- IEICE Technical Report, No. SP2011-150, pp. 225-229, Feb., 2012
Keynote Talks
- Speech Technology Plays a Key Role in Video Semantic Indexing
- First International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) at ACM Multimedia 2012, pp. 1-2, Oct. 29, 2012
Invited Talks & Tutorials
- Mobile or Cloud-based Photo/Video Analytics?
- Greater Tokyo Area Multimedia/Vision Workshop, Aug. 30, 2012