発表文献 (2012/01-2012/12)
20032004200520062007200820092010201120122013201420152016201720182019202020212022
論文(査読あり)
- Online speaker clustering using incremental learning of an ergodic hidden Markov model
- IEICE TRANS. INF. & SYST, Vol. E95-D, No. 10, pp. 2469-2478, Oct., 2012
- Active Learning Using Phone-Error Distribution for Speech Modeling
- IEICE TRANS. INF. & SYST, Vol. E95-D, No. 10, pp. 2486-2494, Oct., 2012
- A Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors
- IEEE Transactions on Multimedia, vol. 14, Issue: 4 Part 2, pp. 1196-1205, Aug., 2012
- 音声認識におけるモデル間スケーリング係数の自動推定
- 電子情報通信学会論文誌, Vol. J95-D, No. 5, pp. 1276-1285, May 1, 2012
- Robust Gait-Based Person Identification against Walking Speed Variations
- IEICE Trans. Inf. & Syst, Vol. E95-D, No. 2, pp. 668-676, Feb. 1, 2012
国際会議(査読あり)
- Acoustic Model Training Using Committee-Based Active and Semi-Supervised Learning for Speech Recognition
- APSIPA ASC 2012, Dec. 4, 2012
- Efficient model training for HMM-based person identification by gait
- Proceedings of 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, Dec., 2012
- q-Gaussian Mixture Models Based on Non-Extensive Statistics for Image And Video Semantic Indexing
- ACCV2012, Nov. 5, 2012
- MULTIMEDIA EVENT DETECTION USING GMM SUPERVECTORS AND SVMS
- ICIP 2012, pp. 3089-3092, Oct. 3, 2012
- Overlapped Speech Detection in Meeting Using Cross-Channel Spectral Subtraction and Spectrum Similarity
- InterSpeech2012, Sep. 12, 2012
- Q-Gaussian based spectral subtraction for robust speech recognition
- InterSpeech2012, Sep. 11, 2012
- Non-extensive Statistics for Feature Normalization in Speech Recognition
- Proc. International Workshop on Statistical Machine Learning for Speech Processing (IWSML) 2012, Mar., 2012
国際会議(査読なし)
- Tokyo Tech Speaker Recognition
- NIST SRE 2012, Dec. 11, 2012
- TokyoTechCanon at TRECVID 2012
- TRECVID 2012, Nov. 26, 2012
国内会議(査読なし)
- ディープラーニングを用いた日本語大語彙話し言葉音声認識
- 日本音響学会2012年秋季研究発表会講演論文集, Sep. 20, 2012
- 映像のセマンティックインデクシングのためのq-混合ガウス分布
- 信学技報, Vol. 112, No. 197, pp. 31-36, Sep. 2, 2012
- Video Semantic Indexing Using GMM-Supervectors
- Greater Tokyo Area Multimedia/Vision Workshop, Aug. 30, 2012
- A video watermarking method to objects robust against various attacks
- IEICE Technical Report, Vol. 112, No. 190, pp. 43-48, Aug. 27, 2012
- 複数ピンマイクで収音された会議音声の重畳区間検出
- 情報処理学会研究報告, Vol. 2012-SLP-92, No. 6, Jul. 20, 2012
- クラウド時代の新しい音声研究パラダイム
- 情報処理学会研究報告, Vol. 2012-SLP-92, No. 4, Jul. 19, 2012
- Multimodal Interface for Error Correction in Speech Recognition
- Microsoft Research Asia IJARC CORE7 Project Summary Booklet, pp. 15-16, Jun. 29, 2012
- GMM-Supervectorを用いた映像の高速セマンティック検索システム
- 第18回画像センシングシンポジウム講演論文集, DS2-08, Jun. 11, 2012
- Speaker Adaptation for Dialog Act Recognition
- 2012 Spring Meeting ASJ, p. 111, Mar. 21, 2012
- コミッティに基づく能動学習・半教師付き学習を用いた音声モデル
- 日本音響学会2012年春季研究発表会講演論文集, pp. 55-56, Mar. 21, 2012
- 相互スペクトル減算と振幅スペクトル相関を用いた 会議音声の重畳区間検出
- 日本音響学会2012年春季研究発表会講演論文集, pp. 13-14, Mar. 21, 2012
- MAP Adaptation Using Multiple Priors for Speaker Verication
- 2012 Spring Meeting ASJ, pp. 79-82, Mar. 19, 2012
- A Compensation Technique Using q-Logarithm for Noisy Speech Recognition
- 2012 Spring Meeting ASJ, pp. 19-20, Mar. 19, 2012
- Spectral Subtraction Based on q-Gaussian Assumption for Noise Robust Speech Recognition
- 2012 Spring Meeting ASJ, pp. 21-22, Mar. 19, 2012
- Recognition of Indonesian Code-Switching Speech
- 2012 Spring Meeting ASJ, pp. 75-76, Mar., 2012
- Language Model for Efficient Error Correction in Speech Recognition
- 2012 Spring Meeting ASJ, pp. 89-90, Mar., 2012
- 手話素単位を用いた大語彙手話認識
- 電子情報通信学会技術研究報告, No. PRMU2011-222, pp. 155-160, Feb. 9, 2012
- GMM-SupervectorとSVMを用いた映像からのイベント検出
- 電子情報通信学会技術研究報告, No. PRMU2011-230, pp. 195-200, Feb. 2, 2012
- Subject adaptation and adaptive training for gait-based person identification
- IEICE Technical Report, No. PRMU2011-199, pp. 77-82, Feb., 2012
- Two-pass approach for recognizing code-switching speech
- IEICE Technical Report, No. SP2011-150, pp. 225-229, Feb., 2012
- 固定監視カメラからの人混み中の行動イベント検出
- 電子情報通信学会技術研究報告, No. PRMU2011-173, pp. 257-262, Jan. 19, 2012
基調講演
- Speech Technology Plays a Key Role in Video Semantic Indexing
- First International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) at ACM Multimedia 2012, pp. 1-2, Oct. 29, 2012
解説・総説
- 映像検索技術の新たな潮流
- 電子情報通信学会誌, Vol. 95, No. 10, pp. 932-938, Oct., 2012
- 音声認識における転移学習:話者適応
- 人工知能学会誌, vol. 27, no. 4, pp. 359-364, Jul. 1, 2012
招待講演・チュートリアル
- コミュニケーションとしての映像とその検索
- 第15回情報理論的学習理論ワークショップ(IBIS2012), Nov. 7, 2012
- 映像検索技術の最新動向
- 産業計測第36委員会研究会, Oct. 25, 2012
- Mobile or Cloud-based Photo/Video Analytics?
- Greater Tokyo Area Multimedia/Vision Workshop, Aug. 30, 2012
- 映像検索技術の最前線
- 第18回画像センシングシンポジウム講演論文集, OS3-02-1-4, Jun. 11, 2012