Publications (2014/01-2014/12)
Conference Proceedings (peer reviewed)
- Speaker Adaptation of Deep Neural Networks Using a Hierarchy of Output Layers
- Proc. Spoken Language Technology (SLT) Workshop, pp. 153-158, Dec. 7, 2014
- An Efficient Error Correction Interface for Speech Recognition on Mobile Touchscreen Devices
- Proc. Spoken Language Technology (SLT) Workshop, pp. 454-459, Dec. 7, 2014
- n-Gram Models for Video Semantic Indexing
- Proc. ACM Multimedia (MM), pp. 777-780, Nov. 3, 2014
- Spectral Graph Skeletons for 3D Action Recognition
- Proc. Asian Conference on Computer Vision (ACCV), pp. 1-16, Nov. 1, 2014
- Simple Gesture-based Error Correction Interface for Smartphone Speech Recognition
- Proc. Interspeech, pp. 1194-1198, Sep. 16, 2014
- Discriminative PLDA training with application-specific loss functions for speaker verification
- Proc. Odyssey Workshop, pp. 26-32, Jun. 16, 2014
- i-Vector Selection for Effective PLDA Modeling in Speaker Recognition
- Proc. Odyssey Workshop, pp. 100-105, Jun. 16, 2014
- Constrained Discriminative PLDA Training for Speaker Verification
- Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1689-1693, May 4, 2014
- Event Detection by Velocity Pyramid
- Proc. Multimedia Modeling (MMM), pp. 353-364, Jan. 6, 2014
Conference Proceedings (non-refereed)
- TokyoTech-Waseda at TRECVID 2014
- Proc. TRECVID Workshop, pp. 1-13, Nov. 9, 2014
Domestic Conference Proceedings
- Error Correction Using Long Context Match for Smartphone Speech Recognition
- Technical Reports of IPSJ SLP, vol. 104, no. 22, pp. 1-6, Dec. 16, 2014
- An Efficient Error Correction Method for Smartphone Speech Recognition
- Proc. ASJ 2014 Autumn Meeting, pp. 29-30, Sep. 5, 2014
- Collection and analysis of multi-party interaction data for automatic boredom recognition
- Proc. The 28th Annual Conference of the Japanese Society for Artificial Intelligence (JSAI) 2014, pp. 1-4, May 13, 2014
- Velocity Pyramid for Event Detection
- Technical Reports of IEICE PRMU, vol. 113, no. 493, pp. 13-18, Mar. 13, 2014
- Discriminatively Trained PLDA with Partially Preserved Model Assumptions in Speaker Verification
- Proc. ASJ 2014 Spring Meeting, pp. 99-100, Mar. 12, 2014
- Training Multiple PLDA Models by Clustered I-Vectors for Speaker Verification
- Proc. ASJ 2014 Spring Meeting, pp. 97-98, Mar. 12, 2014
- Robust 0-1 Loss Training for PLDA in Speaker Verification
- Proc. ASJ 2014 Spring Meeting, pp. 101-102, Mar. 12, 2014
Invited Talks & Tutorials
- Robust Video Information Retrieval using Speech Technologies
- Language Technologies Institute, Carnegie Mellon University, Jun. 20, 2014
- Video Semantic Indexing Using Speech Technologies
- Dublin City University, Jan. 6, 2014
Selected Talks
- Semantics for Large-Scale Multimedia: New Challenges for NLP
- ACL 2014, Jun. 22, 2014