Machine Learning for Multimodal Interaction : First International Workshop, MLMI 2004, Martigny, Switzerland, June 21-23, 2004, Revised Selected Papers / edited by Samy Bengio, Hervé Bourlard
(Information Systems and Applications, incl. Internet/Web, and HCI. ISSN:29461642 ; 3361)
Material type | E-book |
---|---|
Edition | 1st ed. 2005. |
Publisher | (Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer) |
Publication year | 2005 |
Extent | XII, 362 p : online resource |
Author headings | Bengio, Samy, editor; Bourlard, Hervé, editor; SpringerLink (Online service) |
General note | MLMI 2004 -- Accessing Multimodal Meeting Data: Systems, Problems and Possibilities -- Browsing Recorded Meetings with Ferret -- Meeting Modelling in the Context of Multimodal Research -- Artificial Companions -- Zakim – A Multimodal Software System for Large-Scale Teleconferencing -- Towards Computer Understanding of Human Interactions -- Multistream Dynamic Bayesian Network for Meeting Segmentation -- Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives -- An Integrated Framework for the Management of Video Collection -- The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing -- S-SEER: Selective Perception in a Multimodal Office Activity Recognition System -- Mapping from Speech to Images Using Continuous State Space Models -- An Online Algorithm for Hierarchical Phoneme Classification -- Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks -- Mixture of SVMs for Face Class Modeling -- AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking -- The 2004 ICSI-SRI-UW Meeting Recognition System -- On the Adequacy of Baseform Pronunciations and Pronunciation Variants -- Tandem Connectionist Feature Extraction for Conversational Speech Recognition -- Long-Term Temporal Features for Conversational Speech Recognition -- Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation -- Speech Transcription and Spoken Document Retrieval in Finnish -- A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System -- Shallow Dialogue Processing Using Machine Learning Algorithms (or Not) -- ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings -- Piecing Together the Emotion Jigsaw -- Emotion Analysis in Man-Machine Interaction Systems -- A Hierarchical System for Recognition, Tracking and Pose Estimation -- Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques -- A Shape Based, Viewpoint Invariant Local Descriptor. URL: https://doi.org/10.1007/b105752 |
---|---|
Subjects | LCSH: User interfaces (Computer systems); LCSH: Human-computer interaction; LCSH: Artificial intelligence; LCSH: Natural language processing (Computer science); LCSH: Computers and civilization; LCSH: Computer vision; FREE: User Interfaces and Human Computer Interaction; FREE: Artificial Intelligence; FREE: Natural Language Processing (NLP); FREE: Computers and Society; FREE: Computer Vision |
Classification | LCC: QA76.9.U83; LCC: QA76.9.H85; DC23: 005.437; DC23: 004.019 |
Bibliographic ID | EB00003157 |
ISBN | 9783540305682 |