
Profile

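Admissions majors: 081104 Pattern Recognition and Intelligent Systems; 081203 Computer Application Technology

Admissions research directions: brain-inspired cognitive computing models; swarm intelligence and adversarial games; speech recognition and synthesis; natural language processing and applications

Education
1990-09 to 1997-06: Institute of Automation, Chinese Academy of Sciences (master's and doctoral studies)
1984-09 to 1988-06: Zhejiang University (undergraduate)

Highest qualification: doctoral graduate, Institute of Automation, Chinese Academy of Sciences

Degrees
June 1988: bachelor's degree, Department of Electrical Engineering, Zhejiang University
April 1992: master's degree, Institute of Automation, Chinese Academy of Sciences
July 1997: doctoral degree, Institute of Automation, Chinese Academy of Sciences

Work experience

Xu Bo (徐波), PhD, is a research professor. He is director of the Institute of Automation, Chinese Academy of Sciences (CAS), dean of the CAS Artificial Intelligence Innovation Research Institute, and dean of the School of Artificial Intelligence at the University of Chinese Academy of Sciences. He also serves as a member of the National New Generation Artificial Intelligence Strategic Advisory Committee, deputy director of the CAS Center for Excellence in Brain Science and Intelligence Technology, member of the expert group of the Beijing Brain Science special program, and vice chairman of the Artificial Intelligence Industry Alliance of China. He has long worked on research and applications in intelligent speech processing and artificial intelligence.

From 1998 to 2003 he was deputy director of the National Laboratory of Pattern Recognition. From 2001 to 2006 he chaired the Chinese Spoken Language Processing branch of the international spoken language processing society (国际口语信息处理学会). He has been vice president of the Chinese Information Processing Society of China since 2006. From July 2004 to July 2011 he was a member of the information technology expert group of the National 863 Program. In 2008 he founded the China-Singapore Institute of Digital Media in Singapore and served as its director, conducting research on human cross-lingual communication technology. In 2018 he took up his posts on the National New Generation Artificial Intelligence Strategic Advisory Committee, at the Artificial Intelligence Industry Alliance of China, at the School of Artificial Intelligence of the University of Chinese Academy of Sciences, and at the CAS Center for Excellence in Brain Science and Intelligence Technology.

For innovative work on Chinese acoustic modeling and recognition, large-scale spoken pronunciation assessment, and media content recognition and monitoring, he has received the first prize for outstanding paper in international Chinese spoken language processing, the CAS Outstanding Youth Award, and the Wang Xuan News Science and Technology Progress First Prize, among other awards. He has supervised the publication of more than 200 scientific papers, filed more than 40 invention patents, registered more than 10 software copyrights, and led the completion of one national standard. He has led multiple projects under the National Science and Technology Support Program, the 863 and 973 Programs, and key projects of the National Natural Science Foundation of China, and has transferred several technologies to practice, including spoken language assessment, speech recognition, and translation. Since 2010, building on work in spoken dialogue translation and robot intelligence, he has pursued deep cognitive computing and brain-inspired cognitive computing, developing brain-inspired intelligent computing models from the perspectives of basic cognitive units and task-driven coordination across multiple brain regions.

His main research directions in recent years are brain-inspired cognitive computing, speech recognition and synthesis, natural language processing and applications, and swarm intelligence and adversarial games. Artificial intelligence has been elevated to a national strategy and is regarded as an important engine of the fourth industrial revolution. Vision, hearing, and language are basic human abilities; they are also the jewel in the crown of artificial intelligence and extremely challenging. Studying audiovisual perception and language cognition through brain-inspired mechanisms is of great value for solving most problems in artificial intelligence. The major scientific and engineering questions he aims to address are:
how to draw on the working mechanisms of the brain's spiking neural networks to explore a mechanism and paradigm of intelligence that unifies neural dynamics, machine learning, and game theory;
how to draw on the brain's auditory processing mechanisms so that speech perception in extremely noisy environments matches or exceeds human ability (the cocktail party problem);
how to draw on human cognitive mechanisms to study applications such as game-theoretic methods for smart healthcare.
The technologies expected to emerge from this work include speech front-end systems with super-human hearing, medical technology that pools the expertise of top physicians, and a new generation of artificial intelligence in which humans and machines advance together.

Courses taught

Introduction to Brain-Inspired Intelligence (类脑智能导论)

Research achievements

Over the past five years he has supervised the publication of 76 scientific papers and has 22 invention patents filed or granted, 4 of them international. He currently leads brain-inspired intelligence projects including the CAS Strategic Priority Research Program (Category B) project "Brain Functional Connectome and Brain-Inspired Intelligence Research" and the Beijing Brain Science special project "Computational Models of Brain Cognitive Function".

His awards and honors include the first prize for outstanding paper in international Chinese spoken language processing, the CAS Outstanding Youth Award, the Wang Xuan News Science and Technology Progress First Prize, the government special allowance, selection as a national-level candidate of the New Century Hundred-Thousand-Ten Thousand Talents Project, and the CIUR China Industry-University-Research Cooperation Innovation Award.

In addition to the positions listed under work experience, he currently heads the New Generation Artificial Intelligence Implementation Expert Group and is one of the outstanding experts and scholars of the Media Convergence Development Talent Program of the National Radio and Television Administration.

Collaborations

He established the China-Singapore Institute of Digital Media in Singapore, the first overseas research and innovation unit of the Chinese Academy of Sciences, and serves as its Chinese-side director, working on human multimodal communication technology. He has built long-term cooperation with national-level research institutions in Asia, such as ATR in Japan, on translation between Asian languages. He has long carried out joint projects with well-known companies and research institutions in China and abroad to transfer technology into practice, and maintains long-term, stable partnerships with the China research centers of foreign companies including Panasonic and Nokia.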

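Research Areas

Speech recognition and synthesis; natural language processing; brain-inspired cognitive computing models; game-theoretic intelligence, among others.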

Recent Papers


(1) Speaker-Conditional Chain Model for Speech Separation and Extraction, In Proceedings of the 21st Annual Conference of the International Speech Communication Association (INTERSPEECH2020, CCF-C), 2020, 5th author
(2) A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments, In Proceedings of the 21st Annual Conference of the International Speech Communication Association (INTERSPEECH2020, CCF-C), 2020, corresponding author
(3) LISNN: Improving Spiking Neural Networks with Lateral Interactions for Robust Object Recognition, In Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI2020, CCF-A), 2020, corresponding author
(4) DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog, In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI2020, CCF-A), 2020, 5th author
(5) CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition, International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, 2nd author
(6) LOW-FREQUENCY GUIDED SELF-SUPERVISED LEARNING FOR HIGH-FIDELITY 3D FACE RECONSTRUCTION IN THE WILD, IEEE International Conference on Multimedia and Expo (ICME2020), 2020, 3rd author
(7) A Biologically Plausible Supervised Learning Method for Spiking Neural Networks Using the Symmetric STDP Rule, Neural Networks, 2019, 4th author
(8) Modelling Speaker-dependent Auditory Attention Using A Spiking Neural Network with Temporal Coding and Supervised Learning, ICONIP2019, 2019, 3rd author
(9) Effectively Training Neural Machine Translation with Monolingual Data, NeuroComputing, 2019, 4th author
(10) The World in My Mind: Visual Dialog with Adversarial Multi-modal Feature Encoding, NAACL2019, 2019, 3rd author
(11) A Unified Multi-output Semi-supervised Network for 3D Face Reconstruction, International Joint Conference on Neural Networks (IJCNN), 2019, 4th author
(12) Research Advances and Perspectives on the Cocktail Party Problem and Related Auditory Models, Acta Automatica Sinica, 2019, 4th author
(13) EFFICIENT AND ACCURATE FACE SHAPE RECONSTRUCTION BY FUSION OF MULTIPLE LANDMARK DATABASES, International Conference on Image Processing (ICIP), 2019, 4th author
(14) Adapting Translation Models for Transcript Disfluency Detection, AAAI2019, 2019, 6th author
(15) Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks, Neural Networks, 2018, 2nd author
(16) Distant Supervision for Relation Extraction with Hierarchical Selective Attention, Neural Networks, 2018, 2nd author
(17) Cascaded Mutual Modulation for Visual Reasoning, In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP2018), 2018, 2nd author
(18) Learning to Activate Logic Rules for Textual Reasoning, Neural Networks, 2018, 2nd author
(19) Improving Speech Separation with Adversarial Network and Reinforcement Learning, In Proceedings of the 30th International Joint Conference on Neural Networks (IJCNN2018), 2018, 2nd author
(20) Distilled Binary Neural Network for Monaural Speech Separation, In Proceedings of the 30th International Joint Conference on Neural Networks (IJCNN2018), 2018, 2nd author
(21) Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment, AAAI 2018, 2018, 2nd author
(22) A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese, ICONIP2018, 2018, 4th author
(23) Generative Adversarial Training in Neural Machine Translation, NeuroComputing, 2018, 4th author
(24) SPEECH-TRANSFORMER: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition, International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, 3rd author
(25) Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin, Interspeech2018, 2018, 4th author
(26) Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets, North American Chapter of the Association for Computational Linguistics (NAACL), 2018, 1st author
(27) Syllable-Based Acoustic Modeling with CTC for Multi-Scenarios Mandarin speech recognition, IJCNN, 2018, 1st author
(28) Unsupervised Neural Machine Translation with Weight Sharing, The Association for Computational Linguistics (ACL), 2018, 1st author
(29) Self-Attention Based Network for Punctuation Restoration, International Conference on Pattern Recognition (ICPR), 2018, 1st author
(30) Unsupervised Domain Adaptation for Neural Machine Translation, International Conference on Pattern Recognition (ICPR), 2018, 1st author
(31) A Cascaded Framework For Model-Based 3D Face Reconstruction, International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), 2018, 1st author
(32) CBLDNN-BASED SPEAKER-INDEPENDENT SPEECH SEPARATION VIA GENERATIVE ADVERSARIAL TRAINING, International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, 1st author
(33) Recurrent Neural Network Based Small-footprint Wake-up-word Speech Recognition System with a Score Calibration Method, International Conference on Pattern Recognition (ICPR), 2018, 1st author
(34) Compression of Acoustic Model via Knowledge Distillation and Pruning, International Conference on Pattern Recognition (ICPR), 2018, 1st author
(35) Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation, IJCAI2018, 2018, 1st author
(36) Paraphrase Recognition via Combination of Neural Classifier and Keywords, IJCNN 2018: International Joint Conference on Neural Networks, 2018, 1st author
(37) Hierarchical Tree Long Short-Term Memory for Sentence Representations, IJCNN 2018: International Joint Conference on Neural Networks, 2018, 1st author
(38) Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme, ACL 2017, 2017, 2nd author
(39) Multilingual Recurrent Neural Networks with Residual Learning for Low-Resource Speech Recognition, InterSpeech 2017, 2017, 2nd author
(40) Self-Taught convolutional neural networks for short text clustering, Neural Networks, 2017, 2nd author
(41) Towards Compact and Fast Neural Machine Translation Using a Combined Method, EMNLP 2017, 2017, 2nd author
(42) End-to-End Chinese Image Text Recognition with Attention Model, ICONIP 2017, 2017, 2nd author
(43) Hierarchical Hybrid Attention Networks for Chinese Conversation Topic Classification, The International Conference On Neural Information Processing (ICONIP 2017), 2017, 2nd author
(44) Constructing a Chinese Conversation Corpus for Sentiment Analysis, The Natural Language Processing and Chinese Computing (NLPCC 2017), 2017, 2nd author
(45) Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation, The Eighth International Joint Conference on Natural Language Processing (IJCNLP 2017), 2017, 2nd author
(46) Named Entity Recognition with Gated Convolutional Neural Networks, Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data (CCL 2017, NLP-NABD 2017), 2017, 2nd author
(47) COMBINING UNIDIRECTIONAL LONG SHORT-TERM MEMORY WITH CONVOLUTIONAL OUTPUT LAYER FOR HIGH-PERFORMANCE SPEECH SYNTHESIS, International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, 2nd author
(48) Hybrid Attention Networks for Chinese Short Text, International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2017), a special issue of the journal Computación y Sistemas, 2017, 4th author
(49) Encoder-decoder recurrent network model for interactive character animation generation, COMPUTER GRAPHICS INTERNATIONAL 2017 (CGI’17), 2017, 3rd author
(50) A Class-specific Copy Network for Handling the Rare Word Problem in Neural Machine Translation, The International Joint Conference on Neural Networks (IJCNN), 2017, 6th author
(51) Multi-Sense Based Neural Machine Translation, The International Joint Conference on Neural Networks (IJCNN), 2017, 4th author
(52) Chinese Image Text Recognition with BLSTM-CTC: A Segmentation-free Method, Chinese Conference on Pattern Recognition, CCPR, 2016, 4th author
(53) Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling, The 26th International Conference on Computational Linguistics (CoLing2016), 2016, 6th author
(54) Hierarchical Memory Networks for Answer Selection on Unknown Words, The 26th International Conference on Computational Linguistics (CoLing2016), 2016, 5th author
(55) Ensemble of Feature Sets and Classification Methods for Stance Detection, The 5th Conference on Natural Language Processing and Chinese Computing (NLPCC2016), 2016, 5th author
(56) A Neural Network Framework for Relation Extraction, Knowledge-Based Systems (KBS), 2016, 6th author
(57) A Character-Aware Encoder for Neural Machine Translation, The 26th International Conference on Computational Linguistics (CoLing2016), 2016, 4th author
(58) Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification, the 54th Annual Meeting of the Association for Computational Linguistics (ACL2016, short paper), 2016, 7th author
(59) Stable-time Prediction during Incremental Speech Recognition, 2016 IEEE International Conference of Online Analysis and Computing Science (ICOACS 2016), 2016, 2nd author
(60) GATING RECURRENT MIXTURE DENSITY NETWORKS FOR ACOUSTIC MODELING IN, International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016, 3rd author
(61) Joint Learning of Entity Semantics and Relation Pattern for Relation Extraction, The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECML/PKDD), 2016, corresponding author
(62) Multidimensional Residual Learning Based on Recurrent Neural Networks for Acoustic Modeling, Interspeech2016, 2016, 3rd author
(63) End-to-end Language Identification using Attention-based Recurrent Neural Networks, Interspeech2016, 2016, corresponding author
(64) Gating Recurrent Enhanced Memory Neural Networks on Language Identification, Interspeech2016, 2016, corresponding author
(65) First Step Towards End-to-end Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention, Interspeech2016, 2016, 3rd author
(66) Automatic Variable-Timing Animation Transition Based on Hierarchical Interpolation Method, the 10th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (GRAPP2015), 2015, 4th author
(67) Multilingual Tandem Bottleneck Feature For Language Identification, Interspeech 2015, 2015, 4th author
(68) Towards End-to-End Speech Recognition for Chinese Mandarin Using Long Short-Term Memory Recurrent Neural Networks, Interspeech 2015, 2015, 4th author
(69) Dialogue Management based on Sentence Clustering, ACL-2015, the 53rd Annual Meeting of the Association for Computational Linguistics, 2015, 2nd author
(70) Image Character Recognition Using Deep Convolutional Neural Network learned from different languages, International Conference on Image Processing (ICIP), 2014, 4th author
(71) Chinese image text recognition on grayscale pixels, 2014 International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014, 4th author
(72) Video to Article Hyperlinking by Multiple Tag Property Exploration, MMM2014, 2014, corresponding author
(73) Spatial Similarity Measure of Visual Phrases for Image Retrieval, MMM2014, 2014, 3rd author
(74) Learning New Semi-Supervised Deep Auto-encoder Features for Statistical Machine Translation, The 52nd Annual Meeting of the Association for Computational Linguistics, 2014, 3rd author
(75) A Novel Noise-Robust ASR Method by Applying Partially Connected DNN Model and Mixed-Bandwidth Concept, The 2013 2nd International Symposium on Computer, Communication, Control and Automation (3CA 2013), 2014, 4th author
(76) Parallel Recursive Deep Model for Sentiment Analysis, THE 19TH PACIFIC-ASIA CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 2nd author
(77) Structured Vectors for Chinese Word Representation, The 3RD INTERNATIONAL CONFERENCE ON INFORMATION and INTELLIGENT COMPUTING (ICCIC), 2014, 2nd author
(78) Labeling Sequential Data Based on Word Representations and Conditional Random Fields, The 3RD INTERNATIONAL CONFERENCE ON INFORMATION and INTELLIGENT COMPUTING (ICCIC), 2014, 2nd author
(79) Exploring One Pass Learning For Deep Neural Network Training With Averaged Stochastic Gradient Descent, 2014 International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014, 3rd author
(80) Variational Bayes Based I-vector for Speaker Diarization of Telephone Conversations, 2014 International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014, 4th author
(81) An Investigation of summed-channel speaker recognition with multi-session enrollment, 2014 International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014, 4th author
(82) Recursive Neural Network based Word Topology Model for Hierarchical Phrase-based Speech Translation, 2014 International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014, 4th author
(83) Improving Wideband Acoustic Models, Interspeech, 2014, 2nd author
(84) Investigation of Cross-lingual Bottleneck, Interspeech, 2014, 3rd author
(85) Investigation of Stochastic Hessian-Free Optimization In Deep Neural Networks For Speech Recognition, International Symposium on Chinese Spoken Language Processing (ISCSLP), 2014, 2nd author
(86) Improving Training Time of Deep Neural Network With Asynchronous Averaged Stochastic Gradient Descent, International Symposium on Chinese Spoken Language Processing (ISCSLP), 2014, 2nd author
(87) An iVector Extractor Using Pre-trained Neural Networks for Speaker Verification, International Symposium on Chinese Spoken Language Processing (ISCSLP), 2014, 3rd author
(88) Data-driven Tree Structure Based UBM, International Symposium on Chinese Spoken Language Processing (ISCSLP), 2014, 2nd author
(89) Optimization Control for Biped Motion Trajectory, ICALIP (International Conference on Audio, Language and Image Processing), 2014, 4th author
(90) Chinese Image Character Recognition using DNN and Machine Simulated Training Samples, International Conference on Artificial Neural Networks (ICANN), 2014, 4th author
(91) Improving word embeddings via combining with complementary languages, 27th Canadian Conference on Artificial Intelligence (CAI), 2014, 2nd author
(92) Experimental comparison of text information based punctuation recovery algorithms real data, The 3rd International Conference on Computer Science and Network Technology (ICCSNT2013), 2013, 3rd author
(93) UNDERSTANDING THE DROPOUT STRATEGY AND ANALYZING ITS EFFECTIVENESS ON LVCSR, International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013, 3rd author
(94) INVESTIGATION OF DEEP BOLTZMANN MACHINES FOR PHONE RECOGNITION, International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013, 3rd author
(95) Integrating Multi-source Bilingual Information for Chinese Word Segmentation in Statistical Machine Translation, The Twelfth China National Conference on Computational Linguistics, CCL 2013, 2013, 4th author
(96) ASYNCHRONOUS STOCHASTIC GRADIENT DESCENT FOR DNN TRAINING, International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013, corresponding author
(97) MULTI-MODAL TOPIC UNIT SEGMENTATION IN VIDEOS USING CONDITIONAL RANDOM FIELDS, International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013, 3rd author
(98) Punctuation prediction for Chinese spoken sentence based on model combination, The 8th International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2013), 2013, 3rd author
(99) Phrase-based Parallel Fragments Extraction from Comparable Corpora, The 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), 2013, corresponding author
(100) A General Framework of Video Segmentation to Logical Unit based on Conditional Random Fields, ACM International Conference on Multimedia Retrieval (ICMR), 2013, 4th author
(101) Pseudo in-domain data selection from large-scale web corpus for spoken language translation, The 2nd Conference on Natural Language Processing and Chinese Computing, NLP&CC 2013, 2013, 3rd author
(102) BINARIZATION OF NATURAL SCENE TEXT BASED ON L1-NORM PCA, IEEE International Conference on Multimedia and Expo (ICME), 2013, 3rd author
(103) Joint and coupled bilingual topic model based sentence representations for language model adaptation, 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013), 2013, 4th author
(104) Multiple Style Exploration for Story Unit Segmentation of Broadcast News Video, Multimedia Systems, 2013, 3rd author
(105) Data-driven Gaussian Component Selection for Fast GMM-Based Speaker Verification, Interspeech, 2011, 3rd author
(106) An Empirical Study of Multilingual Spoken Term Detection, Interspeech, 2011, 3rd author
(107) Fusing Multiple Confidence Measures for Chinese Spoken Term Detection, Interspeech, 2011, 3rd author
(108) A Robust Approach to Mining Repeated Sequence in Audio Stream, Interspeech, 2011, corresponding author
(109) Context-dependent Duration Modeling with Backoff Strategy and Look-up Tables for Pronunciation Assessment and Mispronunciation Detection, Interspeech, 2011, 4th author
(110) Restoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification, Interspeech, 2011, 3rd author
(111) TV Commercial Detection Using Audiovisual Features and Support Vector Machine, ICCDA2011, 2011, 2nd author
(112) Efficient Commercial Video Retrieval using Multi-Modality and Segment-based Search, ICCDA 2011, 2011, 4th author
(113) Commercial Detection by Mining Maximal Repeated Sequence in Audio Stream, ICME 2011, 2011, corresponding author
(114) Exploring nuisance attribute projection and score normalization for GLDS-SVM based automatic mispronunciation detection method, ICASSP2011, 2011, 4th author
(115) An Exploration on Improving Statistical Machine Translation Performance by Using Post-editing Information, In Proceedings of the 2011 International Conference on Multimedia and Signal Processing, Guilin, 2011, 2nd author
(116) SUBSPACE CONSTRAINED LU DECOMPOSITION OF FMLLR FOR RAPID ADAPTATION, ICASSP 2011, 2011, 3rd author
(117) EXPLORING IMPLICIT SCORE NORMALIZATION TECHNIQUES IN SPEAKER VERIFICATION, ICASSP2011, 2011, 3rd author
(118) Construct a naturalistic 3D avatar with live help interfaces based on multi-layered representation, CISP2010, 2010, 2nd author
(119) An Investigation into Direct Scoring Methods without SVM Training in Speaker Verification, Interspeech2010, 2010, 3rd author
(120) English Mispronunciation Detection Method Based on GMM-UBM and GLDS-SVM (in Chinese), Acta Automatica Sinica, 2010, corresponding author
(121) Monaural Speech Separation Based on MAXVQ and CASA for Robust Speech Recognition, Computer Speech and Language, 2010, corresponding author
(122) Automatic reference independent evaluation of prosody quality using multiple knowledge fusions, Interspeech2010, 2010, 3rd author
(123) Exploring goodness of prosody by diverse matching templates, Interspeech2010, 2010, 3rd author
(124) Automatic Pronunciation Error Detection Based on Linguistic knowledge and Pronunciation Space, ICASSP, 2009, 4th author
(125) Chinese Intonation Assessment Using SEV Features, ICASSP, 2009, 2nd author
(126) Exploring the Automatic Mispronunciation Detection of Confusable Phones for Mandarin, ICASSP, 2009, 2nd author
(127) Research on a Robust Speech Recognition Front End Based on Computational Auditory Scene Analysis and Speaker Model Information (in Chinese), Acta Automatica Sinica, 2009, 4th author
(128) Microphone Array Speech Enhancement Based on Energy Loss Rate Estimation (in Chinese), Acta Acustica, 2009, 3rd author
(129) A Multi-System Fusion Spoken Keyword Detection Method Based on Complementary Acoustic Models (in Chinese), Acta Automatica Sinica, 2009, 4th author
(130) Monaural Speech Separation Based on MAXVQ and CASA for Robust Speech Recognition, Computer Speech and Language, 2008, corresponding author
(131) Improved Phonotactic LID using Random Forest Language Models, ICASSP, 2008, 4th author
(132) Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Asse, IEEE Transactions on Audio, Speech, and Language Processing, 2006, 3rd author

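Academic Appointments

He is currently director of the Institute of Automation, Chinese Academy of Sciences (CAS), dean of the CAS Artificial Intelligence Innovation Research Institute, and dean of the School of Artificial Intelligence at the University of Chinese Academy of Sciences. He also serves as a member of the National New Generation Artificial Intelligence Strategic Advisory Committee, deputy director of the CAS Center for Excellence in Brain Science and Intelligence Technology, member of the expert group of the Beijing Brain Science special program, and vice chairman of the Artificial Intelligence Industry Alliance of China, among other posts.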
