薛向阳 - 复旦大学 - 计算机科学技术学院

个人简介

主要从事计算机视觉、多媒体和机器学习等理论算法研究，在国际权威期刊（IEEE TPAMI、TIP等）和顶级会议（ICCV、CVPR、ICML、IJCAI、AAAI、ACM MM 等）上发表百余篇论文，引用 11000 余次，获 2016 年 IEEE TMM 论文奖（Prize Paper Award Honorable Mention）和 2017 年国际会议 ICME 论文奖（Platinum Best Paper）。曾获国家科技进步一等奖（第 10）和二等奖（第 7）、上海市科学技术一等奖（第 1、2、6、7）。获发明专利授权 30 多项。2012 年入选 “上海市学术带头人” 计划。2020 年入选由清华—中国工程院知识智能联合研究中心和清华大学人工智能研究院联合发布的 “2020 年度 AI 2000 人工智能全球最具影响力提名学者”。2018 年成为教育部人工智能科技创新专家组的工作组专家，2016 年被聘为科技部重点专项 “云计算与大数据” 总体专家组成员。2015 年开始任上海市图像图形学会副理事长，2016 年任中国图像图形学学会常务理事，2021年开始任中国图像图形学学会情感计算与理解专委会副主任， 2017 年开始任上海市人工智能学会副理事长。目前担任复旦大学大数据研究院和类脑智能科学与技术研究院副院长。

研究领域

学术研究方向：计算机视觉；多媒体；机器学习

应用研究领域：自动驾驶，服务机器人

近期论文

查看导师新发文章（温馨提示：请注意重名现象，建议点开原文通过作者单位确认）

Rethinking Local and Global Feature Representation for Dense Prediction.Pattern Recognition.2023,135 Exploring Efficient Few-shot Adaptation for Vision Transformers.arXiv.2023 Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images.IEEE Transactions on Pattern Analysis and Machine Intelligence.2023,45 (2):2166-2180 AGO-Net: Association-Guided 3D Point Cloud Object Detection Network.IEEE Transactions on Pattern Analysis and Machine Intelligence.2022,44 (11):8097-8109 Vocabulary-Informed Zero-Shot and Open-Set Learning.IEEE Transactions on Pattern Analysis and Machine Intelligence.2023,42 (12):3136-3152 Pixel2Mesh++: 3D Mesh Generation and Refinement From Multi-View Images.IEEE Transactions on Pattern Analysis and Machine Intelligence.2023,45 (2):2166-2180 Transfer learning for collaborative filtering via a rating-matrix generative model.Proceedings of the 26Th International Conference on Machine Learning, Icml 2009.2009 :617-624 Temporal Context as Cortical Spatial Codes.IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6.2009 :1999 一种格矢量量化半易损水印算法.小型微型计算机系统.2009,30 (2):327-331 Bilingual query translation and expansion for supporting more effective cross-language image retri….Mm'10 - Proceedings of the Acm Multimedia 2010 International Conference.2010 :875-878 Robust hashing for music copyright protection by combining beat segmentation and chroma.Mm'10 - Proceedings of the Acm Multimedia 2010 International Conference.2010 :935-938 Semantic video indexing by fusing explicit and implicit context spaces.Mm'10 - Proceedings of the Acm Multimedia 2010 International Conference.2010 :967-970 A novel audio fingerprinting method robust to time scale modification and pitch shifting.Mm'10 - Proceedings of the Acm Multimedia 2010 International Conference.2010 :987-990 Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation.Coling 2010 - 23Rd International Conference on Computational Linguistics, Proceedings of the Conference.2010,2 :1435-1443 An effective method for video genre classification.Civr 2010 - 2010 Acm International Conference on Image and Video Retrieval.2010 :97-104 Transfer incremental learning for pattern classification.International Conference on Information and Knowledge Management, Proceedings.2010 :1709-1712 A Hybrid Probabilistic Model for Unified Collaborative and Content-Based Image Tagging.IEEE Transactions on Pattern Analysis and Machine Intelligence.2011,33 (7):1281-1294 Transfer active learning.International Conference on Information and Knowledge Management, Proceedings.2011 :2169-2172 Level influence of spatial pyramid matching in object classification.Mm'11 - Proceedings of the 2011 Acm Multimedia Conference and Co-Located Workshops.2011 :1373-1376 Ensemble approach based on conditional random field for multi-label image and video annotation.Mm'11 - Proceedings of the 2011 Acm Multimedia Conference and Co-Located Workshops.2011 :1377-1380 Refining local descriptors by embedding semantic information for visual categorization.Mm'11 - Proceedings of the 2011 Acm Multimedia Conference and Co-Located Workshops.2011 :1381-1384 Semi-supervised multi-instance multi-label learning for video annotation task.Mm 2012 - Proceedings of the 20Th Acm International Conference on Multimedia.2012 :737-740 A fast video event recognition system and its application to video search.Mm 2012 - Proceedings of the 20Th Acm International Conference on Multimedia.2012 :1347-1348 Semantic context learning with large-scale weakly-labeled image set.Acm International Conference Proceeding Series.2012 :1859-1863 Complex Text Processing by the Temporal Context Machines.2009 IEEE 8Th International Conference on Development and Learning.2009 :220 Fudan University at TRECVID 2010: Semantic indexing.2010 TREC VIDEO RETRIEVAL EVALUATION NOTEBOOK PAPERS.2010 Automatic image annotation with weakly labeled dataset.Mm'11 - Proceedings of the 2011 Acm Multimedia Conference and Co-Located Workshops.2011 :1185-1188 Parallel proximal support vector machine for high-dimensional pattern classification.Acm International Conference Proceeding Series.2012 :2351-2354 CDTD: A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domai….International Journal of Computer Vision.2021,129 (3):761-780 Can Movies and Books Collaborate? Cross-Domain Collaborative Filtering for Sparsity Reduction.21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS.2009 :2052-2057 基于核密度估计的图像自动标注方法.计算机工程.2010,36 (6):198-200 Constructions of Cryptographically Significant Boolean Functions Using Primitive Polynomials.IEEE TRANSACTIONS ON INFORMATION THEORY.2010,56 (6):3048-3053 面向三网融合的统一安全管控技术.中兴通讯技术.2011,17 (4):23-28 A Double-Ranking Strategy for Long-Tail Product Recommendation.2012 IEEE/Wic/Acm International Conference on Web Intelligence and Intelligent Agent Technology (Wi-Iat 2012), Vol 1.2012 :282-286 Gradient Ordinal Signature and Fixed-Point Embedding for Efficient Near-Duplicate Video Detection.IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY.2012,22 (4):555-566 A Segmentation and Graph-Based Video Sequence Matching Method for Video Copy Detection.IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING.2013,25 (8):1706-1718 Query-Adaptive Image Search With Hash Codes.IEEE TRANSACTIONS ON MULTIMEDIA.2013,15 (2):442-453 Leveraging color harmony and spatial context for aesthetic assessment of photographs.Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics).2014,8879 :323-332 多媒体技术研究:2013——面向智能视频监控的视觉感知与处理.中国图象图形学报.2014,19 (11):1539-1562 A Graph Minor Perspective to Multicast Network Coding.IEEE TRANSACTIONS ON INFORMATION THEORY.2014,60 (9):5375-5386 Evaluating Two-Stream CNN for Video Classification.ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL.2015 :435-442 Low-Rank and Sparse Decomposition Based Frame Difference Method for Small Infrared Target Detectio….IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS.2016,E99D (2):554-557 Face Recognition via Active Annotation and Learning.MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE.2016 :1058-1062 Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification.MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE.2016 :791-800 Frame-transformer emotion classification network.ICMR 2017 - PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL.2017 :78-83 Multi-task deep neural network for joint face recognition and facial attribute prediction.ICMR 2017 - PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL.2017 :365-374 Learning to generate and edit hairstyles.MM 2017 - PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE.2017 :1627-1635 Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification.IEEE TRANSACTIONS ON MULTIMEDIA.2018,20 (11):3137-3147 Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Ne….IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE.2018,40 (2):352-364 Dual Skipping Networks.PROCEEDINGS OF THE IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION.2018 :4071-4079 CODA: Counting objects via scale-aware adversarial density adaption.PROCEEDINGS - IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO.2019,2019-July :193-198 Low-Rank and Locality Constrained Self-Attention for Sequence Modeling.IEEE-ACM Transactions on Audio Speech and Language Processing.2019,27 (12):2213-2222 COMP-GAN: Compositional generative adversarial network in synthesizing and recognizing facial expr….MM 2019 - PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA.2019 :211-219 Embodied one-shot video recognition: Learning from actions of a virtual embodied agent.MM 2019 - PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA.2019 :411-419 MEAL: Multi-Model Ensemble via Adversarial Learning.THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE.2019 :4886-4893 SSF-DAN: Separated semantic feature based domain adaptation network for semantic segmentation.PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION.2019,2019-October :982-991 A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization.IEEE Transactions on Multimedia.2020,22 (1):148-159 Visual Evaluation for Autonomous Driving.IEEE Transactions on Visualization and Computer Graphics.2022,28 (1):1030-1039 Raven''s Progressive Matrices Completion with Latent Gaussian Process Priors.THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE.2021,35 :9612-9620 Structured max-margin learning for multi-label image annotation.Civr 2010 - 2010 Acm International Conference on Image and Video Retrieval.2010 :82-88 Correlative Multi-Label Multi-Instance Image Annotation.2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV).2011 :651-658 Towards content-based audio fragment authentication.Mm'11 - Proceedings of the 2011 Acm Multimedia Conference and Co-Located Workshops.2011 :1249-1252 数字媒体理解验证平台与应用示范研究2013年度报告.科技资讯.2016,14 (8):165-166 小视频内容分析技术发展探讨.中兴通讯技术.2021,27 (1):54-59

学术兼职

中国计算机学会杰出会员 2020.1-2023.12 中国图像图形学学会常务理事，情感计算专委会副主任 2019.1-2022.12 上海市图像图形学会副理事长 2018.1-2022.12 上海市人工智能学会副理事长