样式: 排序: IF: - GO 导出 标记为已读
-
EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-19 Qiang Qu, Xiaoming Chen, Yuk Ying Chung, Yiran Shen
-
Dynamic Semantic-based Spatial-Temporal Graph Convolution Network for Skeleton-based Human Action Recognition IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-19 Jianyang Xie, Yanda Meng, Yitian Zhao, Nguyen Anh, Xiaoyun Yang, Yalin Zheng
-
DeepDuoHDR: A Low Complexity Two Exposure Algorithm for HDR Deghosting on Mobile Devices IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-19 Kadir Cenk Alpay, Ahmet Oğuz Akyüz, Nicola Brandonisio, Joseph Meehan, Alan Chalmers
-
AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-19 Wengyi Zhan, Mingbao Lin, Chia-Wen Lin, Rongrong Ji
-
Enhanced Multispectral Band-to-Band Registration using Co-occurrence Scale Space and Spatial Confined RANSAC Guided Segmented Affine Transformation IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-14 Indranil Misra, Mukesh Kumar Rohil, S. Manthira Moorthi, Debajyoti Dhar
-
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-14 Hongmin Liu, Canbin Zhang, Bin Fan, Jinglin Xu
-
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-12 Xu Zhang, Kailun Yang, Jiacheng Lin, Jin Yuan, Zhiyong Li, Shutao Li
-
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-12 Huan Liu, Wei Li, Xiang-Gen Xia, Mengmeng Zhang, Zhengqi Guo, Lujie Song
-
Constructing Diverse Inlier Consistency for Partial Point Cloud Registration IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-12 Yu-Xin Zhang, Jie Gui, James Tin-Yau Kwok
-
Noisy-Aware Unsupervised Domain Adaptation for Scene Text Recognition IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-12 Xiao-Qian Liu, Fei-Peng Zhang, Xin Luo, Zi Huang, Xin-Shun Xu
-
Explainability Enhanced Object Detection Transformer With Feature Disentanglement IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-12 Wenlong Yu, Ruonan Liu, Dongyue Chen, Qinghua Hu
-
CFOR: Character-First Open-Set Text Recognition via Context-Free Learning IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-11 Chang Liu, Chun Yang, Zhiyu Fang, Hai-Bo Qin, Xu-Cheng Yin
-
Smooth Tensor Product for Tensor Completion IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-11 Tongle Wu, Jicong Fan
-
Cost Volume Aggregation in Stereo Matching Revisited: A Disparity Classification Perspective IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-11 Yun Wang, Longguang Wang, Kunhong Li, Yongjian Zhang, Dapeng Oliver Wu, Yulan Guo
-
Learning a Cross-modality Anomaly Detector for Remote Sensing Imagery IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-08 Jingtao Li, Xinyu Wang, Hengwei Zhao, Yanfei Zhong
-
Laplacian Gradient Consistency Prior for Flash Guided Non-Flash Image Denoising IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-07 Jingyi Xu, Xin Deng, Chenxiao Zhang, Shengxi Li, Mai Xu
-
Efficient Swin Transformer for Remote Sensing Image Super-Resolution IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-06 Xudong Kang, Puhong Duan, Jier Li, Shutao Li
-
Multi-Dimensional Visual Data Restoration: Uncovering the Global Discrepancy in Transformed High-Order Tensor Singular Values IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-04 Chengxun He, Yang Xu, Zebin Wu, Shangdong Zheng, Zhihui Wei
-
Learning Cross-Attention Point Transformer with Global Porous Sampling IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-01 Yueqi Duan, Haowen Sun, Juncheng Yan, Jiwen Lu, Jie Zhou
-
Salient Object Detection From Arbitrary Modalities IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-11-01 Nianchang Huang, Yang Yang, Ruida Xi, Qiang Zhang, Jungong Han, Jin Huang
-
AnlightenDiff: Anchoring Diffusion Probabilistic Model on Low Light Image Enhancement IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-31 Cheuk-Yiu Chan, Wan-Chi Siu, Yuk-Hee Chan, H. Anthony Chan
-
GSSF: Generalized Structural Sparse Function for Deep Cross-Modal Metric Learning IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-31 Haiwen Diao, Ying Zhang, Shang Gao, Jiawen Zhu, Long Chen, Huchuan Lu
Cross-modal metric learning is a prominent research topic that bridges the semantic heterogeneity between vision and language. Existing methods frequently utilize simple cosine or complex distance metrics to transform the pairwise features into a similarity score, which suffers from an inadequate or inefficient capability for distance measurements. Consequently, we propose a Generalized Structural
-
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-29 Yifan Xu, Mengdan Zhang, Xiaoshan Yang, Changsheng Xu
We explore multi-modal contextual knowledge learned through multi-modal masked language modeling to provide explicit localization guidance for novel classes in open-vocabulary object detection (OVD). Intuitively, a well-modeled and correctly predicted masked concept word should effectively capture the textual contexts, visual contexts, and the cross-modal correspondence between texts and regions, thereby
-
Rethinking Noise Sampling in Class-Imbalanced Diffusion Models IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-29 Chenghao Xu, Jiexi Yan, Muli Yang, Cheng Deng
-
Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-25 Yuanman Li, Yingjie He, Changsheng Chen, Li Dong, Bin Li, Jiantao Zhou, Xia Li
-
λ-Domain Rate Control via Wavelet-Based Residual Neural Network for VVC HDR Intra Coding IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-25 Feng Yuan, Jianjun Lei, Zhaoqing Pan, Bo Peng, Haoran Xie
High dynamic range (HDR) video offers a more realistic visual experience than standard dynamic range (SDR) video, while introducing new challenges to both compression and transmission. Rate control is an effective technology to overcome these challenges, and ensure optimal HDR video delivery. However, the rate control algorithm in the latest video coding standard, versatile video coding (VVC), is tailored
-
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-24 Chi Zhang, Wenbo Chen, Wei Wang, Zhaoxiang Zhang
Recently, unsupervised domain adaptation (UDA) for 3D object detectors has increasingly garnered attention as a method to eliminate the prohibitive costs associated with generating extensive 3D annotations, which are crucial for effective model training. Self-training (ST) has emerged as a simple and effective technique for UDA. The major issue involved in ST-UDA for 3D object detection is refining
-
Enhancing Few-Shot Out-of-Distribution Detection with Pre-Trained Model Features IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-24 Jiuqing Dong, Yifan Yao, Wei Jin, Heng Zhou, Yongbin Gao, Zhijun Fang
-
AS2LS: Adaptive Anatomical Structure-based Two-layer Level Set Framework for Medical Image Segmentation IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-24 Tianyi Han, Haoyu Cao, Yunyun Yang
-
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-24 Donggon Jang, Sunhyeok Lee, Gyuwon Choi, Yejin Lee, Sanghyeok Son, Dae-Shik Kim
Robust segmentation performance under dense fog is crucial for autonomous driving, but collecting labeled real foggy scene datasets is burdensome in the real world. To this end, existing methods have adapted models trained on labeled clear weather images to the unlabeled real foggy domain. However, these approaches require intermediate domain datasets (e.g. synthetic fog) and involve multi-stage training
-
Deblurring Videos Using Spatial-Temporal Contextual Transformer with Feature Propagation IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-24 Liyan Zhang, Boming Xu, Zhongbao Yang, Jinshan Pan
-
Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-23 Xinyue Li, Aous Naman, David Taubman
This paper provides a comprehensive study on features and performance of different ways to incorporate neural networks into lifting-based wavelet-like transforms, within the context of fully scalable and accessible image compression. Specifically, we explore different arrangements of lifting steps, as well as various network architectures for learned lifting operators. Moreover, we examine the impact
-
A Bi-directionally Fused Boundary Aware Network for Skin Lesion Segmentation IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-23 Feiniu Yuan, Yuhuan Peng, Qinghua Huang, Xuelong Li
-
Latitude-Redundancy-Aware All-Zero Block Detection for Fast 360-Degree Video Coding IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-22 Chang Yu, Xiaopeng Fan, Pengjin Chen, Yuxin Ni, Hengyu Man, Debin Zhao
The sphere-to-plane projection of 360-degree video introduces substantial stretched redundant data, which is discarded when reprojected to the 3D sphere for display. Consequently, encoding and transmitting such redundant data is unnecessary. Highly redundant blocks can be referred to as all-zero blocks (AZBs). Detecting these AZBs in advance can reduce computational and transmission resource consumption
-
Enhancing Sample Utilization in Noise-Robust Deep Metric Learning With Subgroup-Based Positive-Pair Selection IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-22 Zhipeng Yu, Qianqian Xu, Yangbangyan Jiang, Yingfei Sun, Qingming Huang
The existence of noisy labels in real-world data negatively impacts the performance of deep learning models. Although much research effort has been devoted to improving the robustness towards noisy labels in classification tasks, the problem of noisy labels in deep metric learning (DML) remains under-explored. Existing noisy label learning methods designed for DML mainly discard suspicious noisy samples
-
A Scalable Training Strategy for Blind Multi-Distribution Noise Removal IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-22 Kevin Zhang, Sakshum Kulshrestha, Christopher A. Metzler
Despite recent advances, developing general-purpose universal denoising and artifact-removal networks remains largely an open problem: Given fixed network weights, one inherently trades-off specialization at one task (e.g., removing Poisson noise) for performance at another (e.g., removing speckle noise). In addition, training such a network is challenging due to the curse of dimensionality: As one
-
EviPrompt: A Training-Free Evidential Prompt Generation Method for Adapting Segment Anything Model in Medical Images IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-22 Yinsong Xu, Jiaqi Tang, Aidong Men, Qingchao Chen
Medical image segmentation is a critical task in clinical applications. Recently, the Segment Anything Model (SAM) has demonstrated potential for natural image segmentation. However, the requirement for expert labour to provide prompts, and the domain gap between natural and medical images pose significant obstacles in adapting SAM to medical images. To overcome these challenges, this paper introduces
-
Sparse Coding Inspired LSTM and Self-Attention Integration for Medical Image Segmentation IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-22 Zexuan Ji, Shunlong Ye, Xiao Ma
Accurate and automatic segmentation of medical images plays an essential role in clinical diagnosis and analysis. It has been established that integrating contextual relationships substantially enhances the representational ability of neural networks. Conventionally, Long Short-Term Memory (LSTM) and Self-Attention (SA) mechanisms have been recognized for their proficiency in capturing global dependencies
-
Nonparametric Clustering-Guided Cross-View Contrastive Learning for Partially View-Aligned Representation Learning IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-21 Shengsheng Qian, Dizhan Xue, Jun Hu, Huaiwen Zhang, Changsheng Xu
With the increasing availability of multi-view data, multi-view representation learning has emerged as a prominent research area. However, collecting strictly view-aligned data is usually expensive, and learning from both aligned and unaligned data can be more practicable. Therefore, Partially View-aligned Representation Learning (PVRL) has recently attracted increasing attention. After aligning multi-view
-
Toward Blind Flare Removal Using Knowledge-Driven Flare-Level Estimator IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-21 Haoyou Deng, Lida Li, Feng Zhang, Zhiqiang Li, Bin Xu, Qingbo Lu, Changxin Gao, Nong Sang
Lens flare is a common phenomenon when strong light rays arrive at the camera sensor and a clean scene is consequently mixed up with various opaque and semi-transparent artifacts. Existing deep learning methods are always constrained with limited real image pairs for training. Though recent synthesis-based approaches are found effective, synthesized pairs still deviate from the real ones as the mixing
-
Transforming Image Super-Resolution: A ConvFormer-Based Efficient Approach IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-18 Gang Wu, Junjun Jiang, Junpeng Jiang, Xianming Liu
Recent progress in single-image super-resolution (SISR) has achieved remarkable performance, yet the computational costs of these methods remain a challenge for deployment on resource-constrained devices. In particular, transformer-based methods, which leverage self-attention mechanisms, have led to significant breakthroughs but also introduce substantial computational costs. To tackle this issue,
-
TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-17 Khang Truong Giang, Soohwan Song, Sungho Jo
This study tackles image matching in difficult scenarios, such as scenes with significant variations or limited texture, with a strong emphasis on computational efficiency. Previous studies have attempted to address this challenge by encoding global scene contexts using Transformers. However, these approaches have high computational costs and may not capture sufficient high-level contextual information
-
NTK-Guided Few-Shot Class Incremental Learning IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-17 Jingren Liu, Zhong Ji, Yanwei Pang, Yunlong Yu
The proliferation of Few-Shot Class Incremental Learning (FSCIL) methodologies has highlighted the critical challenge of maintaining robust anti-amnesia capabilities in FSCIL learners. In this paper, we present a novel conceptualization of anti-amnesia in terms of mathematical generalization, leveraging the Neural Tangent Kernel (NTK) perspective. Our method focuses on two key aspects: ensuring optimal
-
Learning Content-Weighted Pseudocylindrical Representation for 360° Image Compression IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-17 Mu Li, Youneng Bao, Xiaohang Sui, Jinxing Li, Guangming Lu, Yong Xu
Learned 360° image compression methods using equirectangular projection (ERP) often confront a non-uniform sampling issue, inherent to sphere-to-rectangle projection. While uniformly or nearly uniformly sampling representations, along with their corresponding convolution operations, have been proposed to mitigate this issue, these methods often concentrate solely on uniform sampling rates, thus neglecting
-
Improved MRF Reconstruction via Structure-Preserved Graph Embedding Framework IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-16 Peng Li, Yuping Ji, Yue Hu
Highly undersampled schemes in magnetic resonance fingerprinting (MRF) typically lead to aliasing artifacts in reconstructed images, thereby reducing quantitative imaging accuracy. Existing studies mainly focus on improving the reconstruction quality by incorporating temporal or spatial data priors. However, these methods seldom exploit the underlying MRF data structure driven by imaging physics and
-
DHM-Net: Deep Hypergraph Modeling for Robust Feature Matching IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-16 Shunxing Chen, Guobao Xiao, Junwen Guo, Qiangqiang Wu, Jiayi Ma
We present a novel deep hypergraph modeling architecture (called DHM-Net) for feature matching in this paper. Our network focuses on learning reliable correspondences between two sets of initial feature points by establishing a dynamic hypergraph structure that models group-wise relationships and assigns weights to each node. Compared to existing feature matching methods that only consider pair-wise
-
Searching Discriminative Regions for Convolutional Neural Networks in Fundus Image Classification With Genetic Algorithms IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-16 Yibiao Rong, Tian Lin, Haoyu Chen, Zhun Fan, Xinjian Chen
Deep convolutional neural networks (CNNs) have been widely used for fundus image classification and have achieved very impressive performance. However, the explainability of CNNs is poor because of their black-box nature, which limits their application in clinical practice. In this paper, we propose a novel method to search for discriminative regions to increase the confidence of CNNs in the classification
-
Hierarchical Graph Interaction Transformer With Dynamic Token Clustering for Camouflaged Object Detection IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-15 Siyuan Yao, Hao Sun, Tian-Zhu Xiang, Xiao Wang, Xiaochun Cao
Camouflaged object detection (COD) aims to identify the objects that seamlessly blend into the surrounding backgrounds. Due to the intrinsic similarity between the camouflaged objects and the background region, it is extremely challenging to precisely distinguish the camouflaged objects by existing approaches. In this paper, we propose a hierarchical graph interaction network termed HGINet for camouflaged
-
Cross-Scope Spatial-Spectral Information Aggregation for Hyperspectral Image Super-Resolution IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-15 Shi Chen, Lefei Zhang, Liangpei Zhang
Hyperspectral image super-resolution has attained widespread prominence to enhance the spatial resolution of hyperspectral images. However, convolution-based methods have encountered challenges in harnessing the global spatial-spectral information. The prevailing transformer-based methods have not adequately captured the long-range dependencies in both spectral and spatial dimensions. To alleviate
-
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-15 Jianqun Zhou, Pengwen Dai, Yang Li, Manjiang Hu, Xiaochun Cao
Scene text editing aims to replace the source text with the target text while preserving the original background. Its practical applications span various domains, such as data generation and privacy protection, highlighting its increasing importance in recent years. In this study, we propose a novel Scene Text Editing network with Explicitly-decoupled text transfer and Minimized background reconstruction
-
Injecting Text Clues for Improving Anomalous Event Detection From Weakly Labeled Videos IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-15 Tianshan Liu, Kin-Man Lam, Bing-Kun Bao
Video anomaly detection (VAD) aims at localizing the snippets containing anomalous events in long unconstrained videos. The weakly supervised (WS) setting, where solely video-level labels are available during training, has attracted considerable attention, owing to its satisfactory trade-off between the detection performance and annotation cost. However, due to lack of snippet-level dense labels, the
-
GRiD: Guided Refinement for Detector-Free Multimodal Image Matching IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-11 Yuyan Liu, Wei He, Hongyan Zhang
Multimodal image matching is essential in image stitching, image fusion, change detection, and land cover mapping. However, the severe nonlinear radiometric distortion (NRD) and geometric distortions in multimodal images severely limit the accuracy of multimodal image matching, posing significant challenges to existing methods. Additionally, detector-based methods are prone to feature point offset
-
Toward Real-World Super Resolution With Adaptive Self-Similarity Mining IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-09 Zejia Fan, Wenhan Yang, Zongming Guo, Jiaying Liu
Despite efforts to construct super-resolution (SR) training datasets with a wide range of degradation scenarios, existing supervised methods based on these datasets still struggle to consistently offer promising results due to the diversity of real-world degradation scenarios and the inherent complexity of model learning. Our work explores a new route: integrating the sample-adaptive property learned
-
MLFA: Toward Realistic Test Time Adaptive Object Detection by Multi-Level Feature Alignment IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-09 Yabo Liu, Jinghua Wang, Chao Huang, Yiling Wu, Yong Xu, Xiaochun Cao
Object detection methods have achieved remarkable performances when the training and testing data satisfy the assumption of i.i.d. However, the training and testing data may be collected from different domains, and the gap between the domains can significantly degrade the detectors. Test Time Adaptive Object Detection (TTA-OD) is a novel online approach that aims to adapt detectors quickly and make
-
Error Model and Concise Temporal Network for Indirect Illumination in 3D Reconstruction IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-08 Yuchong Chen, Pengcheng Yao, Rui Gao, Wei Zhang, Shaoyan Gai, Jian Yu, Feipeng Da
3D reconstruction is a fundamental task in robotics and AI, providing a prerequisite for many related applications. Fringe projection profilometry is an efficient and non-contact method for generating 3D point clouds out of 2D images. However, during the actual measurement, it is inevitable to experiment with translucent objects, such as skin, marble, and fruit. Indirect illumination from these objects
-
Multi-Scale Spatio-Temporal Memory Network for Lightweight Video Denoising IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-08 Lu Sun, Fangfang Wu, Wei Ding, Xin Li, Jie Lin, Weisheng Dong, Guangming Shi
Deep learning-based video denoising methods have achieved great performance improvements in recent years. However, the expensive computational cost arising from sophisticated network design has severely limited their applications in real-world scenarios. To address this practical weakness, we propose a multiscale spatio-temporal memory network for fast video denoising, named MSTMN, aiming at striking
-
A Virtual-Sensor Construction Network Based on Physical Imaging for Image Super-Resolution IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-08 Guozhi Tang, Hongwei Ge, Liang Sun, Yaqing Hou, Mingde Zhao
Image imaging in the real world is based on physical imaging mechanisms. Existing super-resolution methods mainly focus on designing complex network structures to extract and fuse image features more effectively, but ignore the guiding role of physical imaging mechanisms for model design, and cannot mine features from a physical perspective. Inspired by the mechanism of physical imaging, we propose
-
Self-Supervised Sub-Action Parsing Network for Semi-Supervised Action Quality Assessment IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-07 Kumie Gedamu, Yanli Ji, Yang Yang, Jie Shao, Heng Tao Shen
Semi-supervised Action Quality Assessment (AQA) using limited labeled and massive unlabeled samples to achieve high-quality assessment is an attractive but challenging task. The main challenge relies on how to exploit solid and consistent representations of action sequences for building a bridge between labeled and unlabeled samples in the semi-supervised AQA. To address the issue, we propose a Self-supervised
-
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-07 Yujie Zhang, Qi Yang, Yiling Xu, Shan Liu
Full-reference point cloud quality assessment (FR-PCQA) aims to infer the quality of distorted point clouds with available references. Most of the existing FR-PCQA metrics ignore the fact that the human visual system (HVS) dynamically tackles visual information according to different distortion levels (i.e., distortion detection for high-quality samples and appearance perception for low-quality samples)
-
Accurate and Robust Object Detection via Selective Adversarial Learning With Constraints IEEE Trans. Image Process. (IF 10.8) Pub Date : 2024-10-04 Jianpin Chen, Heng Li, Qi Gao, Junling Liang, Ruipeng Zhang, Liping Yin, Xinyu Chai
ConvNet-based object detection networks have achieved outstanding performance on clean images. However, many works have shown that these detectors perform poorly on corrupted images caused by noises, blurs, poor weather conditions and so on. With the development of security-sensitive applications, the detector’s practicability has raised increasing concerns. Existing approaches improve detector robustness