HDF-Net: Capturing Homogeny Difference Features to Localize the Tampered Image
IEEE Transactions on Pattern Analysis and Machine Intelligence (IF 20.8), Pub Date: 2024-07-23, DOI: 10.1109/tpami.2024.3432551
Ruidong Han 1, Xiaofeng Wang 2, Ningning Bai 2, Yihang Wang 2, Jianpeng Hou 2, Jianru Xue 3

Modern image editing software enables anyone to alter the content of an image to deceive the public, which poses a hazard to personal privacy and public safety. Detecting and localizing image tampering has therefore become an urgent problem. We show that, after manipulations such as splicing, copy-move, and removal, the tampered region exhibits homogeny differences (changes in the image's metadata organization form and organization structure) relative to the authentic region. We therefore propose a novel end-to-end network, HDF-Net, that extracts these homogeny difference features to localize tampering artifacts precisely. HDF-Net consists of RGB and SRM dual-stream networks and includes three complementary modules: the suspicious tampering-artifact prominent (STP) module, the fine tampering-artifact salient (FTS) module, and the tampering-artifact edge refined (TER) module. We use the fully attentional block (FLA) to strengthen the representation of the homogeny difference features extracted by each module and to preserve the details of tampering artifacts. The modules are merged progressively following a "coarse-fine-finer" strategy, which significantly improves localization accuracy and edge refinement. Extensive experiments demonstrate that HDF-Net outperforms state-of-the-art tampering localization models on five benchmarks while achieving satisfactory generalization and robustness. Code is available at https://github.com/ruidonghan/HDF-Net/.
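
To make the "SRM stream" of the dual-stream design more concrete, here is a minimal sketch of how SRM-style noise residuals can be computed from a grayscale image. It assumes the three 5x5 high-pass kernels that are commonly used as an SRM front end in image-forensics work; the abstract does not state which kernels HDF-Net uses, and the names `SRM_KERNELS`, `srm_residual`, and `srm_stream_input` are hypothetical, not taken from the authors' repository.

```python
# Illustrative sketch only. The kernels below are the three 5x5 SRM high-pass
# filters widely used in image forensics; that HDF-Net uses exactly these is
# an ASSUMPTION, as are all function and variable names here.

# Each entry is (divisor, integer kernel); the response is sum / divisor.
SRM_KERNELS = [
    (4, [[0, 0, 0, 0, 0],
         [0, -1, 2, -1, 0],
         [0, 2, -4, 2, 0],
         [0, -1, 2, -1, 0],
         [0, 0, 0, 0, 0]]),
    (12, [[-1, 2, -2, 2, -1],
          [2, -6, 8, -6, 2],
          [-2, 8, -12, 8, -2],
          [2, -6, 8, -6, 2],
          [-1, 2, -2, 2, -1]]),
    (2, [[0, 0, 0, 0, 0],
         [0, 0, 0, 0, 0],
         [0, 1, -2, 1, 0],
         [0, 0, 0, 0, 0],
         [0, 0, 0, 0, 0]]),
]


def srm_residual(gray, divisor, kernel):
    """Valid-mode 2D cross-correlation of a grayscale image with one kernel.

    `gray` is a list of equal-length rows of numbers. The output is smaller
    than the input by (kernel size - 1) in each dimension.
    """
    k = len(kernel)
    height, width = len(gray), len(gray[0])
    out = []
    for y in range(height - k + 1):
        row = []
        for x in range(width - k + 1):
            acc = 0
            for i in range(k):
                for j in range(k):
                    acc += kernel[i][j] * gray[y + i][x + j]
            row.append(acc / divisor)
        out.append(row)
    return out


def srm_stream_input(gray):
    """Stack the three residual maps, one per SRM kernel."""
    return [srm_residual(gray, d, k) for d, k in SRM_KERNELS]
```

Because every kernel sums to zero, smooth image content is suppressed and only local noise inconsistencies remain, which is why such residuals are a common complement to the RGB stream. In practice these filters would typically be applied as a fixed convolution layer inside the network rather than with Python loops.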


