引用本文
  • 何沛,王萌,王卓,卢光云.基于特征增强的对抗哈希跨模态检索[J].广西科学,2022,29(4):691-699.    [点击复制]
  • HE Pei,WANG Meng,WANG Zhuo,LU Guangyun.Feature Boosting Adversarial Hashing for Cross-Modal Retrieval[J].Guangxi Sciences,2022,29(4):691-699.   [点击复制]
【打印本页】 【在线阅读全文】【下载PDF全文】 查看/发表评论下载PDF阅读器关闭

←前一篇|后一篇→

过刊浏览    高级检索

本文已被:浏览 215次   下载 196 本文二维码信息
码上扫一扫!
基于特征增强的对抗哈希跨模态检索
何沛1, 王萌2, 王卓1, 卢光云3
0
(1.广西科技大学理学院, 广西柳州 545000;2.广西科技大学数字启迪学院, 广西柳州 545000;3.柳州工学院, 广西柳州 545616)
摘要:
在跨模态检索任务中,哈希方法由于其检索效率高效、储存成本低廉而被广泛应用。但是,这些方法很少关注如何去弥补主体网络将高维特征转换为哈希码的过程中所丢失的特征信息。为解决这些问题,本文提出了一种特征增强对抗跨模态哈希(Feature Boosting Adversarial Hashing for Cross-Modal,FBAH)方法。FBAH方法将子空间学习与对抗学习相结合,来减少不同模态数据的差异性。另外,构造一种类残差模块,它可以将筛选出具有区别性的特征绕过主体网络直接输入到哈希空间进行特征增强。这样,生成的哈希码能够具有更多的原始特征信息。最后,通过带有分支网络的线性分类器在标签空间进行两种方式的预测,并最小化与真实标签的差距来保证语义的不变性。本文选择两个跨模态检索任务中常用的大型数据集进行大量实验,结果表明FBAH方法的性能优于目前7种较为先进的跨模态哈希方法。
关键词:  特征增强  跨模态检索  稀疏矩阵  哈希子空间学习  对抗学习
DOI:10.13656/j.cnki.gxkx.20220919.009
投稿时间:2022-04-04
基金项目:广西中青年教师基础能力提升项目“基于语义的跨媒体检索方法研究(2019KY1095)”资助。
Feature Boosting Adversarial Hashing for Cross-Modal Retrieval
HE Pei1, WANG Meng2, WANG Zhuo1, LU Guangyun3
(1.College of Science, Guangxi University of Science and Technology, Liuzhou, Guangxi, 545000, China;2.Tus College of Digit, Guangxi University of Science and Technology, Liuzhou, Guangxi, 545000, China;3.Liuzhou Institute of Technology, Liuzhou, Guangxi, 545616, China)
Abstract:
In cross-modal retrieval tasks, hashing method is widely used because of its high retrieval efficiency and low storage cost.However, these methods pay little attention on how to compensate for the loss of feature information in the process of transforming high-dimensional features into hash codes.To solve these problems, this article proposes a Feature Boosting Adversarial Hashing for Cross-Modal (FBAH).FBAH combines subspace learning with adversarial learning to reduce the difference of different modal data.In addition, a module similar to residual structure is constructed, which can bypass the main network and directly input the selected distinguishing features into the hash space for feature boosting.In this way, the generated hash code can have more original feature information.Finally, the linear classifier with branch network is used to make two kinds of prediction in label space, and the gap with the real label is minimized to ensure the invariance of semantics.In this article, two large datasets commonly used in cross-modal retrieval tasks are selected for a large number of experiments.The results show that the performance of FBAH is superior to seven existing advanced cross-modal hashing methods.
Key words:  feature boosting  cross-modal retrieval  sparse matrix  hashing subspace learning  adversarial learning

用微信扫一扫

用微信扫一扫