期刊文献+

交互式图像检索中的相关反馈技术研究进展 被引量:14

Relevance Feedback in Content-based Image Retrieval:the State of the Art
在线阅读 下载PDF
导出
摘要 相关反馈是近年来交互式图像检索领域研究的重要方向。首先提出了基于相关反馈的图像检索系统框架,并在此基础上从机器学习的角度分析了相关反馈学习的算法模型、样本获取、分布密度估计,及其在其特定应用背景下的困难和挑战。进而对图像相关反馈技术的研究现状进行调查总结,从聚类和分类两个方面对各种相关反馈算法在基于内容的图像检索中的应用进行了较为深入地研究和比较。最后对相关反馈技术发展趋势进行了展望,指出了该技术与图像语义抽取、用户模型建立以及软计算技术之间存在的密切关系。 Motivated by the fast growth of image databases, content-based image retrieval (CBIR) has received widespread research interest recently, where a typical user query is represented by a dynamic combination of visual and semantic descriptions of the desired image or class of images. Unfortunately, users often have difficulty specifying such descriptions. To alleviate those problems, users' queries are often refined interactively through mining the relevance of the feedback images, with the parameters combining the features automatically adjusted to adapt to the users' original needs. The aim of this paper is to clarify some of the issues raised by this new technology by reviewing the current capabilities and limitations of the relevance feedback (RF) techniques from the viewpoint of machine learning. A concept CBIR framework with RF techniques embed in is proposed first to facilitate the description, and the basic components of a RF-based CBIR system are illustrated as well. By using the proposed model, the paper proceeds with reviewing various RE techniques existing in the literatures from three aspects. The data which can be used as the source of RF-mining is investigated firstly. To analysis the obtained feedbacks, some assumptions about the distribution of the data must be made. We describe two kinds of probable distribution assumptions commonly seen in literatures, i. e., Gaussian distribution and mixture models, and the advantages and shortcomings of both assumptions are compared. Thirdly, we identify six challenges one may encounter in the practice of applying RE technique in CBIR context and point out that there is not such perfect algorithm that can deal with all the mentioned difficulties well at the same time. Next, we focus on the classical RE methods, which mainly originate from the pattern recognition or machine learning area. Roughly saying, we survey the typical RF algorithms from two branches, i.e. the clustering-based and classifying-based algorithms. The main difference between the two kinds of algorithms lies in that the clustering-based algorithms care more about the understanding of the 'query point', which is usually considered as the semantic representation of user's query in his or her mind, while the classifying-based ones try to find the class boundary directly from the image database using some prior knowledge about the statistical structures of the feedback data. However, it is worth mentioning that the boundary between the two branches of algorithms is soft in nature. The merits and shortcomings of the classical RF methods are also discussed. Finally, several typical CBIR systems, either commercial or academic, with RF-techniques embed in are surveyed The surveyed systems include such famous systems as QBIC, FourEyes, PicHunter, etc. The relevance feedback techniques adopted in those systems are identified and emphasized. In summary, we think that human is a key factor in the running of the whole CBIR system and it would play a more important role in the operation of the system than now. RF techniques, as a way to incorporate man in loop, will continue to receive wide interests from researchers. The paper then suggests several future promising research directions through analyzing the close relationship between relevance feedback technique and the abstracting of image semantic, user modeling and soft computing.
出处 《南京大学学报(自然科学版)》 CAS CSCD 北大核心 2004年第5期639-648,共10页 Journal of Nanjing University(Natural Science)
基金 国家自然科学基金(69903006) 教育部高等学校骨干教师资助项目[教技司(2000)65号] 中国博士后科学基金(中博基金[1997]11号)
关键词 相关反馈 基于内容的图像检索 信息检索 relevance feedback content based image retrieval information retrieval
  • 相关文献

参考文献2

二级参考文献12

  • 1Wu M R,Proc 2000 Int Conference on Pattern Recognition,2000年,561页
  • 2Rui Y,Proc 7th ACM International Conference on Multimedia,1999年,67页
  • 3Rui Y,IEEE Trans Circuits and Systems for Video Technology,1998年,8卷,5期,644页
  • 4Huang J,IEEE Conference on Computer Visionand Pattern Recognition,1997年,762页
  • 5Vapnik V. The Nature of statistical learning theory. New York: Springer-Verlag,1995. 5-13.
  • 6Christopher J C B. A tutorial on support vector machines for pattern recognition. Knowledge Discovery Data Mining, 1998, 2(2): 235-244.
  • 7Vapnik V, Levin E, Cun Y Le. Measuring the VC-dimension of a learning machine. Neural Computation, 1994, 6(5): 851-876.
  • 8Vapnik V. Statistical learning theory. New York:Wiley, 1998. 21-22.
  • 9Vapnik V, Golowich S, Smola A. Support vector method for function approximation, regression estimation, and signal processing. Mozer M, Jordan M, Petsche T. Advances in Neural Information Processing Systems 9. Cambridge: MIT Press, 1997: 281-287.
  • 10Osuma E, Freund R, Girosi F.An improved training algorithm for support vector machine. Principe J,Gile L, Morgan N, et al. Proc IEEE .NNSP'97. Amelia Island FL, 1997: 276-285.

共引文献34

同被引文献92

引证文献14

二级引证文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部