上海交通大学学报(自然版) ›› 2012, Vol. 46 ›› Issue (09): 1406-1410.
唐峰1,孙锬锋1,2,蒋兴浩1,2,陆欢1
收稿日期:2011-10-14
出版日期:2012-09-28
发布日期:2012-09-28
基金资助:国家自然科学基金(60802057, 61071153),教育部新世纪优秀人才支持计划(NCET100569)资助项目
TANG Feng-1, SUN Tan-Feng-1, 2 , JIANG Xing-Hao-1, 2 , LU Huan-1
Received:2011-10-14
Online:2012-09-28
Published:2012-09-28
摘要: 针对图像表示,提出了一种基于改进稀疏编码模型的图像分类算法.首先,提取表示图像视觉局部特征的SIFT (ScaleInvariant Feature Transform) 描述子;然后,利用稀疏编码方法生成基于SIFT描述子的视觉词汇库,将SIFT描述子编成稀疏向量;通过有效稀疏向量的区域融合和空间结合而获取整体的稀疏向量并用于图像表示;最后,采用随机森林多分类器对稀疏向量进行训练和测试.结果表明,与现有的算法相比,该算法的性能更佳,可以有效表示图像的特性并提高其分类的准确率.
中图分类号:
唐峰1, 孙锬锋1, 2, 蒋兴浩1, 2, 陆欢1. 基于改进稀疏编码模型的图像分类算法[J]. 上海交通大学学报(自然版), 2012, 46(09): 1406-1410.
TANG Feng-1, SUN Tan-Feng-1, 2 , JIANG Xing-Hao-1, 2 , LU Huan-1. Image Categorization Algorithm Based on Improved Sparse Coding Model[J]. Journal of Shanghai Jiaotong University, 2012, 46(09): 1406-1410.
| [1]Van Gemert J C, Veenman C J, Smeulders W M, et al. Visual word ambiguity[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32 (7): 12711283.[2]Bosch A, Zisserman A, Muoz X. Image classification using random forests and ferns [C]//2007 IEEE 11th International Conference on Computer Vision. Rio de Janeiro, Brazil: IEEE Press, 2007: 18.[3]Olshausen B A, Field D J. Sparse coding with an overcomplete basis set: A strategy employed by V1? [J]. Vision Research, 1997, 37(23): 33113325.[4]Yang J C, Yu K, Gong Y H, et al. Linear spatial pyramid matching using sparse coding for image classification[C]//IEEE Conference on Computer Vision and Pattern Recognition (2009). Miami, USA: IEEE Press, 2009: 17941801.[5]Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories [C]//2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Volume 2 CVPR06. New York, USA: IEEE Press, 2006: 21692178.[6]Breiman L. Random forests [J]. Machine Learning, 2001, 45(1): 532.[7]Mairal J, Bach F, Ponce J, et al. Online learning for matrix factorization and sparse coding [J]. Machine Learning Research, 2010, 11(1): 1960.[8]Efron B, Hastie T, Johnstone I, et al. Least angle regression[J]. Annals of Statistics, 2004, 32(2): 407499.[9]Caruana R, NiculescuMizil A. An empirical comparison of supervised learning algorithms [C]//Proc 23 rd International Conference on Machine Learning. New York, USA: ACM, 2006: 161168.[10]Li F F, Rob F, Pietro P. Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories[J]. Computer Vision and Image Understanding, 2007, 106(1): 5970. |
| [1] | . 基于多尺度融合和自适应标签相关性的多标签图像分类模型[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(5): 889-898. |
| [2] | 周 雷1, 苏 馨2, 张 崎2, 黄 一2. 基于LSTM重构环境特征的异常值检测[J]. 海洋工程装备与技术, 2024, 11(4): 1-7. |
| [3] | 李川, 聂熠文, 刘军伟, 孟凡钦, 沈晓静. 基于机器学习的多算法融合航迹稳健起始方法[J]. 空天防御, 2022, 5(1): 20-24. |
| [4] | 郭维诚,李蓓智,杨建国,周勤之. 磨削过程信号监测与砂轮磨损预测模型构建[J]. 上海交通大学学报, 2019, 53(12): 1475-1481. |
| [5] | 马潮, 陆志沣, 余海鸣, 洪泽华, 杨杰, 乔宇. 红外成像导引头抗干扰性能评估方法研究[J]. 空天防御, 2018, 1(4): 44-47. |
| [6] | 陈东岳,陈宗文. 基于特征显著性的均值漂移鲁棒目标跟踪[J]. 上海交通大学学报(自然版), 2013, 47(11): 1807-1812. |
| [7] | 程东阳, 蒋兴浩, 孙锬锋. 基于稀疏编码和多核学习的图像分类算法[J]. 上海交通大学学报(自然版), 2012, 46(11): 1789-1793. |
| [8] | 傅光磊,孙锬锋,蒋兴浩,. 基于语义绑定的分层视觉词汇库的图像检索 [J]. 上海交通大学学报(自然版), 2011, 45(02): 154-0158. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||