سامانه پژوهشی دانشگاه ملایر | Semantic image representation for image recognition and retrieval using multilayer variational auto-encoder, InceptionNet and low-level image features

عنوان	Semantic image representation for image recognition and retrieval using multilayer variational auto-encoder, InceptionNet and low-level image features
نوع پژوهش	مقاله چاپ شده
کلیدواژه‌ها	Image representation · Image recognition · Content-based image retrieval · Deep learning
چکیده	This paper presents a novel image descriptor that enhances performance in image recognition and retrieval by combining deep learning and handcrafted features. Our method integrates high-level semantic features extracted via InceptionResNet-V2 with color and texture features to create a comprehensive representation of image content. The descriptor’s effectiveness is demonstrated through extensive experiments across a range of image recognition and retrieval tasks. Our approach is tested on six benchmark datasets, including Corel-1 K, VS, OT, QT, SUN-397, and ILSVRC-2012 for single-label classification, and COCO and NUS-WIDE for multilabel classification, achieving high performances. The results establish that the proposed method is versatile and robust, excelling in single-label and multi-label recognition as well as image retrieval tasks, and outperforms several state-of-theart methods. This work provides a significant advancement in image representation, with broad applicability in various computer vision domains.
پژوهشگران	داور گیوکی (نفر اول)، سجاد اسفندیاری (نفر دوم)