Completion of Missing Labels for Multi-Label Annotation by a Unified Graph Laplacian Regularization

IEICE Transactions on Information and Systems, Vol. E103.D, No. 10, pp. 2154-2161, published 2020-10-01

File information (attachment)
IEICETIS_E103.D_2154.pdf 1.47 MB Type: Full text
Title (eng)
Completion of Missing Labels for Multi-Label Annotation by a Unified Graph Laplacian Regularization
Creators
MOJOO Jonathan
ZHAO Yu
KAVITHA Muthu Subash
Source title
IEICE Transactions on Information and Systems
Volume E103.D
Issue 10
Start page 2154
End page 2161
Abstract
The task of image annotation is becoming enormously important for efficient image retrieval from the web and other large databases. However, huge semantic information and complex dependency of labels on an image make the task challenging. Hence determining the semantic similarity between multiple labels on an image is useful to understand any incomplete label assignment for image retrieval. This work proposes a novel method to solve the problem of multi-label image annotation by unifying two different types of Laplacian regularization terms in a deep convolutional neural network (CNN) for robust annotation performance. The unified Laplacian regularization model is implemented to address the missing labels efficiently by generating the contextual similarity between labels both internally and externally through their semantic similarities, which is the main contribution of this study. Specifically, we generate similarity matrices between labels internally by using Hayashi's quantification method-type III and externally by using the word2vec method. The generated similarity matrices from the two different methods are then combined as a Laplacian regularization term, which is used as the new objective function of the deep CNN. The regularization term implemented in this study is able to address the multi-label annotation problem, enabling a more effectively trained neural network. Experimental results on public benchmark datasets reveal that the proposed unified regularization model with a deep CNN produces significantly better results than the baseline CNN without regularization and other state-of-the-art methods for predicting missing labels.
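As a rough illustration of the idea in the abstract, the following Python sketch shows how two label-similarity matrices might be merged into one graph Laplacian penalty on a network's label scores. It is not the authors' implementation. The abstract does not say how the two matrices are combined, so the convex combination with weight `alpha`, the unnormalized Laplacian L = D - S, the binary cross-entropy data term, and the weight `lam` are all assumptions. The names `graph_laplacian`, `unified_laplacian_penalty`, and `total_loss` are hypothetical, and the toy matrices stand in for similarities that the paper derives from Hayashi's quantification method type III (internal) and word2vec (external).

```python
import numpy as np


def graph_laplacian(S):
    """Unnormalized graph Laplacian L = D - S of a symmetric similarity matrix."""
    S = np.asarray(S, dtype=float)
    return np.diag(S.sum(axis=1)) - S


def unified_laplacian_penalty(P, S_int, S_ext, alpha=0.5):
    """Mean over samples of p^T L p, where L is built from both similarities.

    P     : (n_samples, n_labels) predicted label scores.
    S_int : (n_labels, n_labels) internal (dataset-derived) similarity.
    S_ext : (n_labels, n_labels) external (word-embedding) similarity.
    alpha : ASSUMED mixing weight; the abstract only says "combined".
    """
    S = alpha * np.asarray(S_int, dtype=float) \
        + (1.0 - alpha) * np.asarray(S_ext, dtype=float)
    L = graph_laplacian(S)
    P = np.atleast_2d(np.asarray(P, dtype=float))
    # trace(P L P^T) sums p^T L p over the rows of P.
    return float(np.trace(P @ L @ P.T)) / P.shape[0]


def total_loss(P, Y, S_int, S_ext, alpha=0.5, lam=0.1):
    """ASSUMED objective: binary cross-entropy plus lam times the penalty."""
    eps = 1e-12  # avoids log(0)
    P = np.asarray(P, dtype=float)
    Y = np.asarray(Y, dtype=float)
    bce = -np.mean(Y * np.log(P + eps) + (1.0 - Y) * np.log(1.0 - P + eps))
    return float(bce + lam * unified_laplacian_penalty(P, S_int, S_ext, alpha))


if __name__ == "__main__":
    # Toy similarities for three hypothetical labels.
    S_int = np.array([[0.0, 1.0, 0.0], [1.0, 0.0, 0.5], [0.0, 0.5, 0.0]])
    S_ext = np.array([[0.0, 0.2, 0.4], [0.2, 0.0, 0.0], [0.4, 0.0, 0.0]])
    P = np.array([[0.9, 0.1, 0.5]])   # predicted scores for one image
    Y = np.array([[1.0, 0.0, 0.0]])   # observed (possibly incomplete) labels
    print(unified_laplacian_penalty(P, S_int, S_ext))
    print(total_loss(P, Y, S_int, S_ext))
```

For a symmetric similarity matrix, p^T L p equals half the sum over label pairs of S_ij (p_i - p_j)^2, so the penalty grows when similar labels receive very different scores. That is the mechanism by which a Laplacian term can pull the score of a missing label toward the scores of related labels that are present.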
Author keywords
multi-label image annotation
regularization
missing labels
Description
This work was partly supported by JSPS KAKENHI Grant Number 16K00239.
Language
English
Resource type Journal article
Publisher
The Institute of Electronics, Information and Communication Engineers
電子情報通信学会
Date of issue 2020-10-01
Rights
Copyright © 2020 The Institute of Electronics, Information and Communication Engineers
Publication type Version of Record (publisher's version, including early release)
Access rights Open access
Source identifiers
[ISSN] 0916-8532
[ISSN] 1745-1361
[DOI] 10.1587/transinf.2019EDP7318
[DOI] https://doi.org/10.1587/transinf.2019EDP7318