• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Chen, Mingqin (Chen, Mingqin.) [1] | Wang, Yilei (Wang, Yilei.) [2] (Scholars:王一蕾) | Chen, Shan (Chen, Shan.) [3] | Wu, Yingjie (Wu, Yingjie.) [4]

Indexed by:

EI Scopus

Abstract:

The multi-object counting in visual question answering (VQA) is still a challenging problem. Existing VQA models mainly adopt object detection network to extract image features and combine soft attention mechanism to further increase the model accuracy. However, repeated counting of the same object may occur when the object detection network extracts image features. In addition, the sum of attention weights of all objects calculated by soft attention mechanism is 1, which leads to the constant quantity information of objects being 1. We propose a new counting attention mechanism based on classification confidence. The main idea is to calculate the initial attention with sigmoid function and similarity with the object location generated by object detection network; we introduce classification confidence to calculate a more accurate similarity and solve the problem that the quantity information under existing soft attention mechanism is always 1. The experiment compares the proposed counting attention mechanism with the baseline model and the related work under the VQA v2 dataset. The results show that the counting attention mechanism improves the counting accuracy by 6.4% compared with the baseline model and surpasses most VQA models. © 2019 IEEE.

Keyword:

Big data Classification (of information) Cloud computing Feature extraction Object detection Object recognition Social networking (online)

Community:

  • [ 1 ] [Chen, Mingqin]College of Mathematcis and Computer Science, Fuzhou University, Fuzhou, China
  • [ 2 ] [Wang, Yilei]College of Mathematcis and Computer Science, Fuzhou University, Fuzhou, China
  • [ 3 ] [Chen, Shan]School of Electrical Engineering, Chongqing University, Chongqing, China
  • [ 4 ] [Wu, Yingjie]College of Mathematcis and Computer Science, Fuzhou University, Fuzhou, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

  • Mixed word embedding method based on knowledge graph augment for text classification

    2019,17th IEEE International Conference on Parallel and Distributed Processing with Applications, 9th IEEE International Conference on Big Data and Cloud Computing, 9th IEEE International Conference on Sustainable Computing and Communications, 12th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SustainCom/SocialCom 2019

  • User-scene-based recommendation of app service

    2019,17th IEEE International Conference on Parallel and Distributed Processing with Applications, 9th IEEE International Conference on Big Data and Cloud Computing, 9th IEEE International Conference on Sustainable Computing and Communications, 12th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SustainCom/SocialCom 2019

  • Adaptively extracting structured data from web pages

    2019,17th IEEE International Conference on Parallel and Distributed Processing with Applications, 9th IEEE International Conference on Big Data and Cloud Computing, 9th IEEE International Conference on Sustainable Computing and Communications, 12th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SustainCom/SocialCom 2019

  • Copyright protection application based on blockchain technology

    2019,17th IEEE International Conference on Parallel and Distributed Processing with Applications, 9th IEEE International Conference on Big Data and Cloud Computing, 9th IEEE International Conference on Sustainable Computing and Communications, 12th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SustainCom/SocialCom 2019

Source :

Year: 2019

Page: 1173-1179

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 3

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Online/Total:82/10005941
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1