• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Deng, Xiongwen (Deng, Xiongwen.) [1] | Tang, Haoyu (Tang, Haoyu.) [2] | Jiang, Han (Jiang, Han.) [3] | Zheng, Qinghai (Zheng, Qinghai.) [4] (Scholars:郑清海) | Zhu, Jihua (Zhu, Jihua.) [5]

Indexed by:

EI

Abstract:

Zero-shot Natural Language Video Localization (NLVL) aims to automatically generate moments and corresponding pseudo queries from raw videos for the training of the localization model without any manual annotations. Existing approaches typically produce pseudo queries as simple words, which overlook the complexity of queries in real-world scenarios. Considering the powerful text modeling capabilities of large language models (LLMs), leveraging LLMs to generate complete queries that are closer to human descriptions is a potential solution. However, directly integrating LLMs into existing approaches introduces several issues, including insensitivity, isolation, and lack of regulation, which prevent the full exploitation of LLMs to enhance zero-shot NLVL performance. To address these issues, we propose BTDP, an innovative framework for Boundary-aware Temporal Dynamic Pseudo-supervision pairs generation. Our method contains two crucial operations: 1) Boundary Segmentation that identifies both visual boundaries and semantic boundaries to generate the atomic segments and activity descriptions, tackling the issue of insensitivity. 2) Context Aggregation that employs the LLMs with a self-evaluation process to aggregate and summarize global video information for optimized pseudo moment-query pairs, tackling the issue of isolation and lack of regulation. Comprehensive experimental results on the Charades-STA and ActivityNet Captions datasets demonstrate the effectiveness of our BTDP method. © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Keyword:

Semantic Segmentation

Community:

  • [ 1 ] [Deng, Xiongwen]School of Software Engineering, Xi’an Jiaotong University, China
  • [ 2 ] [Deng, Xiongwen]School of Software, Shandong University, China
  • [ 3 ] [Tang, Haoyu]School of Software, Shandong University, China
  • [ 4 ] [Jiang, Han]School of Software Engineering, Xi’an Jiaotong University, China
  • [ 5 ] [Zheng, Qinghai]College of Computer and Data Science, Fuzhou University, China
  • [ 6 ] [Zhu, Jihua]School of Software Engineering, Xi’an Jiaotong University, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

ISSN: 2159-5399

Year: 2025

Issue: 3

Volume: 39

Page: 2717-2725

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Online/Total:108/10105586
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1