Query:
Scholar name: 柯逍 (Ke, Xiao)
Abstract :
Fine-grained visual classification (FGVC) is a highly challenging task that aims to learn subtle differences between visually similar objects. Most existing methods for FGVC rely on deep convolutional neural networks to mine local fine-grained features, which neglect the learning of relationships between global and local semantics. Moreover, the feature encoding stage inevitably constructs complex feature representations, leading to overfitting to specific feature patterns, which is not beneficial for fine-grained visual classification. To address these issues, we propose a Transformer-based FGVC model, called the Multi-Granularity Interaction and Feature Recombination Network (MGIFR-Net), which consists of three modules. Firstly, a self-attention guided localization module is designed to locate and amplify discriminative local regions, enabling sufficient learning of local detail information. Secondly, to enhance the perception of multi-granularity semantic interaction information, we construct a multi-granularity feature interaction learning module to jointly learn local and global feature representations. Finally, a dynamic feature recombination enhancement method is proposed, which explores diverse feature pattern combinations while retaining invariant features, effectively alleviating the overfitting problem caused by complex feature representations. Our method achieves state-of-the-art performance on four benchmark FGVC datasets (CUB-200-2011, Stanford Cars, FGVC-Aircraft, and NABirds), and experimental results demonstrate the superiority of our method on different visual classification benchmarks.
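As a rough illustration of the attention-guided localization idea described in this abstract, the sketch below crops and enlarges the image region receiving the highest [CLS]-to-patch attention in a ViT. The function names and the simple attention-averaging rule are assumptions made for illustration only, not the authors' implementation.

```python
# Hypothetical sketch: locate a discriminative region from ViT attention and zoom in.
import torch
import torch.nn.functional as F

def attn_rollout(attn_layers):
    """Fuse per-layer [CLS]->patch attention maps by simple averaging (illustrative)."""
    # attn_layers: list of (heads, tokens, tokens) attention tensors
    cls_to_patch = torch.stack([a.mean(0)[0, 1:] for a in attn_layers])  # (L, N)
    return cls_to_patch.mean(0)                                          # (N,)

def locate_and_zoom(image, attn_layers, patch_grid=14, out_size=448):
    """Crop the image around the highest-attention patches and upsample the crop."""
    scores = attn_rollout(attn_layers).reshape(patch_grid, patch_grid)
    mask = scores > scores.mean()                 # rough discriminative mask
    ys, xs = mask.nonzero(as_tuple=True)
    h, w = image.shape[-2:]
    y0, y1 = ys.min().item() * h // patch_grid, (ys.max().item() + 1) * h // patch_grid
    x0, x1 = xs.min().item() * w // patch_grid, (xs.max().item() + 1) * w // patch_grid
    crop = image[..., y0:y1, x0:x1]
    return F.interpolate(crop, size=(out_size, out_size), mode="bilinear", align_corners=False)

# Toy usage with random tensors standing in for a ViT's attention maps
image = torch.rand(1, 3, 448, 448)
attn_layers = [torch.rand(12, 197, 197) for _ in range(12)]
print(locate_and_zoom(image, attn_layers).shape)  # torch.Size([1, 3, 448, 448])
```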
Keyword :
Feature recombination; Fine-grained visual classification; Multi-granularity feature interaction; Vision transformer
Cite:
GB/T 7714 | Ke, Xiao , Cai, Yuhang , Chen, Baitao et al. Multi-granularity interaction and feature recombination network for fine-grained visual classification [J]. | PATTERN RECOGNITION , 2025 , 166 . |
MLA | Ke, Xiao et al. "Multi-granularity interaction and feature recombination network for fine-grained visual classification" . | PATTERN RECOGNITION 166 (2025) . |
APA | Ke, Xiao , Cai, Yuhang , Chen, Baitao , Liu, Hao , Guo, Wenzhong . Multi-granularity interaction and feature recombination network for fine-grained visual classification . | PATTERN RECOGNITION , 2025 , 166 . |
Abstract :
Accurate polyp segmentation is crucial for early diagnosis and treatment of colorectal cancer. This is a challenging task for three main reasons: (i) the problem of model overfitting and weak generalization due to the multi-center distribution of data; (ii) the problem of inter-class ambiguity caused by motion blur and overexposure to endoscopic light; and (iii) the problem of intra-class inconsistency caused by the variety of morphologies and sizes of the same type of polyps. To address these challenges, we propose a new high-precision polyp segmentation framework, MEFA-Net, which consists of three modules: the plug-and-play Mask Enhancement Module (MEG), the Separable Path Attention Enhancement Module (SPAE), and the Dynamic Global Attention Pool Module (DGAP). Specifically, the MEG module regionally masks the high-energy regions of the environment and polyps, which guides the model to rely on only a small amount of information to distinguish between polyp and background features, preventing the model from overfitting to environmental information and improving its robustness. At the same time, this module can effectively counteract the 'dark corner phenomenon' in the dataset and further improve the generalization performance of the model. Next, the SPAE module effectively alleviates the inter-class ambiguity problem by strengthening the feature expression. Then, the DGAP module solves the intra-class inconsistency problem by extracting invariance of scale, shape and position. Finally, we propose a new evaluation metric, MultiColoScore, for comprehensively evaluating the segmentation performance of the model on five datasets from different domains. We evaluated the new method quantitatively and qualitatively on five datasets using four metrics. Experimental results show that MEFA-Net significantly improves the accuracy of polyp segmentation and outperforms current state-of-the-art algorithms. Code is available at https://github.com/847001315/MEFA-Net.
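A minimal sketch of a mask-style augmentation in the spirit of the mask-enhancement idea described above: the brightest (high-energy) pixels are suppressed so the model cannot lean on exposure or environment cues. The function name and the brightness-quantile rule are assumptions for illustration, not the authors' code.

```python
# Illustrative high-energy masking augmentation for endoscopic images.
import torch

def mask_high_energy(img, keep_ratio=0.7):
    """Zero out the brightest (1 - keep_ratio) fraction of pixels per image."""
    # img: (B, C, H, W) in [0, 1]
    energy = img.mean(dim=1, keepdim=True)                         # per-pixel brightness
    thresh = torch.quantile(energy.flatten(1), keep_ratio, dim=1)  # per-image threshold, (B,)
    mask = (energy <= thresh.view(-1, 1, 1, 1)).float()
    return img * mask

batch = torch.rand(2, 3, 256, 256)
masked = mask_high_energy(batch)
print(masked.shape, (masked == 0).float().mean().item())  # ~30% of pixels suppressed
```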
Keyword :
Endoscopy; Image coding; Image segmentation; Risk assessment
Cite:
GB/T 7714 | Ke, Xiao , Chen, Guanhong , Liu, Hao et al. MEFA-Net: A mask enhanced feature aggregation network for polyp segmentation [J]. | Computers in Biology and Medicine , 2025 , 186 . |
MLA | Ke, Xiao et al. "MEFA-Net: A mask enhanced feature aggregation network for polyp segmentation" . | Computers in Biology and Medicine 186 (2025) . |
APA | Ke, Xiao , Chen, Guanhong , Liu, Hao , Guo, Wenzhong . MEFA-Net: A mask enhanced feature aggregation network for polyp segmentation . | Computers in Biology and Medicine , 2025 , 186 . |
Abstract :
3D anomaly detection aims to solve the problem that image anomaly detection is greatly affected by lighting conditions. As commercial confidentiality and personal privacy become increasingly paramount, access to training samples is often restricted. To address these challenges, we propose a zero-shot 3D anomaly detection method. Unlike previous CLIP-based methods, the proposed method does not require any prompt and is capable of detecting anomalies on the depth modality. Furthermore, we also propose a pre-trained structural rerouting strategy, which modifies the transformer without retraining or fine-tuning for the anomaly detection task. Most importantly, this paper proposes an online voter mechanism that registers voters and performs majority voter scoring in a one-stage, zero-start and growth-oriented manner, enabling direct anomaly detection on unlabeled test sets. Finally, we also propose a confirmatory judge credibility assessment mechanism, which provides an efficient adaptation for possible few-shot conditions. Results on datasets such as MVTec3D-AD demonstrate that the proposed method can achieve superior zero-shot 3D anomaly detection performance, indicating its pioneering contributions within the pertinent domain.
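The sketch below only illustrates the general flavor of an online, zero-start, growing "voter" memory: features from already-seen test samples are registered as voters, and each new sample is scored by its distance to its nearest voters. The class name, the k-NN distance scoring, and the unconditional registration step are assumptions for illustration and are not the paper's exact mechanism.

```python
# Hypothetical online voter bank for label-free anomaly scoring on a test stream.
import torch

class OnlineVoterBank:
    def __init__(self, k=3):
        self.voters = None  # (M, D) registered feature vectors, grows over time
        self.k = k

    def score(self, feats):
        """feats: (N, D) patch features of one sample -> scalar anomaly score."""
        if self.voters is None:
            return torch.tensor(0.0)                    # nothing to compare against yet
        d = torch.cdist(feats, self.voters)             # (N, M) pairwise distances
        k = min(self.k, self.voters.shape[0])
        knn = d.topk(k, dim=1, largest=False).values    # distance to k nearest voters
        return knn.mean(dim=1).max()                    # worst patch drives the score

    def register(self, feats):
        self.voters = feats if self.voters is None else torch.cat([self.voters, feats])

bank = OnlineVoterBank()
for _ in range(5):
    sample = torch.randn(64, 128)   # stand-in for depth/point-cloud patch features
    print(float(bank.score(sample)))
    bank.register(sample)
```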
Keyword :
Anomaly detection; Multimodal; Online voter mechanism; Pretrained model; Zero-shot
Cite:
GB/T 7714 | Zheng, Wukun , Ke, Xiao , Guo, Wenzhong . Zero-shot 3D anomaly detection via online voter mechanism [J]. | NEURAL NETWORKS , 2025 , 187 . |
MLA | Zheng, Wukun et al. "Zero-shot 3D anomaly detection via online voter mechanism" . | NEURAL NETWORKS 187 (2025) . |
APA | Zheng, Wukun , Ke, Xiao , Guo, Wenzhong . Zero-shot 3D anomaly detection via online voter mechanism . | NEURAL NETWORKS , 2025 , 187 . |
Abstract :
Action quality assessment (AQA) is a challenging vision task that requires discerning and quantifying subtle differences in actions from the same class. While recent research has made strides in creating fine-grained annotations for more precise analysis, existing methods primarily focus on coarse action segmentation, leading to limited identification of discriminative action frames. To address this issue, we propose a Vision-Language Action Knowledge Learning approach for action quality assessment, along with a multi-grained alignment framework to understand different levels of action knowledge. In our framework, prior knowledge, such as specialized terminology, is embedded into video-level, stage-level, and frame-level representations via CLIP. We further propose a new semantic-aware collaborative attention module to prevent confusing interactions and preserve textual knowledge in cross-modal and cross-semantic spaces. Specifically, we leverage the powerful cross-modal knowledge of CLIP to embed textual semantics into image features, which then guide action spatial-temporal representations. Our approach can be plugged into existing AQA methods, with or without frame-wise annotations. Extensive experiments and ablation studies show that our approach achieves state-of-the-art results on four public short- and long-term AQA benchmarks: FineDiving, MTL-AQA, JIGSAWS, and Fis-V.
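A hedged sketch of the general text-guides-vision pattern described above: text embeddings (e.g., action terminology encoded by CLIP's text encoder) attend over per-frame features so that language semantics steer the spatio-temporal representation. The module name and the residual cross-attention design are illustrative assumptions, not the paper's semantic-aware collaborative attention.

```python
# Illustrative cross-modal guidance: text features as keys/values over frame queries.
import torch
import torch.nn as nn

class TextGuidedAttention(nn.Module):
    def __init__(self, dim=512, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, frame_feats, text_feats):
        # frame_feats: (B, T, D) per-frame video features
        # text_feats:  (B, K, D) embeddings of K action-related phrases
        guided, _ = self.attn(query=frame_feats, key=text_feats, value=text_feats)
        return self.norm(frame_feats + guided)   # residual keeps the visual stream intact

module = TextGuidedAttention()
frames = torch.randn(2, 96, 512)   # stand-in for CLIP image features over 96 frames
phrases = torch.randn(2, 5, 512)   # stand-in for CLIP text features of 5 phrases
print(module(frames, phrases).shape)  # torch.Size([2, 96, 512])
```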
Keyword :
Action quality assessment; Semantic-aware learning; Vision-language pre-training
Cite:
GB/T 7714 | Xu, Huangbiao , Ke, Xiao , Li, Yuezhou et al. Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment [J]. | COMPUTER VISION - ECCV 2024, PT XLII , 2025 , 15100 : 423-440 . |
MLA | Xu, Huangbiao et al. "Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment" . | COMPUTER VISION - ECCV 2024, PT XLII 15100 (2025) : 423-440 . |
APA | Xu, Huangbiao , Ke, Xiao , Li, Yuezhou , Xu, Rui , Wu, Huanqi , Lin, Xiaofeng et al. Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment . | COMPUTER VISION - ECCV 2024, PT XLII , 2025 , 15100 , 423-440 . |
Abstract :
Enhancing the quality of underwater images is of great significance to the development of underwater operations. Existing underwater image enhancement methods are usually trained on paired underwater and reference images; however, obtaining reference images corresponding to real underwater images is difficult in practice, whereas obtaining unpaired high-quality underwater images or on-land images is comparatively easy. In addition, existing underwater image enhancement methods struggle to handle diverse distortion types simultaneously. To avoid the dependence on paired training data, further reduce the difficulty of obtaining training data, and cope with the diverse distortion types of underwater images, this paper proposes an unpaired underwater image enhancement method based on a Frequency-Decomposed Generative Adversarial Network (FD-GAN), and on this basis designs a high/low-frequency dual-branch generator to reconstruct high-quality enhanced underwater images. Specifically, a feature-level wavelet transform is introduced to separate features into low-frequency and high-frequency parts, which are processed separately within a cycle-consistency generative adversarial network. The low-frequency branch adopts an encoder-decoder structure combined with a low-frequency attention mechanism to enhance image color and brightness, while the high-frequency branch uses parallel high-frequency attention mechanisms to enhance each high-frequency component, thereby restoring image details. Experimental results on multiple standard underwater image datasets show that, whether using unpaired high-quality underwater images alone or additionally introducing some on-land images, the proposed method can effectively generate high-quality enhanced underwater images, and its effectiveness and generalization are superior to current mainstream underwater image enhancement methods.
Keyword :
Wavelet transform; Underwater image enhancement; Attention mechanism; Generative adversarial network; High/low-frequency dual-branch generator
Cite:
GB/T 7714 | 牛玉贞 , 张凌昕 , 兰杰 et al. 基于分频式生成对抗网络的非成对水下图像增强 [J]. | 电子学报 , 2025 . |
MLA | 牛玉贞 et al. "基于分频式生成对抗网络的非成对水下图像增强" . | 电子学报 (2025) . |
APA | 牛玉贞 , 张凌昕 , 兰杰 , 许瑞 , 柯逍 . 基于分频式生成对抗网络的非成对水下图像增强 . | 电子学报 , 2025 . |
Abstract :
Enhancing the quality of underwater images is crucial for advancements in the fields of underwater exploration and underwater rescue. Existing underwater image enhancement methods typically rely on paired underwater images and reference images for training. However, obtaining corresponding reference images for underwater images is challenging in practice; in contrast, acquiring unpaired high-quality underwater images or images captured on land is relatively more straightforward. Furthermore, existing techniques for underwater image enhancement often struggle to address a variety of distortion types simultaneously. To avoid the reliance on paired training data, reduce the difficulty of acquiring training data, and effectively handle diverse types of underwater image distortions, in this paper we propose a novel unpaired underwater image enhancement method based on the frequency-decomposed generative adversarial network (FD-GAN). We design a dual-branch generator based on high and low frequencies to reconstruct high-quality underwater images. Specifically, a feature-level wavelet transform is introduced to separate the features into low-frequency and high-frequency parts. The separated features are then processed by a cycle-consistent generative adversarial network, so as to simultaneously enhance the color and luminance in the low-frequency component and the details in the high-frequency part. More specifically, the low-frequency branch employs an encoder-decoder structure with a low-frequency attention mechanism to enhance the color and brightness of the image. The high-frequency branch utilizes parallel high-frequency attention mechanisms to enhance the various high-frequency components, thereby restoring image details. Experimental results on multiple datasets show that the proposed method, trained with unpaired high-quality underwater images either alone or together with on-land images, can effectively generate high-quality enhanced underwater images, and that it is superior to state-of-the-art underwater image enhancement methods in terms of effectiveness and generalization.
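As a minimal illustration of the feature-level wavelet split that the dual-branch generator described above operates on, the sketch below performs a single-level Haar decomposition of a feature map into one low-frequency band (LL) and three high-frequency bands (LH, HL, HH). This is a generic Haar DWT, not the paper's implementation.

```python
# Feature-level Haar wavelet split into low- and high-frequency bands (illustrative).
import torch

def haar_dwt(feat):
    """feat: (B, C, H, W) with even H, W -> (LL, LH, HL, HH), each (B, C, H/2, W/2)."""
    a = feat[..., 0::2, 0::2]
    b = feat[..., 0::2, 1::2]
    c = feat[..., 1::2, 0::2]
    d = feat[..., 1::2, 1::2]
    ll = (a + b + c + d) / 2   # low-frequency content -> color/brightness branch
    lh = (-a - b + c + d) / 2  # horizontal details
    hl = (-a + b - c + d) / 2  # vertical details
    hh = (a - b - c + d) / 2   # diagonal details -> detail-restoration branch
    return ll, lh, hl, hh

x = torch.randn(1, 64, 128, 128)
ll, lh, hl, hh = haar_dwt(x)
print(ll.shape)  # torch.Size([1, 64, 64, 64])
```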
Keyword :
Color image processing; Image coding; Image compression; Image enhancement; Photointerpretation; Underwater photography; Wavelet decomposition
Cite:
GB/T 7714 | Niu, Yu-Zhen , Zhang, Ling-Xin , Lan, Jie et al. FD-GAN: Frequency-Decomposed Generative Adversarial Network for Unpaired Underwater Image Enhancement [J]. | Acta Electronica Sinica , 2025 , 53 (2) : 527-544 . |
MLA | Niu, Yu-Zhen et al. "FD-GAN: Frequency-Decomposed Generative Adversarial Network for Unpaired Underwater Image Enhancement" . | Acta Electronica Sinica 53 . 2 (2025) : 527-544 . |
APA | Niu, Yu-Zhen , Zhang, Ling-Xin , Lan, Jie , Xu, Rui , Ke, Xiao . FD-GAN: Frequency-Decomposed Generative Adversarial Network for Unpaired Underwater Image Enhancement . | Acta Electronica Sinica , 2025 , 53 (2) , 527-544 . |
Abstract :
The fair and objective assessment of performances and competitions is a common pursuit and challenge in human society. The application of computer vision technology offers hope for this purpose, but it still faces obstacles such as occlusion and motion blur. To address these hindrances, our DanceFix framework introduces a bidirectional spatial-temporal context optical flow correction (BOFC) method. This approach leverages the consistency and complementarity of motion information between two modalities: optical flow, which excels at pixel capture, and lightweight skeleton data. It enables the extraction of pixel-level motion changes and the correction of abnormal skeleton data. Furthermore, we propose a part-level dance dataset (Dancer Parts) and part-level motion feature extraction based on task decoupling (PETD), which decouples complex whole-body part tracking into fine-grained limb-level motion extraction, enhancing the confidence of temporal information and the accuracy of correction for abnormal data. Finally, we present the DNV dataset, which simulates fully neat group dance scenes and provides reliable labels and validation methods for the newly introduced group dance neatness assessment (GDNA). To the best of our knowledge, this is the first work to develop quantitative criteria for assessing limb and joint neatness in group dance. We conduct experiments on DNV and the video-based public JHMDB dataset. Our method effectively corrects abnormal skeleton points, can be flexibly embedded into existing pose estimation algorithms, and improves their accuracy.
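The sketch below shows only the generic idea of repairing unreliable keypoints with optical flow: joints from the previous frame are propagated along the flow field and substituted where the current detection has low confidence. Function and variable names, and the confidence-threshold rule, are hypothetical and not the BOFC method itself.

```python
# Illustrative flow-based correction of low-confidence skeleton keypoints.
import numpy as np

def correct_keypoints(prev_kps, cur_kps, cur_conf, flow, conf_thresh=0.3):
    """prev_kps, cur_kps: (J, 2) in (x, y); cur_conf: (J,); flow: (H, W, 2)."""
    corrected = cur_kps.copy()
    h, w = flow.shape[:2]
    for j, (x, y) in enumerate(prev_kps):
        if cur_conf[j] >= conf_thresh:
            continue                               # keep trusted detections
        xi, yi = int(np.clip(x, 0, w - 1)), int(np.clip(y, 0, h - 1))
        dx, dy = flow[yi, xi]                      # flow at the previous joint location
        corrected[j] = [x + dx, y + dy]            # propagate the joint along the flow
    return corrected

flow = np.random.randn(240, 320, 2).astype(np.float32)
prev_kps = np.random.rand(17, 2) * [320, 240]
cur_kps = np.random.rand(17, 2) * [320, 240]
cur_conf = np.random.rand(17)
print(correct_keypoints(prev_kps, cur_kps, cur_conf, flow).shape)  # (17, 2)
```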
Cite:
GB/T 7714 | Xu, Huangbiao , Ke, Xiao , Wu, Huanqi et al. DanceFix: An Exploration in Group Dance Neatness Assessment Through Fixing Abnormal Challenges of Human Pose [C] . 2025 : 8869-8877 . |
MLA | Xu, Huangbiao et al. "DanceFix: An Exploration in Group Dance Neatness Assessment Through Fixing Abnormal Challenges of Human Pose" . (2025) : 8869-8877 . |
APA | Xu, Huangbiao , Ke, Xiao , Wu, Huanqi , Xu, Rui , Li, Yuezhou , Xu, Peirong et al. DanceFix: An Exploration in Group Dance Neatness Assessment Through Fixing Abnormal Challenges of Human Pose . (2025) : 8869-8877 . |
Abstract :
Image hiding aims to conceal one or more secret images within a cover image of the same resolution. Due to strict capacity requirements, image hiding is commonly called large-capacity steganography. In this paper, we propose StegFormer, a novel autoencoder-based image-hiding model. StegFormer can conceal one or multiple secret images within a cover image of the same resolution while preserving the high visual quality of the stego image. In addition, to mitigate the limitations of current steganographic models in real-world scenarios, we propose a normalizing training strategy and a restrict loss to improve the reliability of steganographic models under realistic conditions. Furthermore, we propose an efficient steganographic capacity expansion method to increase the capacity of steganography and enhance the efficiency of secret communication. Through this approach, we can increase the relative payload of StegFormer to 96 bits per pixel without any training strategy modifications. Experiments demonstrate that our StegFormer outperforms existing state-of-the-art (SOTA) models. In the case of single-image steganography, there is an improvement of more than 3 dB and 5 dB in PSNR for secret/recovery image pairs and cover/stego image pairs, respectively.
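A toy sketch of the generic autoencoder image-hiding setup the abstract describes: a hiding network maps a cover image plus N secret images to a stego image, and a reveal network recovers the secrets. With 8-bit RGB secrets, each hidden image corresponds to 24 bits per pixel, so four hidden images give the 96 bpp relative payload mentioned above. The tiny convolutional networks here are placeholders, not StegFormer.

```python
# Placeholder hide/reveal networks for autoencoder-based image hiding.
import torch
import torch.nn as nn

class TinyHider(nn.Module):
    def __init__(self, n_secrets=4):
        super().__init__()
        in_ch = 3 * (1 + n_secrets)                      # cover + N secrets, concatenated
        self.net = nn.Sequential(nn.Conv2d(in_ch, 64, 3, padding=1), nn.ReLU(),
                                 nn.Conv2d(64, 3, 3, padding=1), nn.Sigmoid())

    def forward(self, cover, secrets):
        return self.net(torch.cat([cover] + secrets, dim=1))  # stego image

class TinyRevealer(nn.Module):
    def __init__(self, n_secrets=4):
        super().__init__()
        self.n_secrets = n_secrets
        self.net = nn.Sequential(nn.Conv2d(3, 64, 3, padding=1), nn.ReLU(),
                                 nn.Conv2d(64, 3 * n_secrets, 3, padding=1), nn.Sigmoid())

    def forward(self, stego):
        return self.net(stego).chunk(self.n_secrets, dim=1)   # recovered secrets

cover = torch.rand(1, 3, 256, 256)
secrets = [torch.rand(1, 3, 256, 256) for _ in range(4)]
stego = TinyHider()(cover, secrets)
recovered = TinyRevealer()(stego)
print(stego.shape, len(recovered))  # torch.Size([1, 3, 256, 256]) 4
```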
Keyword :
Artificial intelligence; Image enhancement; Learning systems; Steganography
Cite:
GB/T 7714 | Ke, Xiao , Wu, Huanqi , Guo, Wenzhong . StegFormer: Rebuilding the Glory of Autoencoder-Based Steganography [C] . 2024 : 2723-2731 . |
MLA | Ke, Xiao et al. "StegFormer: Rebuilding the Glory of Autoencoder-Based Steganography" . (2024) : 2723-2731 . |
APA | Ke, Xiao , Wu, Huanqi , Guo, Wenzhong . StegFormer: Rebuilding the Glory of Autoencoder-Based Steganography . (2024) : 2723-2731 . |
Abstract :
Spatio-temporal action detection relies on learning both the spatial and temporal information of videos. Current state-of-the-art CNN-based action detectors adopt 2D CNN or 3D CNN architectures and have achieved remarkable results; however, due to the complexity of these network structures and the need to perceive spatio-temporal information, such methods typically run in a non-real-time, offline manner. The main challenges of spatio-temporal action detection lie in designing an efficient detection network architecture and effectively perceiving and fusing spatio-temporal features. Considering these issues, this paper proposes a real-time action detection method based on spatio-temporal cross-perception. The method first enhances temporal information by shuffling the frame order of the input video. Since a 2D or 3D backbone alone cannot effectively model spatio-temporal features, a multi-branch feature extraction network based on spatio-temporal cross-perception is proposed. To address the limited descriptive power of single-scale spatio-temporal features, a multi-scale attention network is proposed to learn long-term temporal dependencies and spatial context. For fusing features from the two different sources, temporal and spatial, a new motion-saliency enhanced fusion strategy is proposed, which encodes and cross-maps spatio-temporal information to guide the fusion between temporal and spatial features and highlight more discriminative spatio-temporal representations. Finally, action tubes are linked online from the frame-level detection results. The proposed method achieves 84.71% and 78.4% accuracy on the two spatio-temporal action datasets UCF101-24 and JHMDB-21, respectively, outperforming existing state-of-the-art methods while running at 73 frames per second. In addition, to address the high inter-class similarity and easily confused hard samples in the JHMDB-21 dataset, this paper further proposes a key-frame optical-flow action detection method based on action representations, which avoids generating redundant optical flow and further improves action detection accuracy.
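The following sketch illustrates only the broad idea of fusing a temporal (clip-level) feature map with a spatial (key-frame) feature map through a motion-saliency gate, loosely in the spirit of the cross-perception fusion described above. The module name and the sigmoid-gated mixing rule are illustrative assumptions, not the paper's fusion strategy.

```python
# Illustrative motion-gated fusion of temporal (3D branch) and spatial (2D branch) features.
import torch
import torch.nn as nn

class MotionGatedFusion(nn.Module):
    def __init__(self, channels=256):
        super().__init__()
        self.gate = nn.Sequential(nn.Conv2d(channels, channels, 1), nn.Sigmoid())

    def forward(self, temporal_feat, spatial_feat):
        # temporal_feat, spatial_feat: (B, C, H, W) from the 3D and 2D branches
        saliency = self.gate(temporal_feat)                       # where motion is salient
        return saliency * temporal_feat + (1 - saliency) * spatial_feat

fusion = MotionGatedFusion()
t_feat = torch.randn(2, 256, 14, 14)
s_feat = torch.randn(2, 256, 14, 14)
print(fusion(t_feat, s_feat).shape)  # torch.Size([2, 256, 14, 14])
```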
Keyword :
Multi-scale attention; Real-time action detection; Spatio-temporal cross-perception
Cite:
GB/T 7714 | 柯逍 , 缪欣 , 郭文忠 . 基于时空交叉感知的实时动作检测方法 [J]. | 电子学报 , 2024 . |
MLA | 柯逍 et al. "基于时空交叉感知的实时动作检测方法" . | 电子学报 (2024) . |
APA | 柯逍 , 缪欣 , 郭文忠 . 基于时空交叉感知的实时动作检测方法 . | 电子学报 , 2024 . |
Abstract :
The rising popularity of light field imaging underscores the pivotal role of image quality in user experience. However, evaluating the quality of light field images presents significant challenges owing to their high-dimensional nature. Current quality assessment methods for light field images predominantly rely on machine learning or statistical analysis, often overlooking the interdependence among pixels. To overcome this limitation, we propose an innovative approach that employs a universal backbone network and introduces a dual-task framework for feature extraction. Specifically, we integrate a staged "primary-secondary" hierarchical evaluation mode into the universal backbone network, enabling accurate quality score inference while preserving the intrinsic information of the original image. Our proposed approach reduces inference time by over 75% compared to existing methods, while simultaneously achieving state-of-the-art results on the evaluation metrics. By harnessing the efficiency of neural networks, our framework offers an effective solution for the quality assessment of light field images, providing superior accuracy and speed compared to current methodologies.
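A hedged sketch of a generic dual-task "primary-secondary" setup on a shared backbone: one head regresses the quality score while an auxiliary head reconstructs the input, so the shared features retain the original image information. The backbone, the reconstruction auxiliary task, and all names are assumptions for illustration, not the paper's exact design.

```python
# Illustrative dual-task quality assessment model with a shared backbone.
import torch
import torch.nn as nn

class DualTaskIQA(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
                                      nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU())
        self.quality_head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                          nn.Linear(64, 1))                           # primary task
        self.recon_head = nn.Sequential(nn.Conv2d(64, 3, 3, padding=1),
                                        nn.Upsample(scale_factor=4, mode="bilinear"))  # secondary task

    def forward(self, x):
        feat = self.backbone(x)
        return self.quality_head(feat), self.recon_head(feat)

model = DualTaskIQA()
sai = torch.rand(2, 3, 128, 128)   # e.g., a sub-aperture view of the light field
score, recon = model(sai)
print(score.shape, recon.shape)     # torch.Size([2, 1]) torch.Size([2, 3, 128, 128])
```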
Keyword :
Deep learning; Image quality assessment; Light field images; Multitasking mode
Cite:
GB/T 7714 | Guo, Wenzhong , Wang, Hanling , Ke, Xiao . Splitting the backbone: A novel hierarchical method for assessing light field image quality [J]. | OPTICS AND LASERS IN ENGINEERING , 2024 , 178 . |
MLA | Guo, Wenzhong et al. "Splitting the backbone: A novel hierarchical method for assessing light field image quality" . | OPTICS AND LASERS IN ENGINEERING 178 (2024) . |
APA | Guo, Wenzhong , Wang, Hanling , Ke, Xiao . Splitting the backbone: A novel hierarchical method for assessing light field image quality . | OPTICS AND LASERS IN ENGINEERING , 2024 , 178 . |