Query:
学者姓名:傅仰耿
Refining:
Year
Type
Indexed by
Source
Complex
Former Name
Co-
Language
Clean All
Abstract :
Graph representation learning is a crucial area in machine learning, with widespread applications in social networks, recommendation systems, and traffic flow prediction. Recently, Graph Transformers have emerged as powerful tools for this purpose, garnering significant attention. In this work, we observe a fundamental issue of previous Graph Transformers that they overlook the scale-related information gap and often employ an identical attention computation method for different-scale node interactions, leading to suboptimality of model performance. To address this, we propose a Multi-Scale Attention Graph Transformer (MSA-GT) that enables each node to conduct adaptive interactions conditioned on different scales from both local and global perspectives. Specifically, MSA-GT guides several attention mechanisms to focus on individual scales and then perform customized combinations via an attention-based fusion module, thereby obtaining much more semantically fine-grained node representations. Despite the potential of the above design, we still observe over- fitting to some extent, which is atypical challenge for training Graph Transformers. We propose two additional technical components to prevent over-fitting and improve the performance further. We first introduce a path- based pruning strategy to reduce ineffective attention interactions, facilitating more accurate relevant node selection. Additionally, we propose a Heterophilous Curriculum Augmentation (HCA) module, which gradually increases the training difficulty, forming a weak-to-strong regularization schema and therefore enhancing the model's generalization ability step-by-step. Extensive experiments show that our method outperforms many state-of-the-art methods on eight public graph benchmarks, proving its effectiveness.
Keyword :
Curriculum learning Curriculum learning Graph Transformer Graph Transformer Multi-scale attention Multi-scale attention Node classification Node classification Representation learning Representation learning
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | Zhuang, Jianzhi , Li, Jin , Shi, Chenjunhao et al. Enhanced Graph Transformer: Multi-scale attention with Heterophilous Curriculum Augmentation [J]. | KNOWLEDGE-BASED SYSTEMS , 2025 , 309 . |
MLA | Zhuang, Jianzhi et al. "Enhanced Graph Transformer: Multi-scale attention with Heterophilous Curriculum Augmentation" . | KNOWLEDGE-BASED SYSTEMS 309 (2025) . |
APA | Zhuang, Jianzhi , Li, Jin , Shi, Chenjunhao , Lin, Xinyi , Fu, Yang-Geng . Enhanced Graph Transformer: Multi-scale attention with Heterophilous Curriculum Augmentation . | KNOWLEDGE-BASED SYSTEMS , 2025 , 309 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
针对异构图神经网络模型依赖元路径和复杂聚合操作导致元路径受限与高成本的不足,提出一种基于注意力融合机制和拓扑关系挖掘的异构图神经网络模型(FTHGNN).该模型首先使用一种轻量级的注意力融合机制,融合全局关系信息和局部节点信息,以较低的时空开销实现更有效的消息聚合;接着使用一种无需先验知识的拓扑关系挖掘方法替代元路径方法,挖掘图上的高阶邻居关系,并引入对比学习捕获图上的高阶语义信息;最后,在4个广泛使用的现实世界异构图数据集上进行的充分实验,验证了 FTHGNN简单而高效,在分类预测准确率上超越了绝大多数现有模型.
Keyword :
图神经网络 图神经网络 对比学习 对比学习 异构图 异构图 注意力机制 注意力机制
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | 陈金杰 , 王一蕾 , 傅仰耿 . 注意力融合机制和拓扑关系挖掘的异构图神经网络 [J]. | 福州大学学报(自然科学版) , 2025 , 53 (1) : 1-9 . |
MLA | 陈金杰 et al. "注意力融合机制和拓扑关系挖掘的异构图神经网络" . | 福州大学学报(自然科学版) 53 . 1 (2025) : 1-9 . |
APA | 陈金杰 , 王一蕾 , 傅仰耿 . 注意力融合机制和拓扑关系挖掘的异构图神经网络 . | 福州大学学报(自然科学版) , 2025 , 53 (1) , 1-9 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
为解决中文小样本命名实体识别(NER)任务所面临的问题和挑战,提出了一种面向中文小样本NER的BERT优化方法,该方法包含两方面的优化:首先,针对训练样本数量不足限制了预训练语言模型BERT的语义感知能力的问题,提出了 Pro-ConBERT,一种基于提示学习与对比学习的BERT预训练策略.在提示学习阶段,设计掩码填充模板来训练BERT预测出每个标记对应的中文标签词.在对比学习阶段,利用引导模板训练BERT学习每个标记和标签词之间的相似性与差异性.其次,针对中文缺乏明确的词边界所带来的复杂性和挑战性,修改BERT模型的第一层Transformer结构,并设计了一种带有混合权重引导器的特征融合模块,将词典信息集成到BERT底层中.最后,实验结果验证了所提方法在中文小样本NER任务中的有效性与优越性.该方法结合BERT和条件随机场(CRF)结构,在4个采样的中文NER数据集上取得了最好的性能.特别是在Weibo数据集的3个小样本场景下,模型的F1值分别达到了 63.78%、66.27%、70.90%,与其他方法相比,平均F1值分别提高了16.28%、14.30%、11.20%.此外,将ProConBERT应用到多个基于BERT的中文NER模型中能进一步提升实体识别的性能.
Keyword :
BERT模型 BERT模型 中文小样本命名实体识别 中文小样本命名实体识别 对比学习 对比学习 提示学习 提示学习 特征融合 特征融合 预训练 预训练
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | 杨三和 , 赖沛超 , 傅仰耿 et al. 面向中文小样本命名实体识别的BERT优化方法 [J]. | 小型微型计算机系统 , 2025 , 46 (3) : 602-611 . |
MLA | 杨三和 et al. "面向中文小样本命名实体识别的BERT优化方法" . | 小型微型计算机系统 46 . 3 (2025) : 602-611 . |
APA | 杨三和 , 赖沛超 , 傅仰耿 , 王一蕾 , 叶飞扬 , 张林 . 面向中文小样本命名实体识别的BERT优化方法 . | 小型微型计算机系统 , 2025 , 46 (3) , 602-611 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
图神经网络已经成功应用于各种与图相关的任务中.以有监督的方式训练一个图神经网络需要大量标签,而现实世界中受到成本制约难以获取大量标签,因此在小样本学习或半监督学习场景的标签就更为稀少.为了克服这个问题,许多方法通过标签传播的方法来估计标签,但通常会受到图上连接性和同质性假设的限制,容易生成带有噪声的伪标签.为了解决这些限制,本文提出了一个名为图超球面原型网络的新方法GHPN,专注于半监督小样本节点分类.为了减轻图结构对预测结果的影响,GHPN在超球面表示空间中建模类别表示,通过类级别表示在语义空间中传播标签信息.此外,为了利用未标记节点的监督信息,本文设计了一个基于原型网络预测结果的负学习框架,用于补充监督信号,调整各类别原型之间的距离.在5个真实世界的数据集上进行的实验表明,该方法与10个最先进的方法相比能够有效提高性能,在4个数据集上能取得平均排名最佳结果.
Keyword :
半监督学习 半监督学习 原型网络 原型网络 图表示学习 图表示学习 小样本学习 小样本学习 负学习 负学习
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | 徐祖豪 , 陈鑫龙 , 李进 et al. GHPN:面向半监督小样本节点分类的图超球面原型网络 [J]. | 小型微型计算机系统 , 2025 , 46 (3) : 542-551 . |
MLA | 徐祖豪 et al. "GHPN:面向半监督小样本节点分类的图超球面原型网络" . | 小型微型计算机系统 46 . 3 (2025) : 542-551 . |
APA | 徐祖豪 , 陈鑫龙 , 李进 , 黄益颂 , 傅仰耿 . GHPN:面向半监督小样本节点分类的图超球面原型网络 . | 小型微型计算机系统 , 2025 , 46 (3) , 542-551 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
The Belief Rule -Based (BRB) system faces the rule combination explosion issue, making it challenging to construct the rule base efficiently. The Extended Belief Rule -Based (EBRB) system offers a solution to this problem by using data -driven methods. However, using EBRB system requires the traversal of the entire rule base, which can be time-consuming and result in the activation of many irrelevant rules, leading to an incorrect decision. Existing search optimization methods can somewhat solve this issue, but they have limitations. Moreover, the calculation of the rule activation weight only considers the similarity between input data and a single rule, ignoring the influence of the rule linkage. To address these problems, we propose a new EBRB system based on the K -Nearest Neighbor graph index (Graph-EBRB). We introduce the Hierarchical Navigable Small World (HNSW) algorithm to create the K -Nearest Neighbor graph index of the EBRB system. This index allows us to efficiently search and activate a set of key rules. We also propose a new activation weight calculation method based on the Graph Convolution Neural Network (GCN), and we optimize the system performance using a parameter learning strategy. We conduct a comprehensive experiment on 14 commonly used public data sets, and the results show that Graph-EBRB system significantly improves the reasoning efficiency and accuracy of the EBRB system. Finally, we apply the Graph-EBRB system to tree disease identification and achieve excellent classification performance, identifying over 90% of the diseased trees on the complete dataset.
Keyword :
Extended belief rule-based system Extended belief rule-based system Graph convolution neural network Graph convolution neural network Hierarchical navigable small world graph Hierarchical navigable small world graph K-Nearest Neighbor graph K-Nearest Neighbor graph
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | Fu, Yang-Geng , Lin, Xin-Yi , Fang, Geng-Chao et al. A novel extended rule-based system based on K-Nearest Neighbor graph [J]. | INFORMATION SCIENCES , 2024 , 662 . |
MLA | Fu, Yang-Geng et al. "A novel extended rule-based system based on K-Nearest Neighbor graph" . | INFORMATION SCIENCES 662 (2024) . |
APA | Fu, Yang-Geng , Lin, Xin-Yi , Fang, Geng-Chao , Li, Jin , Cai, Hong-Yi , Gong, Xiao-Ting et al. A novel extended rule-based system based on K-Nearest Neighbor graph . | INFORMATION SCIENCES , 2024 , 662 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
Anomaly detection in time series data is crucial for many fields such as healthcare, meteorology, and industrial fault detection. However, traditional unsupervised time series anomaly detection methods suffer from biased anomaly measurement under contaminated training data. Most of existing methods employ hard strategies for contamination calibration by assigning pseudo -label to training data. These hard strategies rely on threshold selection and result in suboptimal performance. To address this problem, in this paper, we propose a novel unsupervised anomaly detection framework for contaminated time series (NegCo), which builds an effective soft contamination calibration strategy by exploiting the observed negative correlation between semantic representation and anomaly detection inherent within the autoencoder framework. We innovatively redefine anomaly detection in data contamination scenarios as an optimization problem rooted in this negative correlation. To model this negative correlation, we introduce a dual construct: morphological similarity captures semantic distinctions relevant to normality, while reconstruction consistency quantifies deviations indicative of anomalies. Firstly, the morphological similarity is effectively measured based on the representative normal samples generated from the center of the learned Gaussian distribution. Then, an anomaly measurement calibration loss function is designed based on negative correlation between morphological similarity and reconstruction consistency, to calibrate the biased anomaly measurement caused by contaminated samples. Extensive experiments on various time series datasets show that the proposed NegCo outperforms stateof-the-art baselines, achieving an improvement of 6.2% to 26.8% in Area Under the Receiver Operating Characteristics (AUROC) scores, particularly in scenarios with heavily contaminated training data.
Keyword :
Anomaly detection Anomaly detection Data contamination Data contamination Negative correlation Negative correlation Time series Time series
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | Lin, Xiaohui , Li, Zuoyong , Fan, Haoyi et al. Exploiting negative correlation for unsupervised anomaly detection in contaminated time series [J]. | EXPERT SYSTEMS WITH APPLICATIONS , 2024 , 249 . |
MLA | Lin, Xiaohui et al. "Exploiting negative correlation for unsupervised anomaly detection in contaminated time series" . | EXPERT SYSTEMS WITH APPLICATIONS 249 (2024) . |
APA | Lin, Xiaohui , Li, Zuoyong , Fan, Haoyi , Fu, Yanggeng , Chen, Xinwei . Exploiting negative correlation for unsupervised anomaly detection in contaminated time series . | EXPERT SYSTEMS WITH APPLICATIONS , 2024 , 249 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
Graph Neural Networks (GNNs) have demonstrated great potential in achieving outstanding performance in various graph -related tasks, e.g., graph classification and link prediction. However, most of them suffer from the following issue: shallow networks capture very limited knowledge. Prior works design deep GNNs with more layers to solve the issue, which however introduces a new challenge, i.e., the infamous oversmoothness. Graph representation over emphasizes node features but only considers the static graph structure with a uniform weight are the key reasons for the over -smoothness issue. To alleviate the issue, this paper proposes a Dynamic Weighting Strategy (DWS) for addressing over -smoothness. We first employ Fuzzy CMeans (FCM) to cluster all nodes into several groups and get each node's fuzzy assignment, based on which a novel metric function is devised for dynamically adjusting the aggregation weights. This dynamic weighting strategy not only enables the intra-cluster interactions, but also inter -cluster aggregations, which well addresses undifferentiated aggregation caused by uniform weights. Based on DWS, we further design a Structure Augmentation (SA) step for addressing the issue of underutilizing the graph structure, where some potentially meaningful connections (i.e., edges) are added to the original graph structure via a parallelable KNN algorithm. In general, the optimized Dynamic Weighting Strategy with Structure Augmentation (DWSSA) alleviates over -smoothness by reducing noisy aggregations and utilizing topological knowledge. Extensive experiments on eleven homophilous or heterophilous graph benchmarks demonstrate the effectiveness of our proposed method DWSSA in alleviating over -smoothness and enhancing deep GNNs performance.
Keyword :
Clustering Clustering Deep graph neural networks Deep graph neural networks Node classification Node classification Over-smoothness Over-smoothness Structure augmentation Structure augmentation
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | Zhang, Qirong , Li, Jin , Ye, Qingqing et al. DWSSA: Alleviating over-smoothness for deep Graph Neural Networks [J]. | NEURAL NETWORKS , 2024 , 174 . |
MLA | Zhang, Qirong et al. "DWSSA: Alleviating over-smoothness for deep Graph Neural Networks" . | NEURAL NETWORKS 174 (2024) . |
APA | Zhang, Qirong , Li, Jin , Ye, Qingqing , Lin, Yuxi , Chen, Xinlong , Fu, Yang-Geng . DWSSA: Alleviating over-smoothness for deep Graph Neural Networks . | NEURAL NETWORKS , 2024 , 174 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
Despite Graph neural networks’ significant performance gain over many classic techniques in various graph-related downstream tasks, their successes are restricted in shallow models due to over-smoothness and the difficulties of optimizations among many other issues. In this paper, to alleviate the over-smoothing issue, we propose a soft graph normalization method to preserve the diversities of node embeddings and prevent indiscrimination due to possible over-closeness. Combined with residual connections, we analyze the reason why the method can effectively capture the knowledge in both input graph structures and node features even with deep networks. Additionally, inspired by Curriculum Learning that learns easy examples before the hard ones, we propose a novel label-smoothing-based learning framework to enhance the optimization of deep GNNs, which iteratively smooths labels in an auxiliary graph and constructs many gradual non-smooth tasks for extracting increasingly complex knowledge and gradually discriminating nodes from coarse to fine. The method arguably reduces the risk of overfitting and generalizes better results. Finally, extensive experiments are carried out to demonstrate the effectiveness and potential of the proposed model and learning framework through comparison with twelve existing baselines including the state-of-the-art methods on twelve real-world node classification benchmarks. Copyright © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
Keyword :
Curricula Curricula Graphic methods Graphic methods Graph neural networks Graph neural networks Graph structures Graph structures Graph theory Graph theory Iterative methods Iterative methods Learning systems Learning systems
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | Li, Jin , Zhang, Qirong , Xu, Shuling et al. Curriculum-Enhanced Residual Soft An-Isotropic Normalization for Over-Smoothness in Deep GNNs [C] . 2024 : 13528-13536 . |
MLA | Li, Jin et al. "Curriculum-Enhanced Residual Soft An-Isotropic Normalization for Over-Smoothness in Deep GNNs" . (2024) : 13528-13536 . |
APA | Li, Jin , Zhang, Qirong , Xu, Shuling , Chen, Xinlong , Guo, Longkun , Fu, Yang-Geng . Curriculum-Enhanced Residual Soft An-Isotropic Normalization for Over-Smoothness in Deep GNNs . (2024) : 13528-13536 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
Graph self-supervised learning is an effective technique for learning common knowledge from unlabeled graph data through pretext tasks. To capture the interrelationships between nodes and their essential roles globally, existing methods use clustering labels as self-supervised signals. However, in some cases, these methods may introduce noise, leading to over-fitting of the model and a reduction in performance. To address these issues, a novel framework for G raph S elf-Supervised S upervised C urriculum L earning based on clustering label smoothing called GSSCL has been proposed. GSSCL clusters knowledge in an easy-to-difficult manner, reducing the heavy dependence on the reliability of clustering and improving the generalizability of the model. Moreover, the Silhouette Coefficient is employed to evaluate the clustering confident scores for all nodes. Some nodes are selected based on high confident scores to perform self-supervised learning. To account for the possibility of complex heterophilous information in graphs (e.g., noisy links), clustering pseudo-label smoothing is performed on K-nearest neighbor graphs built upon the similarities between node features instead of the original graph structures. The obtained multi-scale knowledge is then applied to curriculum learning. Finally, comprehensive experiments conducted across diverse public graph benchmarks demonstrate the superior performance of the proposed framework. It exhibits comparable results to state-of-the-art methods across semi-supervised node classification and clustering tasks.
Keyword :
Clustering label smoothing Clustering label smoothing Curriculum learning Curriculum learning Graph neural network Graph neural network Graph self-supervised learning Graph self-supervised learning Selection enhancement Selection enhancement
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | Fu, Yang-Geng , Chen, Xinlong , Xu, Shuling et al. GSSCL: A framework for Graph Self-Supervised Curriculum Learning based on clustering label smoothing [J]. | NEURAL NETWORKS , 2024 , 181 . |
MLA | Fu, Yang-Geng et al. "GSSCL: A framework for Graph Self-Supervised Curriculum Learning based on clustering label smoothing" . | NEURAL NETWORKS 181 (2024) . |
APA | Fu, Yang-Geng , Chen, Xinlong , Xu, Shuling , Li, Jin , Yao, Xi , Huang, Ziyang et al. GSSCL: A framework for Graph Self-Supervised Curriculum Learning based on clustering label smoothing . | NEURAL NETWORKS , 2024 , 181 . |
Export to | NoteExpress RIS BibTex |
Version :
Abstract :
Although significant progress has been made in Chinese Named Entity Recognition (NER) methods based on deep learning, their performance often falls short in few-shot scenarios. Feature enhancement is considered a promising approach to address the issue of Chinese few-shot NER. However, traditional feature fusion methods tend to lead to the loss of important information and the integration of irrelevant information. Despite the benefits of incorporating BERT for improving entity recognition, its performance is limited when training data is insufficient. To tackle these challenges, this paper proposes a Feature Enhancement-based approach for Chinese Few-shot NER called FE-CFNER. FE-CFNER designs a double cross neural network to minimize information loss through the interaction of feature cross twice. Additionally, adaptive weights and a top-k mechanism are introduced to sparsify attention distributions, enabling the model to prioritize important information related to entities while excluding irrelevant information. To further enhance the quality of BERT embeddings, FE-CFNER employs a contrastive template for contrastive learning pre-training of BERT, enhancing BERT's semantic understanding capability. We evaluate the proposed method on four sampled Chinese NER datasets: Weibo, Resume, Taobao, and Youku. Experimental results validate the effectiveness and superiority of FE-CFNER in Chinese few-shot NER tasks.
Keyword :
Chinese Named Entity Recognition Chinese Named Entity Recognition Contrastive learning pre-training Contrastive learning pre-training Feature enhancement Feature enhancement Few-shot learning Few-shot learning
Cite:
Copy from the list or Export to your reference management。
GB/T 7714 | Yang, Sanhe , Lai, Peichao , Fang, Ruixiong et al. FE-CFNER: Feature Enhancement-based approach for Chinese Few-shot Named Entity Recognition [J]. | COMPUTER SPEECH AND LANGUAGE , 2024 , 90 . |
MLA | Yang, Sanhe et al. "FE-CFNER: Feature Enhancement-based approach for Chinese Few-shot Named Entity Recognition" . | COMPUTER SPEECH AND LANGUAGE 90 (2024) . |
APA | Yang, Sanhe , Lai, Peichao , Fang, Ruixiong , Fu, Yanggeng , Ye, Feiyang , Wang, Yilei . FE-CFNER: Feature Enhancement-based approach for Chinese Few-shot Named Entity Recognition . | COMPUTER SPEECH AND LANGUAGE , 2024 , 90 . |
Export to | NoteExpress RIS BibTex |
Version :
Export
Results: |
Selected to |
Format: |