• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Yu, J. (Yu, J..) [1] | Zhao, H. (Zhao, H..) [2] | Wu, S. (Wu, S..) [3] | Xi, Y. (Xi, Y..) [4]

Indexed by:

Scopus

Abstract:

[Objective] To reduce semantic deviation and loss caused by language differences and text feature selection in the text classification process while preserving more textual information. [Methods] Firstly, we used a pre-trained SBERT model for sentence representation. Secondly, we calculated the sentence similarity between texts with a Sentence Vectors Rotator’s Similarity method. We also applied sentence weighting within texts to form vectors. Finally, we combined machine learning and neural network classification methods to achieve cross-lingual text classification. [Results] We conducted experiments on multiple cross-lingual text datasets in Chinese, English, Russian, French, and Spanish, and the multilingual public dataset Reuters demonstrated that the proposed method significantly improved accuracy compared to existing methods. Additionally, recall, precision, and F1 scores also showed enhancements. [Limitations] The study does not consider the impact of sentence position within the text on its weight. [Conclusions] The proposed model could reduce semantic deviation and loss, thus improving the performance of cross-lingual text classification. © 2025 Chinese Academy of Sciences. All rights reserved.

Keyword:

Cross-Lingual Sentence Vectors Weighting Text Classification Text Similarity

Community:

  • [ 1 ] [Yu J.]School of Economics and Management, Fuzhou University, Fuzhou, 350108, China
  • [ 2 ] [Zhao H.]School of Economics and Management, Fuzhou University, Fuzhou, 350108, China
  • [ 3 ] [Wu S.]School of Economics and Management, Fuzhou University, Fuzhou, 350108, China
  • [ 4 ] [Xi Y.]School of Business Administration, South China University of Technology, Guangzhou, 510641, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Data Analysis and Knowledge Discovery

ISSN: 2096-3467

Year: 2025

Issue: 2

Volume: 9

Page: 39-47

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Affiliated Colleges:

Online/Total:103/10025737
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1