Home>Results

  • Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

[期刊论文]

VoxT-GNN: A 3D object detection approach from point cloud based on voxel-level transformer and graph neural network

Share
Edit Delete 报错

author:

Zheng, Qiangwen (Zheng, Qiangwen.) [1] | Wu, Sheng (Wu, Sheng.) [2] | Wei, Jinghui (Wei, Jinghui.) [3]

Indexed by:

EI

Abstract:

Recently, a variety of LiDAR-based methods for the 3D detection of single-class objects, large objects, or in straightforward scenes have exhibited competitive performance. However, their detection performance in complex scenarios with multi - sized and multi - class objects is limited. We observe that the core problem leading to this phenomenon is the insufficient feature learning of small objects in point clouds, making it difficult to obtain more discriminative features. To address this challenge, we propose a 3D object detection framework based on point clouds that takes into account the detection of small objects, termed VoxT-GNN. The framework comprises two core components: a Voxel-Level Transformer (VoxelFormer) for local feature learning and a Graph Neural Network Feed-Forward Network (GnnFFN) for global feature learning. By embedding GnnFFN as an intermediate layer between the encoder and decoder of VoxelFormer, we achieve flexible scaling of the global receptive field while maximally preserving the original point cloud structure. This design enables effective adaptation to objects of varying sizes and categories, providing a viable solution for detection applications across diverse scenarios. Extensive experiments on KITTI and Waymo Open Dataset (WOD) demonstrate the strong competitiveness of our method, particularly showing significant improvements in small object detection. Notably, our approach achieves the second-highest mAP of 65.44% across three categories (car, pedestrian, and cyclist) on KITTI benchmark. The source code is available at https://github.com/yujianxinnian/VoxT-GNN. © 2025 The Author(s)

Keyword:

Graph neural networks

Community:

  • [ 1 ] [Zheng, Qiangwen]The College of Computer and Data Science, Fuzhou University, China
  • [ 2 ] [Wu, Sheng]The Academy of Digital China (Fujian), Fuzhou University, China
  • [ 3 ] [Wei, Jinghui]The College of Computer and Data Science, Fuzhou University, China

Reprint 's Address:

  • [wu, sheng]the academy of digital china (fujian), fuzhou university, china

Show more details

Related Article:

Source :

Information Processing and Management

ISSN: 0306-4573

Year: 2025

Issue: 4

Volume: 62

7 . 4 0 0

JCR@2023

CAS Journal Grade:1

Cited Count:

WoS CC Cited Count:

30 Days PV: 2

Affiliated Colleges:

Online/Total:138/10201702
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1