Hub Selection for Hub Based Clustering Algorithms - Details

author：

He, Zhenfeng [1] (Scholars：何振峰)

Indexed by：

CPCI-S

Abstract：

Hubs are the data instances that appear frequently in nearest-neighbour lists. Because the hubs of a high-dimensional dataset lie close to the centres of clusters or sub-clusters, hub based clustering algorithms select some of them as cluster centres. During hub selection, these algorithms rank data instances by their global hubness scores, computed from their nearest-neighbour lists, and ignore cluster-related information such as the instances' labels and the clustering quality of the instances and of their related instances. As a result, some suitable hubs may be neglected. To solve this problem, we suggest evaluating instances by their relative hubness scores. Moreover, we propose a weighted relative hubness score computed from nearest-neighbour lists and silhouette information. In addition, we suggest selecting the instance with the highest silhouette information when two or more instances tie for first place. Experimental results on real and synthetic datasets suggest that both the relative hubness score and the weighted relative hubness score can improve hub based clustering, and that the weighted relative hubness score often performs better.
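The global hubness score mentioned in the abstract is commonly taken to be the k-occurrence count, that is, the number of times an instance appears in the k-nearest-neighbour lists of the other instances. The following is a minimal sketch of that count only, assuming Euclidean distance; the function name `k_occurrence` and the toy points are made up for illustration. It does not implement the relative or weighted relative hubness scores, which the abstract does not define.

```python
from math import dist


def k_occurrence(points, k):
    """Count how often each point appears in the k-NN lists of the other points."""
    n = len(points)
    counts = [0] * n
    for i in range(n):
        # Rank every other point by Euclidean distance to point i
        # (ties, if any, are broken by index so the result is deterministic).
        others = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: (dist(points[i], points[j]), j),
        )
        # Each of the k closest points gains one occurrence.
        for j in others[:k]:
            counts[j] += 1
    return counts


# Toy data (hypothetical): one central point surrounded by four others.
points = [(0.0, 0.0), (2.0, 0.0), (-2.0, 0.5), (0.3, 2.0), (0.0, -2.2)]
counts = k_occurrence(points, k=2)

# The instance with the largest count is the strongest hub candidate.
hub = max(range(len(points)), key=counts.__getitem__)
print(counts, hub)
```

In this toy set the central point appears in the neighbour list of every other point, so it receives the highest count, which matches the intuition that hubs sit near cluster centres.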

Keyword：

Clustering; High-dimensional data; Hubness; Silhouette Information

Community：

[ 1 ] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350002, Peoples R China

Reprint Address：

何振峰
[He, Zhenfeng]Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350002, Peoples R China

Email：

hezhenfeng@fzu.edu.cn


Related Keywords：

Constrained Silhouette Based Evolutionary K-Means
2013, Chinese Intelligent Automation Conference (CIAC)
Spanning Tree Based Attribute Clustering
2009, 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2009
SGAE: Stacked Graph Autoencoder for Deep Clustering
2023, IEEE TRANSACTIONS ON BIG DATA
A controlled data envelopment analysis clustering approach based on individual perspective
2024, INFORMATION SCIENCES

Source ：

2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD)

Year： 2014

Page： 479-484

Language： English

Cited Count：

WoS CC Cited Count： 2

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

College of Computer and Data Science, College of Software (data not explicitly attributed to a department within this college)
