
Authors:

Hu, J. [1] | Zeng, C. [2] | Wang, Z. [3] | Zhang, J. [4] | Guo, K. [5] | Xu, H. [6] | Huang, J. [7] | Chen, K. [8]

Indexed by:

Scopus

Abstract:

Various datacenter network (DCN) load balancing schemes have been proposed in the past decade. Unfortunately, most of these solutions were designed for lossy DCNs and do not work well in Priority Flow Control (PFC) enabled lossless DCNs, primarily because the individual congestion signals they rely on, e.g., link load, queue length, Round Trip Time (RTT) and Explicit Congestion Notification (ECN), may fail to reflect hop-by-hop PFC pausing correctly or in a timely manner. This paper first reveals these problems via extensive experiments and then, based on the insights learned, presents Proteus, a PFC-aware load balancing scheme that is resilient to PFC pausing by exploring a combination of multi-level congestion signals. At its heart, Proteus leverages RTT-level signals (i.e., RTT and link utilization) to detect path status for the initial routing decision, and exploits a sub-RTT-level signal (i.e., cumulative sojourn time) to reflect instantaneous PFC pausing and make timely rerouting choices based on the idea of better-late-than-never. We have implemented Proteus in a hardware programmable switch. Our testbed experiments and large-scale simulations show that Proteus can effectively handle PFC pausing under realistic workloads, achieving up to 35%, 31%, 28% and 22% better average flow completion time (FCT), and up to 46%, 42%, 34% and 29% better $99^{th}$ percentile FCT, than CONGA, DRILL, Hermes and MP-RDMA, respectively.

Keyword:

Computer science; Datacenter; Delays; Load balancing; Load management; Load modeling; Lossless networks; Receivers; Switches; Transport protocols

Affiliations:

  • [1] [Hu J.] School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, China
  • [2] [Zeng C.] Computer Science and Engineering Department, The Hong Kong University of Science and Technology, Hong Kong, Hong Kong
  • [3] [Wang Z.] Computer Science and Engineering Department, The Hong Kong University of Science and Technology, Hong Kong, Hong Kong
  • [4] [Zhang J.] Computer Science and Engineering Department, The Hong Kong University of Science and Technology, Hong Kong, Hong Kong
  • [5] [Guo K.] Computer Science and Technology and Management Department, Fuzhou University, Fuzhou, China
  • [6] [Xu H.] Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, Hong Kong
  • [7] [Huang J.] School of Computer Science and Engineering, Central South University, Changsha, China
  • [8] [Chen K.] Computer Science and Engineering Department, The Hong Kong University of Science and Technology, Hong Kong, Hong Kong

Source:

ACM Transactions on Networking

ISSN: 1063-6692

Year: 2024

Issue: 3

Volume: 32

Page: 1-13

3.000 (JCR@2023)

CAS Journal Grade: 3

Cited Count:

SCOPUS Cited Count: 19

ESI Highly Cited Papers on the List: 0
