• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Lin, Xinquan (Lin, Xinquan.) [1] | Xu, Haobo (Xu, Haobo.) [2] | Han, Yinhe (Han, Yinhe.) [3] | Gan, Yiming (Gan, Yiming.) [4]

Indexed by:

CPCI-S Scopus

Abstract:

Deep learning models are scaling in both parameters and modalities. Multi-modal large language models are increasingly used in robotic applications, driving the need for large-scale deep learning accelerators. Multi-chiplet heterogeneous neural network accelerators are an effective solution for today's multi-modal large language models. Different types of chiplets provide diverse functionalities, enabling large data storage, high on-chip bandwidth, and significant computing capability. While single-core or multi-core NPU accelerators can be validated through simulation, there is still a lack of software-level cycle-accurate simulators for multi-chiplet NPUs. In this work, we propose HEX-SIM, a configurable multi-chiplet deep learning accelerator simulator. HEX-SIM offers designers various macro architectures and system parameters to better evaluate accelerator designs. We conduct extensive simulation experiments using HEX-SIM, demonstrating the effects of parallelism, bandwidth, buffer size, and the number of computing engines on inference latency. These insights can significantly aid users in optimizing their designs. Our project code is open-sourced and available at https://github.com/jimrelief/HEX-SIM.

Keyword:

Community:

  • [ 1 ] [Lin, Xinquan]Fuzhou Univ, Sch Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 2 ] [Xu, Haobo]Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
  • [ 3 ] [Han, Yinhe]Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
  • [ 4 ] [Gan, Yiming]Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China

Reprint 's Address:

  • [Gan, Yiming]Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China

Show more details

Version:

Related Keywords:

Related Article:

Source :

2024 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, IISWC 2024

Year: 2024

Page: 108-120

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Online/Total:48/10107029
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1