Academic
Academic
Home
Publications
Contact
Light
Dark
Automatic
Publications
Type
Conference paper
Date
2025
2024
2023
Hochen Huang
,
Shuzhang Zhong
,
Zhe Zhang
,
Shuangchen Li
,
Dimin Niu
,
Hongzhong Zheng
,
Runsheng Wang
,
Meng Li
(2025).
HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing
. In
ICCAD 2025
.
Zizhuo Fu
,
Xiaotian Guo
,
Wenxuan Zeng
,
Shuzhang Zhong
,
Yadong Zhang
,
Peiyu Chen
,
Runsheng Wang
,
Le Ye
,
Meng Li
(2025).
H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference
. In
ICCAD 2025
.
Linye Wei
,
Shuzhang Zhong
,
Songqiang Xu
,
Runsheng Wang
,
Ru Huang
,
Meng Li
(2025).
SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding
. In
DAC 2025
.
Shuzhang Zhong
,
Yanfan Sun
,
Ling Liang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
(2025).
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference
. In
DAC 2025
.
Shuzhang Zhong
,
Zebin Yang
,
Ruihao Gong
,
Runsheng Wang
,
Ru Huang
,
Meng Li
(2024).
ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
. In
ICCAD 2024
.
Tianshi Xu
,
Shuzhang Zhong
,
Wenxuan Zeng
,
Runsheng Wang
,
Meng Li
(2024).
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
. In
ICCAD 2024
.
Shuzhang Zhong
,
Ling Liang
,
Yuan Wang
,
Runsheng Wang
,
Ru Huang
,
Meng Li
(2024).
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference
. In
ICCAD 2024
.
Shuzhang Zhong
,
Meng Li
,
Yun Liang
,
Runsheng Wang
,
Ru Huang
(2023).
Memory-aware scheduling for complex wired networks with iterative graph optimization
. In
ICCAD 2023
.
Cite
×