<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Publications | Academic</title><link>https://shuzhangzhong.github.io/publication/</link><atom:link href="https://shuzhangzhong.github.io/publication/index.xml" rel="self" type="application/rss+xml"/><description>Publications</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Wed, 25 Jun 2025 00:00:00 +0000</lastBuildDate><image><url>https://shuzhangzhong.github.io/media/icon_hu0b7a4cb9992c9ac0e91bd28ffd38dd00_9727_512x512_fill_lanczos_center_3.png</url><title>Publications</title><link>https://shuzhangzhong.github.io/publication/</link></image><item><title>H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference</title><link>https://shuzhangzhong.github.io/publication/iccad-2025-h2eal/</link><pubDate>Wed, 25 Jun 2025 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/iccad-2025-h2eal/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item><item><title>HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing</title><link>https://shuzhangzhong.github.io/publication/iccad-2025-hdmoe/</link><pubDate>Wed, 25 Jun 2025 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/iccad-2025-hdmoe/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item><item><title>HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference</title><link>https://shuzhangzhong.github.io/publication/dac-2025-specasr/</link><pubDate>Tue, 25 Feb 2025 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/dac-2025-specasr/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item><item><title>SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding</title><link>https://shuzhangzhong.github.io/publication/dac-2025-hybrimoe/</link><pubDate>Tue, 25 Feb 2025 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/dac-2025-hybrimoe/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item><item><title>AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference</title><link>https://shuzhangzhong.github.io/publication/iccad-2024-adapmoe/</link><pubDate>Sun, 27 Oct 2024 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/iccad-2024-adapmoe/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item><item><title>PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization</title><link>https://shuzhangzhong.github.io/publication/iccad-2024-privquant/</link><pubDate>Sun, 27 Oct 2024 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/iccad-2024-privquant/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item><item><title>ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding</title><link>https://shuzhangzhong.github.io/publication/iccad-2024-propd/</link><pubDate>Sun, 27 Oct 2024 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/iccad-2024-propd/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item><item><title>Memory-aware scheduling for complex wired networks with iterative graph optimization</title><link>https://shuzhangzhong.github.io/publication/iccad-2023-mem/</link><pubDate>Tue, 24 Oct 2023 00:00:00 +0000</pubDate><guid>https://shuzhangzhong.github.io/publication/iccad-2023-mem/</guid><description>&lt;!--
&lt;div class="alert alert-note">
&lt;div>
Click the &lt;em>Cite&lt;/em> button above to demo the feature to enable visitors to import publication metadata into their reference management software.
&lt;/div>
&lt;/div>
&lt;div class="alert alert-note">
&lt;div>
Create your slides in Markdown - click the &lt;em>Slides&lt;/em> button to check out the example.
&lt;/div>
&lt;/div>
Add the publication's **full text** or **supplementary notes** here. You can use rich formatting such as including [code, math, and images](https://docs.hugoblox.com/content/writing-markdown-latex/). --></description></item></channel></rss>