2025 Fall
Specific Requirements
- We focus on the latest papers from SOSP and OSDI, as well as papers released on arXiv. Each time presenters select one paper from SOSP or OSDI and one from arXiv.
- The presentation follows a "1+N" format, where one person delivers the main content while supporting members assist with preparation and manage the Q&A session. These supporting members are also encouraged to contribute to the presentation.
- The discussion should provide a thorough analysis of the paperβs strengths and weaknesses, along with a comprehensive review of related work from the past three years. The presentation must be at least 45 minutes long.
Other Information
The playback video and text summary will be uploaded to bilibili and zhihu as soon as possible.
Schedule
November 4
Topic I
- π‘ [OSDI'25] Enabling Efficient GPU Communication over Multiple NICs with FuseLink
- πββοΈ Haiquan Wang, Tonghuan Xiao, Jiahui Tan
Topic II
- π‘ [arXiv] Fast-dLLM v2: Efficient Block-Diffusion LLM
- πββοΈ Xiliang Xian
October 28
- π‘ [arXiv] Shift Parallelism: Low-Latency, High-Throughput LLM Inference for Dynamic Workloads
- πββοΈ Jiaan Zhu, Qinghe Wang, Long Zhao
- π slides, πΊ video
October 21
- π‘ [arXiv] ServeGen: Workload Characterization and Generation of Large Language Model Serving in Production
- πββοΈ Zijian Dai
- π slides, πΊ video
September 29
- β¨ SOSP Rehearsal
- π‘ Mantle: Efficient Hierarchical Metadata Management for Cloud Object Storage Services
- πββοΈ Jiahao Li
September 16
- π‘ Kick-off meeting
- πββοΈ Youhui Bai, Zhihui Chen, Ouxiang Zhou and Ruibo Liu
- π slides