★ 20 conference papers ★ 3 journal papers ★ 1 book chapter ★ 5 workshop papers ★ (Last updated: May, 2024)

Conference Papers

[ICML'24] [paper] [bib]
CHAI: Clustered Head Attention for Efficient LLM Inference
S. Agarwal, B. Acun, B. Hosmer, M. Elhoushi, Y. Lee, S. Venkataraman, D. Papailiopoulos, C.-J. Wu
International Conference on Machine Learning, 2024.

[ACL'24] [paper] [bib]
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding
M. Elhoushi, A. Shrivastava, D. Liskovich, B. Hosmer, B. Wasti, L. Lai, A. Mahmoud, B. Acun, S. Agarwal, A. Roman, A. Aly, B. Chen, C.-J. Wu
Annual Meeting of the Association for Computational Linguistics, 2024.

[ISPASS'24] [paper] [bib]
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
A. Golden, S. Hsia, F. Sun, B. Acun, B. Hosmer, Y. Lee, Z. DeVito, J. Johnson, G.-Y. Wei, D. Brooks, C.-J. Wu
IEEE International Symposium on Performance Analysis of Systems and Software, 2024.

[ISCA'24] [paper] [bib]
Exploring System-Aware Parallelization for Efficient Large-Scale Machine Learning
S. Hsia, A. Golden, B. Acun, N. Ardalani, Z. DeVito, G.-Y. Wei, D. Brooks, C.-J. Wu
International Symposium on Computer Architecture, 2024.

[NeurIPS'23] [paper] [bib] [code]
DataPerf: Benchmarks for Data-Centric AI Development
M. Mazumder, C. Banbury, X. Yao, B. Karlaš, W. G. Rojas, S. Diamos, G. Diamos, L. He, D. Kiela, D. Jurado, D. Kanter, R. Mosquera, J. Ciro, L. Aroyo, B. Acun, S. Eyuboglu, A. Ghorbani, E. Goodman, T. Kane, C. R. Kirkpatrick, T.-S. Kuo, J. Mueller, T. Thrush, J. Vanschoren, M. Warren, A. Williams, S. Yeung, N. Ardalani, P. Paritosh, C. Zhang, J. Zou, C.-J. Wu, C. Coleman, A. Ng, P. Mattson, V. J. Reddi
Conference on Neural Information Processing Systems, 2023.

[ASPLOS'23] [IEEE Micro Top Picks'24 Honorable Mention] [paper] [bib]
MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation
S. Hsia, U. Gupta, B. Acun, N. Ardalani, P. Zhong, G.-Y. Wei, D. Brooks, C.-J. Wu
ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023.

[ASPLOS'23] [IEEE Micro Top Picks'24 Honorable Mention] [paper] [bib] [code]
Carbon Explorer: A Holistic Approach for Designing Carbon Aware Datacenters
B. Acun, B. Lee, F. Kazhamiaka, K. Maeng, M. Chakkaravarthy, U. Gupta, D. Brooks, C.-J. Wu
ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023.

[MLSys'22] [paper] [bib]
Sustainable AI: Environmental Implications, Challenges and Opportunities
C.-J. Wu, R. Raghavendra, U. Gupta, B. Acun, N. Ardalani, K. Maeng, G. Chang, F. Aga, J. Huang, C. Bai, M. Gschwind, A. Gupta, M. Ott, A. Melnikov, S. Candido, D. Brooks, G. Chauhan, B. Lee, H.-H. Lee, B. Akyildiz, M. Balandat, J. Spisak, R. Jain, M. Rabbat, K. Hazelwood
Conference on Machine Learning and Systems, 2022.

[ASPLOS'22] [paper] [bib]
RecShard: Statistical Feature-Based Memory Optimization for Industry-Scale Neural Recommendation
G. Sethi, B. Acun, N. Agarwal, C. Kozyrakis, C. Trippel, C.-J. Wu
ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022.

[HPCA'22] [paper] [bib]
SecNDP: Secure Near-Data Processing with Untrusted Memory
W. Xiong, L. Ke, D. Jankov, M. Kounavis, X. Wang, E. Northup, J. A. Yang, B. Acun, C.-J. Wu, P. T. P. Tang, G. E. Suh, X. Zhang, H.-H. Lee
IEEE International Symposium on High-Performance Computer Architecture, 2022.

[MLSys'21] [Outstanding Paper Award] [paper] [bib] [code]
TT-Rec: Tensor Train Compression for Deep Learning Recommendation Models
C. Yin, B. Acun, X. Liu, C.-J. Wu
Conference on Machine Learning and Systems, 2021.

[HPCA'21] [paper] [bib]
Understanding Training Efficiency of Deep Learning Recommendation Models at Scale
B. Acun, M. Murphy, X. Wang, J. Nie, C.-J. Wu, K. Hazelwood
IEEE International Symposium on High-Performance Computer Architecture, 2021.

[HPCA'19] [paper] [bib]
Power-Aware Heterogeneous Node Assembly
B. Acun, A. Buyuktosunoglu, E. K. Lee, Y. Park
IEEE International Symposium on High-Performance Computer Architecture, 2019.

[IGSC'19] [paper] [bib]
Fine-Grained Energy Efficiency Using Per-Core DVFS with an Adaptive Runtime System
B. Acun, K. Chandrasekar, L. V. Kalé
International Green and Sustainable Computing Conference, 2019.

[HiPC'17] [paper] [bib]
Support for Power Efficient Proactive Cooling Mechanisms
B. Acun, E. K. Lee, Y. Park, L. V. Kalé
International Conference on High Performance Computing, 2017.

[ICS'16] [paper] [bib]
Variation Among Processors Under Turbo Boost in HPC Systems
B. Acun, P. Miller, L. V. Kalé
International Conference on Supercomputing, 2016.

[HiPC'14] [paper] [bib]
Towards Realizing the Potential of Malleable Jobs
A. Gupta, B. Acun, O. Sarood, L. V. Kalé
International Conference on High Performance Computing, 2014.

[SC'14] [paper] [bib] [code]
Parallel Programming with Migratable Objects: Charm++ in Practice
B. Acun, A. Gupta, N. Jain, A. Langer, H. Menon, E. Mikida, X. Ni, M. Robson, Y. Sun, E. Totoni, L. Wesolowski, L. V. Kalé
Supercomputing, 2014.

[CLUSTER'13] [paper] [bib]
Thermal-Aware Automated Load Balancing for HPC Applications
H. Menon, B. Acun, S. G. De Gonzalo, O. Sarood, L. V. Kalé
IEEE International Conference on Cluster Computing, 2013.

[ISCIS'13] [paper] [bib]
Topic Tracking Using Chronological Term Ranking
B. Acun, A. Başpınar, E. Oğuz, M.İ. Saraç, F. Can
International Symposium on Computer and Information Sciences, 2013.

Preprints

[arXiv'23] [paper] [bib]
Carbon Responder: Coordinating Demand Response for the Datacenter Fleet
J. Xing, B. Acun, A. Sundarrajan, D. Brooks, M. Chakkaravarthy, N. Avila, C.-J. Wu, B. C. Lee

[arXiv'23] [paper] [bib]
Data Acquisition: A New Frontier in Data-centric AI
L. Chen, B. Acun, N. Ardalani, Y. Sun, F. Kang, H. Lyu, Y. Kwon, R. Jia, C.-J. Wu, M. Zaharia, J. Zou

Journal Papers

[Micro'21] [paper] [bib]
Datacenter-Scale Analysis and Optimization of GPU Machine Learning Workloads
L. Wesolowski, B. Acun, V. Andrei, A. Aziz, G. Dankel, C. Gregg, X. Meng, C. Meurillon, D. Sheahan, L. Tian, J. Yang, P. Yu, K. Hazelwood
IEEE Micro, 2021.

[IBM'17] [paper] [bib]
Scalable Molecular Dynamics with NAMD on the Summit System
B. Acun, D. J. Hardy, L. V. Kalé, K. Li, J. C. Phillips, J. E. Stone
IBM Journal of Research and Development, 2017.

[COMPUTER'16] [Cover Featured] [paper] [bib]
Power, Reliability, Performance: One System to Rule Them All
B. Acun, A. Langer, H. Menon, O. Sarood, E. Totoni, L. V. Kalé
IEEE Computer, Energy Efficient Computing Special Issue, 2016.

Book Chapter

[CRC Press'17] [paper] [bib]
NAMD: Scalable Molecular Dynamics Based on the Charm++ Parallel Runtime System
B. Acun, R. Buch, L. V. Kalé, J. C. Phillips
Exascale Scientific Applications: Scalability and Performance Portability, CRC Press, 2017.

Workshop Papers

[EMC2, ASPLOS'24] [paper]
Is Flash Attention Stable?
A. Golden, S. Hsia, F. Sun, B. Acun, B. Hosmer, Y. Lee, Z. DeVito, J. Johnson, G.-Y. Wei, D. Brooks, C.-J. Wu
EMC2 - Energy Efficient Machine Learning and Cognitive Computing Workshop, 2024.

[HotCarbon'22] [paper] [bib]
Carbon Dependencies in Datacenter Design and Management
B. Acun, B. Lee, F. Kazhamiaka, A. Sundarrajan, K. Maeng, M. Chakkaravarthy, D. Brooks, C.-J. Wu
Workshop on Sustainable Computer Systems Design and Implementation, 2022.

[E2SC, SC'16] [paper] [bib]
Neural Network-Based Task Scheduling with Preemptive Fan Control
B. Acun, E. K. Lee, Y. Park, L. V. Kalé
International Workshop on Energy Efficient Supercomputing at Supercomputing Conference, 2016.

[VarSys, IPDPS'16] [paper] [bib]
Mitigating Processor Variation with Dynamic Load Balancing
B. Acun, L. V. Kalé
IEEE International Workshop on Variability in Parallel and Distributed Systems at IPDPS, 2016.

[PADABS, EUROPAR'15] [paper] [bib] [code]
TraceR: A Parallel Trace Replay Tool for Studying Interconnection Networks
B. Acun, N. Jain, A. Bhatele, L. V. Kalé
Workshop on Parallel and Distributed Agent-Based Simulations at EUROPAR, 2015.
