Full List of Publications

★ 20 conference papers ★ 3 journal papers ★ 1 book chapter ★ 5 workshop papers ★ (Last updated: May, 2024)

Conference Papers

  1. [ICML'24]  [paper]  [bib]

    CHAI: Clustered Head Attention for Efficient LLM Inference

    S. Agarwal, B. Acun, B. Homer, M. Elhoushi, Y. Lee, S. Venkataraman, D. Papailiopoulos, C.-J. Wu
    International Conference on Machine Learning, 2024.

  2. [ACL'24]  [paper] [bib]

    Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding

    M. Elhoushi, A. Shrivastava, D. Liskovich, B. Hosmer, B. Wasti, L. Lai, A. Mahmoud, B. Acun, S. Agarwal, A. Roman, A. Aly, B. Chen, C.-J. Wu
    Annual Meeting of the Association for Computational Linguistics, 2024.

  3. [ISPASS'24]  [paper] [bib]

    Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

    A. Golden, S. Hsia, F. Sun, B. Acun, B. Hosmer, Y. Lee, Z. DeVito, J. Johnson, G.-Y. Wei, D. Brooks, C.-J. Wu
    IEEE International Symposium on Performance Analysis of Systems and Software, 2024.

  4. [ISCA'24]  [paper]  [bib]

    Exploring System-Aware Parallelization for Efficient Large-Scale Machine Learning

    S. Hsia, A. Golden, B. Acun, N. Ardalani, Z. DeVito, G.Y. Wei, D. Brooks, C.-J. Wu
    International Symposium on Computer Architecture, 2024.

  5. [NeurIPS'23]  [paper] [bib] [code]

    Dataperf: Benchmarks for Data-Centric AI Development

    M. Mazumder, C. Banbury, X. Yao, B. Karlaš, W. G. Rojas, S. Diamos, G. Diamos, L. He, D. Kiela, D. Jurado, D. Kanter, R. Mosquera, J. Ciro, L. Aroyo, B. Acun, S. Eyuboglu, A. Ghorbani, E. Goodman, T. Kane, C. R. Kirkpatrick, T.-S. Kuo, J. Mueller, T. Thrush, J. Vanschoren, M. Warren, A. Williams, S. Yeung, N. Ardalani, P. Paritosh, C. Zhang, J. Zou, C.-J. Wu, C. Coleman, A. Ng, P. Mattson, V. J. Reddi
    Conference on Neural Information Processing Systems, 2023.

  6. [ASPLOS'23]  [IEEE Micro Top Picks'24 Honorable Mention] [paper]  [bib]

    MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation

    S. Hsia, U. Gupta, B. Acun, N. Ardalani, P. Zhong, G.Y. Wei, D. Brooks, C.J. Wu
    ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023.

  7. [ASPLOS'23]  [IEEE Micro Top Picks'24 Honorable Mention] [paper]  [bib] [code]

    Carbon Explorer: A Holistic Approach for Designing Carbon Aware Datacenters

    B. Acun, B. Lee, F. Kazhamiaka, K. Maeng, M. Chakkaravarthy, U. Gupta, D. Brooks, C. J. Wu
    ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023.

  8. [MLSys'22]  [paper]  [bib]

    Sustainable AI: Environmental Implications, Challenges and Opportunities

    C. J. Wu, R. Raghavendra, U. Gupta, B. Acun, N. Ardalani, K. Maeng, G. Chang, F. Aga, J. Huang, C. Bai, M. Gschwind, A. Gupta, M. Ott, A. Melnikov, S. Candido, D. Brooks, G. Chauhan, B. Lee, H.-H. Lee, B. Akyildiz, M. Balandat, J. Spisak, R. Jain, M. Rabbat, K. Hazelwood
    Conference on Machine Learning and Systems, 2022.

  9. [ASPLOS'22]  [paper]  [bib]

    RecShard: statistical feature-based memory optimization for industry-scale neural recommendation

    G. SethiB. Acun, N. Agarwal, C. Kozyrakis, C. Trippel, C. J. Wu
    ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022.

  10. [HPCA'22]  [paper]  [bib]

    SecNDP: Secure Near-Data Processing with Untrusted Memory

    W. Xiong, L. Ke, D. Jankov, M. Kounavis, X. Wang, E. Northup, J. A. Yang, B. Acun, C.-J. Wu, P. T. P. Tang, G. E. Suh, X. Zhang, H.-H. Lee
    IEEE International Symposium on High-Performance Computer Architecture, 2022.

  11. [MLSys'21] [Outstanding Paper Award] [paper]  [bib] [code] 

    TT-Rec: Tensor Train Compression for Deep Learning Recommendation Models

    C. Yin, B. Acun, X. Liu, C. J. Wu
    Conference on Machine Learning and Systems, 2021.

  12. [HPCA'21]  [paper] [bib]

    Understanding Training Efficiency of Deep Learning Recommendation Models at Scale

    B. Acun, M. Murphy, X. Wang, J. Nie, C. J. Wu, K. Hazelwood
    IEEE International Symposium on High-Performance Computer Architecture, 2021.

  13. [HPCA'19]  [paper] [bib]

    Power-Aware Heterogeneous Node Assembly

    B. Acun, A Buyuktosunoglu, E. K. Lee, Y. Park
    IEEE International Symposium on High-Performance Computer Architecture, 2019.

  14. [IGSC'19]  [paper] [bib]

    Fine-Grained Energy Efficiency Using Per-Core DVFS with an Adaptive Runtime System

    B. Acun, K. Chandrasekar, L.V. Kale
    International Green and Sustainable Computing Conference, 2019.

  15. [HiPC'17]  [paper] [bib]

    Support for Power Efficient Proactive Cooling Mechanisms

    B. Acun, E. K. Lee, Y. Park, L. V. Kalé
    International Conference on High Performance Computing, 2017.

  16. [ICS'16]  [paper] [bib]

    Variation Among Processors Under Turbo Boost in HPC Systems

    B. Acun , P. Miller, L. V. Kalé
    International Conference on Supercomputing, 2016.

  17. [HiPC'14]  [paper] [bib]

    Towards Realizing the Potential of Malleable Jobs

    A. Gupta, B. Acun , O. Sarood, L. V. Kalé
    International Conference on High Performance Computing, 2014.

  18. [SC'14]  [paper] [bib] [code]

    Parallel Programming with Migratable Objects: Charm++ in Practice

    B. Acun, A. Gupta, N. Jain, A. Langer, H. Menon, E. Mikida, X. Ni, M. Robson, Y. Sun, E. Totoni, L. Wesolowski, L. V. Kalé.
    Supercomputing, 2014.

  19. [CLUSTER'13]  [paper] [bib]

    Thermal-Aware Automated Load Balancing for HPC Applications.

    H. Menon, B. Acun , SG De Gonzalo, O. Sarood, L. V. Kalé
    IEEE International Conference on Cluster Computing, 2013.

  20. [ISCIS'13]  [paper] [bib]

    Topic Tracking Using Chronological Term Ranking

    B. Acun, A. Başpınar, E. Oğuz, M.İ. Saraç, F. Can
    International Symposium on Computer and Information Sciences, 2013.

Preprints

  1. [arXiv'23]  [paper]  [bib]

    Carbon Responder: Coordinating Demand Response for the Datacenter Fleet

    J. Xing, B. Acun, A. Sundarrajan, D. Brooks, M. Chakkaravarthy, N. Avila, C.-J. Wu, B. C. Lee

  2. [arXiv'23]  [paper] [bib]

    Data Acquisition: A New Frontier in Data-centric AI

    L. Chen, B. Acun, N. Ardalani, Y. Sun, F. Kang, H. Lyu, Y. Kwon, R. Jia, C.-J. Wu, M. Zaharia, J. Zou

Journal Papers

  1. [Micro'21]  [paper] [bib]

    Datacenter-Scale Analysis and Optimization of GPU Machine Learning Workloads

    L. Wesolowski, B. Acun, V. Andrei, A. Aziz, G. Dankel, C. Gregg, X. Meng, C. Meurillon, D. Sheahan, L. Tian, J. Yang, P. Yu, K. Hazelwood
    IEEE Micro, 2021.

  2. [IBM'17]  [paper] [bib]

    Scalable molecular dynamics with NAMD on the Summit system

    B. Acun, D.J. Hardy, L.V. Kalé, K. Li, J.C. Phillips, J.E. Stone
    IBM Journal of Research and Development, 2017.

  3. [COMPUTER'16]  [Cover Featured] [paper] [bib]

    Power, Reliability, Performance: One System to Rule Them All

    B. Acun, A. Langer, H. Menon, O. Sarood, E. Totoni, and L. V. Kalé.
    IEEE Computer, Energy Efficient Computing Special Issue, 2016.

Book Chapter

  1. [CRC Press'17]  [paper] [bib]

    NAMD: Scalable Molecular Dynamics Based on the Charm++ Parallel Runtime System

    B. Acun, R. Buch, , L.V. Kalé, J. C. Phillips
    Exascale Scientific Applications: Scalability and Performance Portability, CRC Press, 2017.

Workshop Papers

  1. [EMC2, ASPLOS'24]  [paper]

    Is Flash Attention Stable?

    A. Golden, S. Hsia, F. Sun, B Acun, B. Hosmer, Y. Lee, Z. DeVito, J. Johnson, G.-Y. Wei, D. Brooks, C.-J. Wu
    EMC2 - Energy Efficient Machine Learning and Cognitive Computing Workshop, 2024.

  2. [HotCarbon'22]  [paper] [bib]

    Carbon Dependencies in Datacenter Design and Management

    B. Acun, B. Lee, F. Kazhamiaka, A. Sundarrajan, K. Maeng, M. Chakkaravarthy, D. Brooks, C.-J. Wu
    Workshop on Sustainable Computer Systems Design and Implementation, 2022.

  3. [E2SC, SC'16]  [paper] [bib]

    Neural Network-Based Task Scheduling with Preemptive Fan Control

    B. Acun, E. K. Lee, Y. Park, L. V. Kalé
    International Workshop on Energy Efficient Supercomputing at Supercomputing Conference, 2016.

  4. [VarSys, IPDPS'16]  [paper] [bib]

    Mitigating Processor Variation with Dynamic Load Balancing

    B. Acun, L. V. Kalé IEEE International Workshop on Variability in Parallel and Distributed Systems at IPDPS.

  5. [PADABS, EUROPAR'15]  [paper] [bib] [code]

    TraceR: A Parallel Trace Replay Tool for Studying Interconnection Networks

    B. Acun, N. Jain, A. Bhatele, L. V. Kalé
    Workshop on Parallel and Distributed Agent-Based Simulations at EUROPAR, 2015.