I am a computer system scientist, pursuing R&D in the area of server memory and storage systems. With more than 8 years of industrial and academic experience, I work on all aspects of systems design i.e. server architecture, software/kernel ecosystem, workload analysis, memory, storage technologies, PoC emulation.
• Research server memory, storage architectures that improve DDR4 and HBM2 performance, capacity
• Develop PoC systems to demonstrate server integration on FPGA with hardware, software implementation
• High performnace DRAM serial link interfaces to suit powermodes for datacenter applications
• Modifications to on-chip timing architecture for energy-proportional datacenter memory
• Quantify Google online datacenter applications with Kernel tools and memory traces for Google websearch
• Statistical framework that accurately detects small performance changes in websearch production clusters
• Timing enhancements to FFT core engine in MediaFLO UBM2 Modem for higher bandwidth
• Won the Roberto Padovani award
• Developed a real time debugger for the modem emulation platform
Dissertation: “Design of SoC for Network Based RFID Applications”
Academic Proficiency Medal
GPA: 9.6
GPA: 9.7
JOURNALS
[J4] M. Gao, C. Delimitrou, D. Niu, K. Malladi , H. Zheng, B. Brennan, C. Kozyrakis. “DRAF: A Low-Power
DRAM-based Reconfigurable Acceleration Fabric,” in IEEE MICRO Top Picks, Volume 37, Issue 3, pp. 70-78,
2017.
[J2] A. Boroumand, S. Ghose, B. Lucia, K. Hsieh, K. Malladi , H. Zheng, O. Mutlu. “LazyPIM: An Efficient
Cache Coherence Mechanism for Processing-in-Memory,” in IEEE Computer Architecture Letters, Volume 16,
Issue 1, pp. 46-50, 2016.
[J3] M. Gao, C. Delimitrou, D. Niu, K. Malladi , H. Zheng, B. Brennan, C. Kozyrakis. “DRAF: A Low-Power
DRAM-based Reconfigurable Acceleration Fabric,” in ACM SIGARCH Computer Architecture News, Volume
44, Issue 3, pp. 506-518, 2016.
[J1] K. Malladi , F. Nothaft, K. Periyathambi, B. Lee, C. Kozyrakis, M. Horowitz. “Towards Energy-Proportional
Datacenter Memory with Mobile DRAM,” in ACM SIGARCH Computer Architecture News, Volume 40, Issue
3, pp. 37-48, 2012.
PEER-REVIEWED CONFERENCES
[C15] S. Li, D. Niu, K. Malladi , H. Zheng. “DRISA: A DRAM-based Reconfigurable In-Situ Accelerator” in
50th IEEE/ACM International Symposium on Microarchitecture (MICRO), Boston, October 2017.
[C14] Q. Xu, K. Malladi , M. Awasthi. “Rack Level Scheduling for Containerized Workloads” in 27th IEEE
International Conference on Network, Architecture and Storage (NAS), Shenzen, China, August 2017.
[C13] K. Malladi , M. Chang, D. Niu, H. Zheng. “FlashStorageSim: Performance Modeling for SSD Architectures”
in 27th IEEE International Conference on Network, Architecture and Storage (NAS), Shenzen, China,
August 2017.
[C12] Q. Xu, M. Awasthi, K. Malladi , J. Bhimani, J. Yang, M. Annavaram. “Performance Analysis of Containerized
Applications on Local and Remote Storage” in 34th International Conference on Massive Storage
Systems and Technology (MSST), Santa Clara, May 2017.
[C11] Q. Xu, M. Awasthi, K. Malladi , J. Bhimani, J. Yang, M. Annavaram. “Docker Characterization on High
Performance SSDs” in 18th IEEE International Symposium on Performance Analysis of Systems and Software
(ISPASS), Santa Rosa, April 2017.
[C10] M. Awasthi, K. Malladi . “KOVA : A Tool for Kernel Visualization and Analysis” in 35th IEEE International
Performance Computing and Communications Conference (IPCCC), Las Vegas, December 2016.
[C9] K. Malladi , M. Awasthi, H.Zheng. “FlexDrive: A Framework to Explore NVMe Storage Solutions” in
18th IEEE International Conference on High Performance Computing and Communications (HPCC), Sydney,
Australia, December 2016.
[C8] K. Malladi , M. Awasthi, H.Zheng. “DRAMPersist: Making DRAM Systems Persistent” in 2nd ACM
International Symposium on Memory Systems (MEMSYS), Washington D.C, October 2016.
[C7] K. Malladi , U. Kong, M. Awasthi, H.Zheng. “DRAMScale: Mechanisms to Increase DRAM Capacity” in
2nd ACM International Symposium on Memory Systems (MEMSYS), Washington D.C, October 2016.
[C6] K. Malladi , M. Awasthi, H.Zheng. “Software-Defined Emulation Infrastructure for High Speed Storage”
in 9th ACM International on Systems and Storage Conference (SYSTOR), Haifa, Israel, June 2016.
[C5] M. Gao, C. Delimitrou, D. Niu, K. Malladi , H. Zheng, B. Brennan, C. Kozyrakis. “DRAF: A Low-Power
DRAM-based Reconfigurable Acceleration Fabric,” in 43rd ACM/IEEE International Symposium on Computer
Architecture (ISCA), Seoul, Korea, June 2016.
[C4] K. Malladi , M. Chang, J. Ping, H.Zheng. “FAME: A Fast and Accurate Memory Emulator for New
Memory System Architecture Exploration”, in 23rd IEEE International Symposium on Modeling, Analysis and
Simulation of Computer and Telecommunication Systems (MASCOTS), Atlanta, October 2015.
[C3] K. Malladi , I. Shaeffer, L. Gopalakrishnan, D. Lo, B.C. Lee, M. Horowitz. “Rethinking DRAM Powermodes
for Energy Proportionality”, in 45th IEEE/ACM International Symposium on Microarchitecture (MICRO), Vancouver,
December 2012.
[C2] K. Malladi , F.Nothaft, K. Periyathambi, B. Lee, C. Kozyrakis, M. Horowitz. “Towards Energy-proportional
Datacenter Memory with Mobile DRAM,” in 39th ACM/IEEE International Symposium on Computer Architecture
(ISCA), Portland, June 2012.
[C1] K. Malladi and David.V.Anderson, “Analog Implementation of SNR Based Gain Adaptation for Denoising”,
in 42nd IEEE International Symposium on Circuits and Systems (ISCAS), May 2009.
CONFERENCE PREPRINTS AND POSTERS
[P2] Q. Xu, K. Malladi , M. Awasthi. “Rack Level Scheduling for Containerized Workloads” in 8th ACM
SIGOPS Asia-Pacific Workshop on Systems (APSys), Mumbai, India, September 2017.
[P1] A. Boroumand, S. Ghose, B. Lucia, K. Hsieh, K. Malladi , H. Zheng, O. Mutlu. “LazyPIM: An Efficient
Cache Coherence Mechanism for Processing-in-Memory,” in arXiv preprint arXiv:1706.03162, June 2016.