Swagath Venkataramani

Overview

Title

Principal Research Scientist, AIU Architecture and Compilers

Location

IBM Research - Yorktown Heights, Yorktown Heights, NY, USA

Publications

Performance-driven Programming of Multi-TFLOP Deep Learning Accelerators
- - Swagath Venkataramani
  - Jungwook Choi
  - et al.
- 2019
- IISWC 2019
DeepTools: Compiler and Execution Runtime Extensions for RaPiD AI Accelerator
- - Swagath Venkataramani
  - Jungwook Choi
  - et al.
- 2019
- IEEE Micro
Dynamic Spike Bundling for Energy-Efficient Spiking Neural Networks
- - Sarada Krithivasan
  - Sanchari Sen
  - et al.
- 2019
- ISLPED 2019
BiScaled-DNN: Quantizing long-tailed datastructures with two scale factors for deep neural networks
- - Shubham Jain
  - Swagath Venkataramani
  - et al.
- 2019
- DAC 2019
SparCE: Sparsity Aware General-Purpose Core Extensions to Accelerate Deep Neural Networks
- - Sanchari Sen
  - Shubham Jain
  - et al.
- 2019
- IEEE TC
A Compiler for Deep Neural Network Accelerators to Generate Optimized Code for a Wide Range of Data Parameters from a Hand-crafted Computation Kernel
- - Eri Ogawa
  - Kazuaki Ishizaki
  - et al.
- 2019
- COOL CHIPS 2019
Data Subsetting: A Data-Centric Approach to Approximate Computing
- - Younghoon Kim
  - Swagath Venkataramani
  - et al.
- 2019
- DATE 2019
A Scalable Multi-TeraOPS Core for AI Training and Inference
- - Sunil Shukla
  - Bruce Fleischer
  - et al.
- 2018
- IEEE SSC-L
A Scalable Multi-TeraOPS Deep Learning Processor Core for AI Training and Inference
- - Bruce Fleischer
  - Sunil Shukla
  - et al.
- 2018
- VLSI Circuits 2018
DyHard-DNN: Even more DNN acceleration with dynamic hardware reconfiguration
- - Mateja Putic
  - Alper Buyuktosunoglu
  - et al.
- 2018
- DAC 2018

Patents

- 11 Nov 2024
- US
- 12141513
Method To Map Convolutional Layers Of Deep Neural Network On A Plurality Of Processing Elements With SIMD Execution Units, Private Memories, And Connected As A 2D Systolic Processor Array
- 15 Oct 2024
- GB
- 2604060
Hybrid Data-model Parallelism For Efficient Deep Learning
- 16 Sep 2024
- US
- 12094525
Multichannel Memory To Augment Local Memory
- 05 Aug 2024
- US
- 12056594
Low Precision Deep Neural Network Enabled By Compensation Instructions
- 08 Jul 2024
- CN
- ZL201980032566.3
Low Precision Deep Neural Network Enabled By Compensation Instructions
- 02 Jun 2024
- JP
- 7497946
Hybrid Data-model Parallelism For Efficient Deep Learning
- 30 Apr 2024
- TW
- I840790
Single Function To Perform Combined Matrix Multiplication And Bias Add Operations
- 21 Apr 2024
- JP
- 7477249
System-aware Selective Quantization For Performance Optimized Distributed Deep Learning
- 25 Mar 2024
- US
- 11941111
Exploiting Fine-grained Structured Weight Sparsity In Systolic Arrays
- 27 Nov 2023
- US
- 11831467
Programmable Multicast Protocol For Ring-topology Based Artificial Intelligence Systems

Top collaborators

Xiaodong Cui

Principal Research Scientist

Alberto Mannari

Software Developer

Jinwook Jung

Research Staff Member

Mori Ohara

Deputy Director, IBM Research Tokyo, Distinguished Engineer, Chief SW Engineer for Hybrid Cloud on IBM HW