PyTorch 2.1 – New Developments


PyTorch is a deep learning framework for building AI models that accelerates the path from research prototyping to production deployment. The most recent version, PyTorch 2.1, adds new features around compile, distributed, inference, export, and edge. PyTorch also supports inference optimization techniques such as memory-efficient attention, quantization, and pruning, which are expected to make popular generative AI models use less memory and run faster during inference. Benchmarking popular generative AI models with the latest techniques in PyTorch, we see speedups of up to ~8.5x for Segment Anything and ~5.6x for Llama 2. In this session we will dive deep into the new developments and techniques in PyTorch and provide recommendations on how you can accelerate your models using native PyTorch code.

Learning objectives: How to leverage PyTorch to accelerate AI models


Supriya is an Engineering Manager working on PyTorch at Meta. Her team works on architecture optimization techniques such as quantization and pruning, as well as other core components of PyTorch 2.0, enabling users to run AI models efficiently on different hardware using native PyTorch. Prior to Meta, she worked as a software engineer at Nvidia, improving its GPU architecture and accelerating AI models for inference via TensorRT. Supriya has an MS in CSE from the University of Michigan, Ann Arbor, and a bachelor's degree from BITS Pilani, India.

Open Data Science




