Selling Out Soon 20% Off | Ends In
Big Data has seen rapid advances in recent years. With some of the sharpest minds in data science presenting, learn the latest techniques and processes to analyze raw data, be able to automate data into mechanical processes and algorithms, and hear use-cases focusing on how data can be used to optimize business performance.
This focus area will cover many of the techniques for drawing conclusions and insights from raw data. You’ll learn from leading experts in the field and complete the conference with an understanding of how more efficiently and accurately analyze data by demonstrating your knowledge in SQL, Python and Data Storytelling.
Some of Our Past Data Engineering & Big Data Speakers

Gary Nakanelua
Gary Nakanelua is a professional technologist with over 17 years of experience and the author of Experiment or Expire. Gary is the Managing Director of Innovation at Blueprint, a data intelligence company based in Bellevue, WA. He’s responsible for the experimentation and creation of Blueprint’s transformative solutions and accelerators. With his diverse background, Gary brings a different perspective to problems that businesses are facing today to create quantifiable solutions driven through a high level of collaborative thought processing, strategic planning, and cannibalization.
Streamlining Your Streaming Analytics with Delta Lake & Rust(Talk)

Timo Walther
Timo Walther is a Principal Software Engineer at Confluent and a long-time member of Apache Flink’s management committee. He studied Computer Science at TU Berlin and was part of the Database Group there – the origins of Apache Flink. He worked as a software engineer at DataArtisans and led SQL team at Ververica. He was a Co-Founder of Immerok which was acquired by Confluent in 2023. In Flink, he is working on various topics in the Table & SQL ecosystem to make stream processing accessible for everyone.

Swagata Ashwani
Swagata is a Data Professional with over 6 years experience in Healthcare, Retail and Platform Integration industry. She is an avid blogger and writes about state of the art developments in the AI space. She is particularly interested in Natural Language Processing, and focuses on researching how to make NLP models work in practical setting. In her spare time, she loves to play her guitar, sip masala chai and find new spots for doing Yoga. Connect with her here – https://www.linkedin.com/in/swagata-ashwani/
Why Did My AI Do That? Decoding Decision-making in Machine Learning(Talk)

Han Wang
Han Wang is the tech lead of Lyft Machine Learning Platform, focusing on distributed computing solutions. Before joining Lyft, he worked at Microsoft, Hudson River Trading, Amazon and Quantlab. Han is the creator of the Fugue project, aiming at democratizing distributed computing and machine learning.

Elliott Cordo
Elliott is an expert in data engineering, data warehousing, information management, and technology innovation with a passion for helping transform data into powerful information. He has more than a decade of experience implementing cutting-edge, data-driven applications. He has a passion for helping organizations understand the true potential in their data by working as a leader, architect, and hands-on contributor.
Elliott has built nearly a dozen cloud-native data platforms on AWS, ranging from data warehouses and data lakes, to real-time activation platforms in companies ranging from small startups to large enterprises.

Mihir Mathur
Mihir Mathur is the lead Product Manager for Machine Learning at Lyft, where he works on building ML/AI tools that power Lyft’s automated intelligent decisions across realtime pricing, ETAs, fraud detection, safety classification etc. In the past Mihir has worked on building delightful products for millions of users at Quora, Houzz, and Thomson Reuters and spoken about his work at conferences such as MLOps World and ODSC. Mihir graduated magna cum laude from UCLA with a Bachelor’s and Master’s in Computer Science.
Powering Millions of Real-time Decisions with Distributed Model Serving(Talk)

Akash Tandon
Akash Tandon is co-founder and CTO of Looppanel where he builds software to help product teams record, store and analyze user research data. He is a co-author of Advanced Analytics with PySpark, published by O’Reilly. Previously, Akash worked as a senior data engineer at Atlan, SocialCops and RedCarpet where he built data infrastructure for enterprise, government and finance use-cases. He has also been a participant and mentor in the Google Summer of Code program with the R Project for Statistical Computing.
From Big Data to NLP insights: Getting started with PySpark and Spark NLP(Workshop)

Freddy Boulton
Freddy Boulton started his career as a data scientist for Nielsen where he built predictive models of television viewing behavior to make television ratings more accurate. This gave him a first hand-view of one of the biggest challenges faced by industry data scientists – being able to easily communicate and share machine learning models with stakeholders. He is currently solving that problem by working on Gradio, an open-source python library that lets data scientists create fully interactive demos of machine learning models with just a few lines of code.
A Practical Tutorial on Building Machine Learning Demos with Gradio(Workshop)
More talks, hands-on workshop and training sessions
See all sessionsYou Will Meet
Some of the world’s best data science speakers
The brains and authors behind today’s most popular open data science tools, topics, and languages
Hundreds of attendees focused on data science
Chief Data Scientists
Thought leaders working in data science
Data Scientists and Analysts
Software Developers
CEOs, CTOs, CIOs
Data Visualization professionals
Venture Capitalists and Investors
Startup Founders and Executives
Attendees from Healthcare, Finance, Education, Business, Intelligence, and other industries
Big data and data science innovators
Why Attend?
Several of the best minds and biggest names in data science will be presenting
Network with attendees from leading data science companies to learn how others are tackling similar problems
Gain quality training in the hottest data science topics, tools, and languages
Learn the latest in data science from industry leaders without having to make room in the budget — tickets are surprisingly inexpensive
What You'll Learn
Talks & Workshops on these topics:
Topics
Data Analytics Systems
Building Advanced Analytics and Data Science Capabilities
Analytics with Graph Representations
Data Analytics with Kubernetes and OpenShift
Distributed Analytical Database
Sentiment Analysis
Analytics: Challenges and Opportunities
Infrastructure Slowing Your Data Analytics and AI Projects
Data Analytics Use Cases
Models
BERT
XLNet
GPT-2
Transformers
Word2Vec
Deep Learning Models
RNN & LSTM
Machine Learning Models
ULMFiT
Transfer Learning
Tools
Tensorflow 2.0
Hugging Face Transformers
PyTorch
Theano
SpaCy
NLTK
AllenNLP
Stanford CoreNLP
Keras
FLAIR
ODSC EAST 2024 - April 23-25th
Register your interestODSC Newsletter
Stay current with the latest news and updates in open source data science. In addition, we’ll inform you about our many upcoming Virtual and in person events in Boston, NYC, Sao Paulo, San Francisco, and London. And keep a lookout for special discount codes, only available to our newsletter subscribers!