It’s been almost a year since ODSC West, and our opportunity to gather in San Francisco again is just around the corner. Missed out on last year’s conference? See what all the excitement is about by checking out recordings of our virtual sessions, available on demand! Solution Showcase talks and Keynote Talks are available for free, while training sessions and workshops that dive deeper into each topic are available for purchase. Check out a few of the highlights from each group below.

ODSC West Talks

Ask the Experts! ML Pros Deep-Dive into Machine Learning Techniques and MLOps

Seth Juarez | Principal Program Manager, AI Platform | Microsoft

Learn how new, innovative features in Azure Machine Learning can help you collaborate and streamline the management of thousands of models across teams. This interactive session showcased the latest capabilities in Azure Machine Learning and answered attendees’ questions.

LLMs in Data Analytics: Can They Match Human Precision?

Gerard Kostin | Director of Data Science | DataGPT

Delve into the capabilities of Large Language Models (LLMs) in data analytics and the inherent challenges they face when processing extensive datasets. You’ll take a deep dive into DataGPT’s technology stack, including its methodology for efficient data processing and its measures to ensure accuracy and consistency. You’ll also see how DataGPT integrates LLMs with advanced algorithms, with an emphasis on their collaborative roles in data analysis.

The Tangent Information Modeler, time series modeling reinvented

Philip Wauters | Customer Success Manager and Value Engineer | Tangent Works

Existing techniques for modeling time series data face limitations in scalability, agility, explainability, and accuracy. In this session, you’ll see how the Tangent Information Modeler (TIM) offers a game-changing approach with efficient and effective feature engineering based on Information Geometry. This multivariate modeling co-pilot can handle a wider range of time series use cases. The talk walks through several concrete use cases for advanced time series forecasting, anomaly detection, and root cause analysis.

From AI to GX: The Quantum Leap in Algorithmic Evolution

Jepson Taylor | Chief AI Strategist | Dataiku

This session took a deep dive into the transformative shift from generative AI to Generative Algorithms (GX) and how LLMs are poised to redefine the frontier of algorithm design. It challenged traditional notions of creativity, innovation, and human prowess, beckoning a future where AI doesn’t just use algorithms – it creates them, crafting a world of limitless potential and extraordinary complexity.

The Ethics of Digital Minds: A Baffling New Frontier

Nick Bostrom, PhD | Professor at Oxford University | Founding Director of Future of Humanity Institute

This talk delved into the complex issues that we’ll encounter when AIs transcend their status as mere tools and become morally considerable beings. This session explored the formidable theoretical (and ultimately also practical) challenges that arise from the need to develop ethical, legal, and political frameworks that can permit the peaceful and cooperative coexistence of a wide range of different morally considerable beings—including human beings, sentient nonhuman animals, and many and various kinds of AIs.

ODSC West Training Sessions and Workshops

Statistics for Data Science and Measurement

Brian Caffo, PhD | Professor | Johns Hopkins Bloomberg School of Public Health

Babak Moghadas | Post-Doctoral Fellow

Statistics and statistical inference form the core of making sense of data. In this tutorial, attendees considered measurement and inference, especially as they pertain to scientific repeatability. They also paid particular attention to the use of artificial intelligence and machine learning as methods of measurement, and to the fundamental role that inference plays, with a specific focus on validation.

Introduction to Math for Data Science

Thomas Nield | Instructor at University of Southern California | Founder of Nield Consulting Group and Yawman Flight

In this training, Thomas Nield provided a crash course of carefully curated topics to jumpstart proficiency in key areas of mathematics, including probability, statistics, hypothesis testing, and linear algebra. Along the way, attendees integrated what they learned and saw practical applications for real-world problems, including how statistical concepts apply to machine learning, and how linear algebra is used to fit a linear regression.
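As a rough illustration of the last point, here is a minimal sketch of fitting a simple linear regression by solving the normal equations (XᵀX)b = Xᵀy. It is not taken from the training itself; the function name `fit_line` and the sample data are assumptions made for this example, and it uses only the Python standard library.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by solving the normal equations.

    Hypothetical helper for illustration; not code from the training.
    """
    n = len(xs)
    # Entries of X^T X for the design matrix X with rows [1, x]
    sx = sum(xs)
    sxx = sum(x * x for x in xs)
    # Entries of X^T y
    sy = sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    # Solve the 2x2 linear system with Cramer's rule
    det = n * sxx - sx * sx
    if det == 0:
        raise ValueError("x values must not all be identical")
    intercept = (sy * sxx - sx * sxy) / det
    slope = (n * sxy - sx * sy) / det
    return intercept, slope


# Made-up points that lie exactly on y = 1 + 2x
intercept, slope = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
print(intercept, slope)
```

With more than one input feature, the same normal equations are usually solved with a linear algebra library rather than by hand.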

Data Wrangling with Python

Sheamus McGovern | CEO at ODSC | Software Architect, Data Engineer, and AI Expert

Data wrangling is the cornerstone of any data-driven project, and Python stands as one of the most powerful tools in this domain. This session gave attendees hands-on experience with the essential techniques. From cleaning and transforming raw data to making it ready for analysis, this course equipped attendees with the skills needed to handle real-world data challenges.

Generative AI, Autonomous AI Agents, and AGI – How new Advancements in AI will Improve the Products we Build

Martin Musiol | Co-Founder and Instructor at Generative | Principal Data Science Manager at Infosys Consulting

This engaging workshop provided a hands-on journey into the world of Generative AI and Autonomous Agents, crucial building blocks towards achieving Artificial General Intelligence (AGI). Attendees took a deep dive into the transformative shift in AI, gaining insights into the revolutionary impact of Generative AI across various domains such as text, image, video, and 3D object generation, as well as data augmentation.

Retrieval Augmented Generation (RAG) 101: Building an Open-Source “ChatGPT for Your Data” with Llama 2, LangChain, and Pinecone

Greg Loughnane | Lead Instructor, Building with LLMs at FourthBrain | Founder & CEO at AI Makerspace | AI Professor at Founderz

Chris Alexiuk | Head of LLMs at AI Makerspace | Founding Machine Learning Engineer at Ox

During this workshop, attendees walked through each component of a simple RAG system, from vector stores (using Pinecone) to embedding models (using one from Hugging Face’s embedding leaderboard) to the LLM Ops infrastructure glue that holds it all together. They learned how to think about each component conceptually as well as how to set it up in Python code. Attendees also discussed RAG evaluation and applied RAGAS, an emerging best-practice framework for assessing the quality of RAG outputs, to the system. Finally, they discussed the best ways to improve simple RAG systems, from modeling optimization with RAGAS to using prompt or embedding caches to increase efficiency.
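To make the retrieve-then-prompt flow of a RAG system concrete, here is a toy sketch in plain Python. It is not the workshop's code and does not call Pinecone, LangChain, or Llama 2: a word-count vector stands in for a real embedding model, an in-memory list stands in for the vector store, and the final call to an LLM is left out. All function names and sample documents are assumptions made for this example.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query (stand-in for a vector store lookup)."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Assemble the prompt a real system would send to the LLM."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


# Made-up documents for illustration
docs = [
    "pinecone is a vector store",
    "llama 2 is a large language model",
    "langchain connects the components",
]
print(build_prompt("what is a vector store", docs))
```

In a real system, each stand-in would be replaced by the corresponding component, and the prompt would be passed to the LLM to generate the answer.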

How can I learn more at the next conference?

While watching videos on-demand is a great way to learn about AI and data science, nothing beats the live conference experience. To learn more in person in a hands-on setting, or virtually with thousands of other data scientists from around the world, join us at ODSC Europe 2024 (September 5-6) or ODSC West 2024 (October 29-31). Tickets will only go up in price over time, so be sure to get your passes now!