
Abstract: Feature engineering has long relied on manual expertise, demanding domain knowledge and experience. While automated approaches exist, they often involve brute-force methods and subsequent feature selection. The emergence of Large Language Models (LLMs) offers a novel perspective on feature engineering. LLMs encapsulate extensive knowledge, presenting an opportunity to reshape this process.
In this presentation, we explore an approach that utilizes LLMs to guide feature engineering. By leveraging the contextual understanding within LLMs, we have developed a system for LLM-assisted feature engineering. Our research demonstrates the practical benefits of this synergy – from feature ideation to improved feature relevance to enhanced model interpretability and efficiency.
Join us to discover how our framework automates and enhances feature engineering, contributing to better predictive models. This talk showcases the integration of human expertise with language models, revolutionizing feature engineering in data science.
Bio: Sergey is a data scientist with a background in physics and neurobiology. FeatureByte is Sergey's second startup. He was one of the first employees at DataRobot where he created and led a professional services group and helped the company grow into a unicorn. Sergey is widely known for being a Kaggle Grandmaster and holding the #1 rank on Kaggle in the past. Multiple times he was mentioned as one of the top data scientists by various publications. Sergey’s passion is in machine learning, predictive modeling and inventive feature engineering.

Sergey Yurgenson
Title
Head of Semantic Data Science | Featurebyte
