Etienne Goffinet, PhD
Senior Researcher at Technology Innovation Institute
With over 8 years of experience in data science and machine learning, I am passionate about developing and applying innovative solutions to real-world problems. I am currently working as a senior researcher within the Biotechnology Research Center of the Technology Innovation Institute, where I have worked on several projects related to Large Language Models, including LLM applications for source code and protein generation.
All Sessions by Etienne Goffinet, PhD
Introduction to Protein Language Models for Synthetic BiologyML for Biotech and Pharma | Beginner
Protein Language Model are Transformer-like models that are trained on massive sets of protein sequences (represented as text) in an attempt to learn the biological 'grammar' of proteins.These models have a broad range of application, thanks to their generative and embedding abilities. In this workshop, we will get more familiar with this type of model, how they differ from their NLP counterparts and the tasks they can address. we will also get a short overview of the existing open-source models and datasets. During the hands-on session, we will start from a pre-trained language model and develop a basic example of protein function multi label classifier. We will then develop compare and benchmark different classification approaches, including a simple retrieval-augmented enhancement, and fine tuning.