Introduction to NLP and Topic Modeling


In this workshop, I will introduce the basics of Natural Language Processing, including the structure of a typical NLP project, with a focus on topic modeling. We will build a topic modeling system using the BBC news dataset. After the workshop you will have a good grasp on the structure of an NLP project, methods used in NLP, and will have built a topic model project by preprocessing and vectorizing the data, building the topic model, visualizing and evaluating it.

Session Outline
Lesson 1. Learn about the structure of an NLP project and approaches currently used in NLP. At the end of this lesson, you will be able to tell which NLP architecture and which approach should be used for different NLP tasks.
Lesson 2. Learn about preprocessing text data before it can be used in a model. At the end of this lesson, you will be able to clean, preprocess and vectorize the data we will be using for the topic modeling project.
Lesson 3. Learn about different topic modeling approaches, including LDA, and how to choose the number of topics. At the end of this lesson, you will be able to build a topic model using LDA.
Lesson 4. Learn about topic modeling visualization and evaluation. At the end of this lesson, you will be able to create a graphical visualization of your topic model and evaluate it using different methods.

Background Knowledge
Python, Basics of Machine Learning


Zhenya Antić is an NLP consultant and founder of Practical Linguistics Inc. Her projects include document summarization, information extraction, topic modeling and sentiment analysis of consumer reviews, and document similarity. She is the author of the recently published Python Natural Language Processing Cookbook. Zhenya holds a PhD in Linguistics from the University of California Berkeley and a BS in Computer Science from the Massachusetts Institute of Technology.

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google