Abstract: This advanced Natural Language Processing (NLP) workshop focuses on text summarization: you will automatically generate news headlines with a model fine-tuned on Reuters news data. You’ll also get a glimpse into the emerging field of Explainable AI.
NLP is one of the fastest-growing fields within AI. It covers a wide variety of tasks, such as text classification, question answering (e.g. chatbots), translation, topic modeling, sentiment analysis, and summarization. In this workshop, we focus on text summarization, as it is not commonly showcased in tutorials despite being a powerful and challenging application of NLP.
We see a trend towards pre-training Deep Learning models on a large text corpus and then fine-tuning them for a specific downstream task (also known as transfer learning). In this hands-on workshop, you’ll get the opportunity to apply a state-of-the-art summarization model to generate news headlines. We fine-tuned this model on Reuters news data, which is professionally produced by journalists and strictly follows rules of integrity, independence and freedom from bias.
The move towards more complex models for NLP tasks makes the need for AI explainability more apparent. How can we increase trust in what the model generated? With this workshop, we’ll bring you a step closer to answering this question.
We will use the Python programming language, as it has a large community across various industries and has become a standard in applied NLP. We chose Google Colab to host our code and training material so that you can run everything in the browser without any local setup.
Guided hands-on exercises, supervised by mentors with several years of industry experience, reinforce the topics introduced on text summarization and explainable AI. At the end of this session, you will walk away with an interactive notebook that gives you a head start in applying the learned concepts to your own challenges.
Bio: Nina Hristozova is a Data Scientist at Thomson Reuters (TR) Labs. She has a BSc in Computer Science from the University of Glasgow, Scotland. As part of her role at TR, she has worked on a wide range of projects applying ML and DL to a variety of NLP problems. Her current focus is on applied summarization of legal text. She is actively engaged with local technology Meetups, where she shares her enthusiasm for and knowledge of NLP through tech talks and workshops. In her free time, she plays volleyball for the local team in Zug and enjoys going to the mountains and stand-up paddleboarding (SUP) on the lakes.
Junior Data Scientist | Thomson Reuters