Dataset Management for Computer Vision: Possibly the Most Underrated and Important Component to Delivering Successful Computer Vision Solutions in Real-life


When building Computer Vision solutions, emphasis is usually on the modelling side and on leveraging the latest algorithm.
While the model is important, in our experience we found that the key component to deliver a successful solution is to build and maintain a suitable dataset. In the talk, we will distil lessons learned from delivering real-life Computer Vision projects for big organizations that point at this.

In particular, we will discuss:

- disentangling the business goal from the possible technical how-to
- expressing assumptions and controlling for corner-cases through building a suitable dataset
- updating the dataset to reflect new lessons and evolving data


Andrea has 11y experience in predictive analytics and Machine Learning, having worked and led projects across industries for companies like Shell, Aon, Unilever, Barclays, Mizuho, Network Rails. During his career, he has worked on a number of applications, including financial markets predictions, recommender systems for consumer goods, Computer Vision detection models to prevent theft and digitalize documents, NLP models to automate document parsing and HR predictive analytics. In the last few years, he has been building, a platform to understand, prepare and manage Computer Vision datasets.

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google