Intro to NLP with spaCy (1): Detecting programming languages | Episode 1: Data exploration




[ad_1]

In this new video series, data science instructor Vincent Warmerdam gets started with spaCy, an open-source library for Natural Language Processing in Python. His mission: building a system to automatically detect programming languages in large volumes of text. Follow his process from the first idea to a prototype all the way to data collection and training a statistical named entity recogntion model from scratch.

SPACY
● Website: https://spacy.io
● GitHub: https://github.com/explosion/spaCy
● Free online course: https://course.spacy.io
● Twitter: https://twitter.com/spacy_io

THIS VIDEO
● Code: https://github.com/koaning/spacy-youtube-material
● Stack Overflow dataset: https://www.kaggle.com/stackoverflow/stacksample

ABOUT VINCENT
Vincent Warmerdam is a co-founder of PyData Amsterdam and experienced data science instructor. He has been evangelizing data and open source for the last 5 years. You might know him from his PyData videos where he attempts to defend common sense over hype in data science.

● Follow Vincent on Twitter: https://twitter.com/fishnets88

Source


[ad_2]

Write a comment