Sentiment Analysis Python – 3 – Cleaning Text for Natural Language Processing (NLP)
All right guys! Welcome back. In this video we are going to learn how to clean the text before we can apply our natural language processing concepts on it. Cleaning is done in two main ways. Making sure everything is in lowercase and secondly we remove all the unwanted characters from it like punctuations.
But even before that we need to read text in our python program.
We need to convert it to lowercase because the words are the soul of analyzing text. And when we compare words in natural language processing a word like an Apple with a capital A, is not equal to the same word in small case, for example an apple with a small ‘a’. Therefore to compare words we need to make sure the entire text which we are going to be analyzing is in lower case. This is will make more sense as we go further along the videos.
Source Code – https://github.com/attreyabhatt/Sentiment-Analysis
Next video – Tokenization and Stop Words
Subscribe – https://www.youtube.com/channel/UCirPbvoHzD78Lnyll6YYUpg?sub_confirmation=1
Website – www.buildwithpython.com
Instagram – http://instagram.com/buildwithpython
#python #nltk #nlp