Machine Learning – Text Classification with Python, nltk, Scikit & Pandas




[ad_1]

In this video I will show you how to do text classification with machine learning using python, nltk, scikit and pandas. The concepts shown in this video will enable you to build your own models for your own use cases. So let’s go!

_About the channel_____________________
TL;DR
Awesome Data science with very little math!

Hello I’m Jo the “Coding Maniac”!
On my channel I will show you how to make awesome things with Data Science. Further I will present you some short Videos covering the basic fundamentals about Machine Learning and Data Science like Feature Tuning, Over/Undersampling, Overfitting, … with Python.

All videos will be simple to follow and I’ll try to reduce the complicated mathematical stuff to a minimum because I believe that you don’t need to know how a CPU works to be able to operate a PC…

GitHub: https://github.com/coding-maniacs

_Equipment _____________________

Camera: http://amzn.to/2hkVs5X
Camera lens: http://amzn.to/2fCEU9z
Audio-Recorder: http://amzn.to/2jNu2KJ
Microphone: http://amzn.to/2hloKBG
Light: http://amzn.to/2w8J92N

_More videos _____________________

More videos in german: https://youtu.be/rtyJyzqeByU, https://youtu.be/1A3JVSQZ4N0
Subscribe “Coding Maniac”: https://www.youtube.com/channel/UCG0TtnkdbMvN5OYQcgNFY1w
More videos on “Coding Maniac”: https://www.youtube.com/channel/UCG0TtnkdbMvN5OYQcgNFY1w

_Social Media_____________________

►Facebook: https://www.facebook.com/codingmaniac/

_____________________

Source


[ad_2]

Comment List

  • Johannes Frey
    November 21, 2020

    very interesting video and accurate and to the point and explained in a decent manner,
    Hi man, I have twitter data and want to classify that data (twitter posts) on the basis of different labels(in separate column) that i gave to each twitter post manually, is it possible through this code ? And what algorithm you suggest are best for text classification? SVM? GBM? or Random forest? anybody guide me please? Johannes ? waiting for the reply

  • Johannes Frey
    November 21, 2020

    hi i have requirement can you please help me

  • Johannes Frey
    November 21, 2020

    Thank you so much for this. I have been looking for such a video for so long…
    thank you!

  • Johannes Frey
    November 21, 2020

    Hi Johannes, I tried to replicate the code using CountVectorizer. I get an error: ValueError: Found input variables with inconsistent numbers of samples: [29, 67]
    Any suggestion on how to solve this ?

  • Johannes Frey
    November 21, 2020

    Hey Hi,
    I already have keywords, is there a way where I can build the model directly

  • Johannes Frey
    November 21, 2020

    how this path has been used ?? with open('../data/yelp_academic_dataset_review.json') as data_file:

  • Johannes Frey
    November 21, 2020

    hi , i would like to ask you something. what techniques should i use to find some keyword in my csv file and then if match with the keyword, i want to assign it to another keyword. the output something like this,
    column A Keyword

    DUMPBLT:TESTING FAILED: OPERATOR PUSHED STOP BUTTON SYSTEM FAILED

    if i found keyword of 'DUMPBLT' and 'PUSHED STOP BUTTON' in column A, i want to assign it to "SYSTEM FAILED" and put to other column. can you help me about this ?

  • Johannes Frey
    November 21, 2020

    Your teaching style is AMAZING. To point, I've only just started learning python 1.5 months ago and now starting to tackle multi-label classification for my app and I understood (I think) your entire video.

    I did have a comment though -> for the tfidf, I would check the distribution of the dataset across the target vector first to avoid wasted time if it's heavily skewed. Unless, of course, the tfidf method already chooses an even distribution across the target vector for training. >.<

  • Johannes Frey
    November 21, 2020

    Hi,
    This video is great!
    I wanted to ask – how do i add precision, recall and F1 score for the validation set?
    Thnks!

  • Johannes Frey
    November 21, 2020

    i want dataset of your code

  • Johannes Frey
    November 21, 2020

    Thanks heaps!

  • Johannes Frey
    November 21, 2020

    Very Nicely explained.

  • Johannes Frey
    November 21, 2020

    I am getting memory error when I am running this code

  • Johannes Frey
    November 21, 2020

    Thanks, can u provide the GitHub link ?

  • Johannes Frey
    November 21, 2020

    Where can i get the dataset?

  • Johannes Frey
    November 21, 2020

    Damn good tutorial!! Subbed! Repo down?

  • Johannes Frey
    November 21, 2020

    Thanks a lot man this will really help to my next project which is called ped.vrs!

  • Johannes Frey
    November 21, 2020

    hello
    dear thanks for this benefit video i want to use Naive Bayes to classify Sentiment Analysis as positive and negative do you have any tutorial please help me if you can thanks

  • Johannes Frey
    November 21, 2020

    You can get the code.

  • Johannes Frey
    November 21, 2020

    pipelines only works in MAC/LINUX. is there any alternative for windows?

  • Johannes Frey
    November 21, 2020

    Can someone share the codes with us please ? the Github link is not working..

  • Johannes Frey
    November 21, 2020
  • Johannes Frey
    November 21, 2020

    Very useful and entertaining at the same time. looking forward for your future posts.

  • Johannes Frey
    November 21, 2020

    Thanks for the video. Can you provide link to the code. Your GitHub link doesn't work.

  • Johannes Frey
    November 21, 2020

    Hi, Nice video, I am trying to learn text mining in python.Can you share the github link for this code.The github link https://github.com/coding-maniac is not working

  • Johannes Frey
    November 21, 2020

    please give me a break…

  • Johannes Frey
    November 21, 2020

    The video is very helpful. Thanks..!!

  • Johannes Frey
    November 21, 2020

    phenomenal video subscribed!

  • Johannes Frey
    November 21, 2020

    Hello , is there a way To extract from a french text only nouns without writing 700 lines of code 😂😂😂

  • Johannes Frey
    November 21, 2020

    Sir please explain the functionality of lymbda x here.

  • Johannes Frey
    November 21, 2020

    very well explained, thank you so much..!!!

  • Johannes Frey
    November 21, 2020

    Great Video! This is very useful! Keep doing great job!

  • Johannes Frey
    November 21, 2020

    Very cool. Thanks!

  • Johannes Frey
    November 21, 2020

    Thanks nice video, Is there any way to dynamically update the dataset without manual intervention ?

  • Johannes Frey
    November 21, 2020

    Thanks for the walkthrough.

  • Johannes Frey
    November 21, 2020

    Cool

  • Johannes Frey
    November 21, 2020

    Vielen Dank für das ausführliche Tutorial 🙂 Hab dir mal ein Abo da gelassen!

Write a comment