Introduction to Data Processing in Python with Pandas | SciPy 2019 Tutorial | Daniel Chen




[ad_1]

This is a tutorial for beginners on using the Pandas library in Python for data manipulation. We will go from the basics of how to load and look at a dataset in pandas (python) for the first time, and begin the process of preparing data for analysis. The topics covered are: – Load and look at slices and views of data – Groupby aggregates to summarize data – Tidy and reshape data – Write functions and apply them to data – Plotting data using Seaborn – Encode dummy variables to prepare for analysis and model fit – Fitting a model using sklearn By the end of this tutorial, you should have a solid foundation on working with datasets in Python. The last topic of encoding dummy variables segues into using other libraries, such as scikit-learn and statsmodels to fit models on your data.

Tutorial information may be found at https://www.scipy2019.scipy.org/tutorial-participant-instructions
See the full SciPy 2019 playlist at https://www.youtube.com/playlist?list=PLYx7XA2nY5GcDQblpQ_M1V3PQPoLWiDAC

Connect with us!
*****************
https://twitter.com/enthought
https://www.facebook.com/Enthought/
https://www.linkedin.com/company/enthought

Source


[ad_2]

Comment List

  • Enthought
    November 19, 2020

    This video explains Pandas so well. Great job Daniel, this is by far the best Pandas video on youtube.

  • Enthought
    November 19, 2020
  • Enthought
    November 19, 2020

    Notes –
    33:20 – groupby

  • Enthought
    November 19, 2020

    Awesome

  • Enthought
    November 19, 2020

    Well explained …Thank you Daniel.

  • Enthought
    November 19, 2020

    Melt around 50:00

  • Enthought
    November 19, 2020

    excellent thanks you

  • Enthought
    November 19, 2020

    one of the best tutorials on pandas

  • Enthought
    November 19, 2020

    Thanks very much for your python's lesson about pandas 🙂

  • Enthought
    November 19, 2020

    This is really a developer disease. You can program in a language or use some library competently, so you flatter yourself thinking you can also teach it effectively. WRONG!!

    Developing and Teaching are totally different skillsets. If you're really inclined, you should at least test this shit out on an audience. You need an overarching motivational project and progress through it.

    The guy is just regurgitating Pandas documentation. You know I can feed that site to a reader and get 80% of the effect. Try harder! all these tutorials are such a waste of time. You can't follow along, you don't know why the hell is the guy randomly jumping from concept to concept, how to apply them, WHY you should apply them, and when, yada yada yada.

    Waste of time if you ask me. And to all the people gushing over it, raise the bar a little higher. Please.

  • Enthought
    November 19, 2020

    I need to remember the syntax, while at the same time excel show you average value ,jus drag to your data , the average showed

  • Enthought
    November 19, 2020

    This is awesome. Thank you.

  • Enthought
    November 19, 2020

    Again you make video. Put that Mobile phone away from your mic.

  • Enthought
    November 19, 2020

    Great tutorial ,great Daniel 🙂 thanks

  • Enthought
    November 19, 2020

    Sir at 1:15:45 , we need to call two str to get the desired value,
    Like, ebola_long['cd_country'].str.split('_').str.get(0)

  • Enthought
    November 19, 2020

    you dropped total_bill in X=tips_dummy no?

  • Enthought
    November 19, 2020

    Can i have the access to your notes u have? please
    of if someone is having ?

  • Enthought
    November 19, 2020

    This is really awesome. I just started as an absolute beginner of coding, only finished Dojo's tutorial for the absolute beginner, and I am able to catch up with most of what you taught so far (1:39:00)!! Thank you!!!

  • Enthought
    November 19, 2020
  • Enthought
    November 19, 2020

    i need an extra tutorial for that

  • Enthought
    November 19, 2020

    in excel the pivot table stuff is much easier, (for me at least)

  • Enthought
    November 19, 2020

    Best Pandas tutorial so far I can find. Thanks.

  • Enthought
    November 19, 2020

    He was really a nice guy

Write a comment