Exploratory Data Analysis in Python using pandas




[ad_1]

In this video, I will be showing you how to perform basic data pre-processing and exploratory data analysis (EDA) in Python using the pandas library. For this tutorial, we will be performing exploratory data analysis to answer practical questions using the NBA Basketball player stats data that we had previously obtained via web scraping.

🌟 Buy me a coffee: https://www.buymeacoffee.com/dataprofessor

πŸ“ŽCODE: https://github.com/dataprofessor/code/blob/master/python/pandas_exploratory_data_analysis.ipynb

β­• Playlist:
Check out our other videos in the following playlists.
βœ… Data Science 101: https://bit.ly/dataprofessor-ds101
βœ… Data Science YouTuber Podcast: https://bit.ly/datascience-youtuber-podcast
βœ… Data Science Virtual Internship: https://bit.ly/dataprofessor-internship
βœ… Bioinformatics: http://bit.ly/dataprofessor-bioinformatics
βœ… Data Science Toolbox: https://bit.ly/dataprofessor-datasciencetoolbox
βœ… Streamlit (Web App in Python): https://bit.ly/dataprofessor-streamlit
βœ… Shiny (Web App in R): https://bit.ly/dataprofessor-shiny
βœ… Google Colab Tips and Tricks: https://bit.ly/dataprofessor-google-colab
βœ… Pandas Tips and Tricks: https://bit.ly/dataprofessor-pandas
βœ… Python Data Science Project: https://bit.ly/dataprofessor-python-ds
βœ… R Data Science Project: https://bit.ly/dataprofessor-r-ds

β­• Subscribe:
If you’re new here, it would mean the world to me if you would consider subscribing to this channel.
βœ… Subscribe: https://www.youtube.com/dataprofessor?sub_confirmation=1

β­• Recommended Tools:
Kite is a FREE AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I’ve been using Kite and I love it!
βœ… Check out Kite: https://www.kite.com/get-kite/?utm_medium=referral&utm_source=youtube&utm_campaign=dataprofessor&utm_content=description-only

β­• Recommended Books:
βœ… Hands-On Machine Learning with Scikit-Learn : https://amzn.to/3hTKuTt
βœ… Data Science from Scratch : https://amzn.to/3fO0JiZ
βœ… Python Data Science Handbook : https://amzn.to/37Tvf8n
βœ… R for Data Science : https://amzn.to/2YCPcgW
βœ… Artificial Intelligence: The Insights You Need from Harvard Business Review: https://amzn.to/33jTdcv
βœ… AI Superpowers: China, Silicon Valley, and the New World Order: https://amzn.to/3nghGrd

β­• Stock photos, graphics and videos used on this channel:
βœ… https://1.envato.market/c/2346717/628379/4662

β­• Follow us:
βœ… Medium: http://bit.ly/chanin-medium
βœ… FaceBook: http://facebook.com/dataprofessor/
βœ… Website: http://dataprofessor.org/ (Under construction)
βœ… Twitter: https://twitter.com/thedataprof/
βœ… Instagram: https://www.instagram.com/data.professor/
βœ… LinkedIn: https://www.linkedin.com/in/chanin-nantasenamat/
βœ… GitHub 1: https://github.com/dataprofessor/
βœ… GitHub 2: https://github.com/chaninlab/

β­• Disclaimer:
Recommended books and tools are affiliate links that gives me a portion of sales at no cost to you, which will contribute to the improvement of this channel’s contents.

#dataprofessor #pandas #EDA #exploratorydataanalysis #datavisualization #datavisualisation #dataviz #datavis #dataframe #dataframes #jupyternotebook #jupyter #googlecolab #colaboratory #notebook #machinelearning #datascienceproject #randomforest #decisiontree #svm #neuralnet #neuralnetwork #supportvectormachine #python #learnpython #pythonprogramming #datascience #datamining #bigdata #datascienceworkshop #dataminingworkshop #dataminingtutorial #datasciencetutorial #ai #artificialintelligence #tutorial #dataanalytics #dataanalysis #machinelearningmodel #datascientist #datascientists

Source


[ad_2]

Comment List

  • Data Professor
    January 10, 2021

    I just finished watching this video. Thanks a lot for explaining each and every step so beautifully.

  • Data Professor
    January 10, 2021

    Hi prof, can I work with 2 dataframes in pandas like a vlookup in Excel? ie, i want to fill a column in df1 with a search criteria match on df2.

  • Data Professor
    January 10, 2021

    Thank you!

  • Data Professor
    January 10, 2021

    thank you sir

  • Data Professor
    January 10, 2021

    Poor dataset choice.

  • Data Professor
    January 10, 2021

    Very nice video. A good quick intro to Pandas for beginners and a good "refresh" for rusty users. Useful and clear. Kudos!

  • Data Professor
    January 10, 2021

    Thanks for the video Prof, much appreciated

  • Data Professor
    January 10, 2021

    good one

  • Data Professor
    January 10, 2021

    That was amazing!

  • Data Professor
    January 10, 2021

    Sir,
    It's very helpful and impressive tutorial of EDA. I really amazed by your style of presentation in Google Colab. Thank You 😊 ❀️

  • Data Professor
    January 10, 2021

    Hi Teacher, Do you have plans to make a video tutorial about de DASH?

  • Data Professor
    January 10, 2021

    soooo good .

  • Data Professor
    January 10, 2021

    This is the best comprehensive EDA intro. Hope you keep making more. Professor.

  • Data Professor
    January 10, 2021

    please consider for next times either using a bigger font sizes or zoom in the screen πŸ™‚ not everyone is watching these videos at 40" screens πŸ™‚

  • Data Professor
    January 10, 2021

    Awesome Explanation as Always!! Professor!!πŸ™‚

  • Data Professor
    January 10, 2021

    getting error below
    Traceback (most recent call last)

    <ipython-input-10-c13fc7d20dd1> in <module>()

    —-> 1 profile = ProfileReport(Iris, title="Pandas Profiling Report")

    ~Anaconda3libsite-packagespandas_profiling__init__.py in __init__(self, df, **kwargs)

    66

    67 # Get dataset statistics

    —> 68 description_set = describe_df(df)

    69

    70 # Get sample

    ~Anaconda3libsite-packagespandas_profilingmodeldescribe.py in describe(df)

    549

    550 # Get correlations

    –> 551 correlations = calculate_correlations(df, variables)

    552

    553 # Check correlations between numerical variables

    ~Anaconda3libsite-packagespandas_profilingmodelcorrelations.py in calculate_correlations(df, variables)

    191 # Get the Phi_k sorted order

    192 current_order = (

    –> 193 correlations["phi_k"].index.get_level_values("var1").tolist()

    194 )

    195

    ~Anaconda3libsite-packagespandascoreindexesbase.py in _get_level_values(self, level)

    3169 """

    3170

    -> 3171 self._validate_index_level(level)

    3172 return self

    3173

    ~Anaconda3libsite-packagespandascoreindexesbase.py in _validate_index_level(self, level)

    1956 elif level != self.name:

    1957 raise KeyError('Level %s must be same as name (%s)' %

    -> 1958 (level, self.name))

    1959

    1960 def _get_level_number(self, level):

    KeyError: 'Level var1 must be same as name (None)'

  • Data Professor
    January 10, 2021

    It's a great video. I have seen the best exploratory data analysis video in youtube. Please add examples with more complex data.

  • Data Professor
    January 10, 2021

    kindly zoom in when you are showing the code. apart from that nice video as always

  • Data Professor
    January 10, 2021

    Thank you for such a beautiful, step-wise & lucid explanation. One of the best videos on EDA.

  • Data Professor
    January 10, 2021

    Love it…

  • Data Professor
    January 10, 2021

    Easily explained , Thanks Professor

  • Data Professor
    January 10, 2021

    Awesome video & channel!! I'm a junior undergrad student in physics and data science πŸ™‚

  • Data Professor
    January 10, 2021

    As usual…. Another great video…

Write a comment