Machine Learning in Python: Principal Component Analysis (PCA) for Handling High-Dimensional Data


In this video, I will be showing you how to perform principal component analysis (PCA) in Python using the scikit-learn package. PCA represents a powerful learning approach that enables the analysis of high-dimensional data as well as reveal the contribution of descriptors in governing the distribution of data clusters. Particularly, we will be creating PCA scree plot, scores plot and loadings plot.

🌟 Buy me a coffee:


β­• Playlist:
Check out our other videos in the following playlists.
βœ… Data Science 101:
βœ… Data Science YouTuber Podcast:
βœ… Data Science Virtual Internship:
βœ… Bioinformatics:
βœ… Data Science Toolbox:
βœ… Streamlit (Web App in Python):
βœ… Shiny (Web App in R):
βœ… Google Colab Tips and Tricks:
βœ… Pandas Tips and Tricks:
βœ… Python Data Science Project:
βœ… R Data Science Project:

β­• Subscribe:
If you’re new here, it would mean the world to me if you would consider subscribing to this channel.
βœ… Subscribe:

β­• Recommended Tools:
Kite is a FREE AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I’ve been using Kite and I love it!
βœ… Check out Kite:

β­• Recommended Books:
βœ… Hands-On Machine Learning with Scikit-Learn :
βœ… Data Science from Scratch :
βœ… Python Data Science Handbook :
βœ… R for Data Science :
βœ… Artificial Intelligence: The Insights You Need from Harvard Business Review:
βœ… AI Superpowers: China, Silicon Valley, and the New World Order:

β­• Stock photos, graphics and videos used on this channel:

β­• Follow us:
βœ… Medium:
βœ… FaceBook:
βœ… Website: (Under construction)
βœ… Twitter:
βœ… Instagram:
βœ… LinkedIn:
βœ… GitHub 1:
βœ… GitHub 2:

β­• Disclaimer:
Recommended books and tools are affiliate links that gives me a portion of sales at no cost to you, which will contribute to the improvement of this channel’s contents.

#dataprofessor #PCA #clustering #cluster #principalcomponentanalysis #scikit #scikitlearn #sklearn #prediction #jupyternotebook #jupyter #googlecolab #colaboratory #notebook #machinelearning #datascienceproject #randomforest #decisiontree #svm #neuralnet #neuralnetwork #supportvectormachine #python #learnpython #pythonprogramming #datascience #datamining #bigdata #datascienceworkshop #dataminingworkshop #dataminingtutorial #datasciencetutorial #ai #artificialintelligence #tutorial #dataanalytics #dataanalysis #factor #principalcomponent #principalcomponents #pc #machinelearningmodel



Comment List

  • Data Professor
    January 9, 2021

    Anyone interested in a tutorial video on Bioinformatics (using Python)? Comments down below πŸ‘‡
    If you find value in this video, please give it a Like πŸ‘ and Subscribe ❀️for more videos on Data Science. πŸ˜ƒ

  • Data Professor
    January 9, 2021

    Thanks professor 😊

  • Data Professor
    January 9, 2021

    can we also make it to plot 2d scatter plots of different combinations instead of 3d with a for loop in plotly or we need to use matplotlib for it ? also what could be the usage of pca for bioinformatics ?

  • Data Professor
    January 9, 2021

    Incredible tutorial, congratulations! The 3d viz are awesome and help a lot to understand loadings x attributes relationship. I was wondering if isnt possible to access the scoring coeeficient matrix that is used internally by the Transform(X). Do you know how can I achieve that?

  • Data Professor
    January 9, 2021

    Hi sir, thank you very much for the video! I have experimental data set of time dependent signals 800(time)x49(signals) voltage values. I used PCA and reduced it to 800×2.How can I reduce further and extract information from these set for ML application and is there any other feature extraction method that you can advice for signal feature extraction ?

  • Data Professor
    January 9, 2021

    How do you transform your own dataset into sklearns format?

  • Data Professor
    January 9, 2021

    How could we know that the components need to be only 3 in the starting
    pca = PCA(n_components=3)

  • Data Professor
    January 9, 2021

    hi, why not use the PCA function directly?

    from sklearn.decomposition import PCA

    pca = PCA(n_components=3)

  • Data Professor
    January 9, 2021

    Interesting! I was just following an online course about PCAπŸ˜„

  • Data Professor
    January 9, 2021

    Wow I loved those plots 😍

  • Data Professor
    January 9, 2021

    Very informative! Thanks for this important lesson πŸ™‚ keep it up!

Write a comment