We’re all used to hearing the never-ending debates over Python vs R…but what about Excel?
Well, this is the Excel vs Python for Data Cleaning comparison video.
After repeatedly being asked “why don’t you just use Python?” in response to so many of my Excel tutorials, either on Linkedin, here on YouTube, or even on Reddit, I finally decided to answer the question the best way I could fathom – a data cleaning collaboration!
In this video I have joined forces with the great Bradon Valgardson from the awesome channel @Chart Explorers, to take up the question and work through a realistic data cleaning exercise using both tools.
Bradon uses his fancy Jupyter notebook and leverages the Pandas and Numpy libraries to execute his data cleaning effort in Python, while I leaned exclusively on PowerQuery in Excel.
The results were very, very close and each tool has it’s respective strengths and weaknesses.
If you want to see who came out on top you’ll have to check out the video and stick around to the end when Bradon and I sit down together and hash out our experiences in the exercise.
Here’s how the Excel vs Python Data Cleaning Edition video is broken down:
0:00 – Introduction to the data cleaning exercise
7:35 – Data Cleaning using Python – Pandas, Numpy, Jupyter Notebooks
30:05 – Data Cleaning using Excel – PowerQuery
50:50 – Data Cleaning Debate and Scorecards for Excel vs Python