High Performance Data Processing in Python

  • 2019-02-14 09:11 AM

NumPy and Numba are popular Python libraries for processing large quantities of data. This talk explains how NumPy and Numba work under the hood and how they use vectorisation to process large amounts of data extremely quickly. We use these tools to reduce the processing time of a large, real 600 GB dataset from one month to 40 minutes, even when the code runs on a single MacBook.
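
To give a rough feel for what vectorisation means here, the sketch below computes a sum of squares three ways: an interpreted Python loop, a single NumPy call, and a loop compiled with Numba. It is not code from the talk. The function names and the toy input are made up for illustration, and the Numba import is optional so the example still runs (without any speed-up) where Numba is not installed.

```python
import numpy as np

try:
    from numba import njit  # optional: JIT-compiles the decorated loop
except ImportError:
    def njit(func):
        # Fallback (assumption for this sketch): without Numba the
        # function simply runs as ordinary, uncompiled Python.
        return func


def sum_of_squares_loop(values):
    """Plain Python loop: one interpreted iteration per element."""
    total = 0.0
    for v in values:
        total += v * v
    return total


def sum_of_squares_numpy(values):
    """Vectorised: the per-element work happens inside NumPy's compiled code."""
    return float(np.dot(values, values))


@njit
def sum_of_squares_numba(values):
    """Same explicit loop as above, compiled by Numba when it is available."""
    total = 0.0
    for i in range(values.shape[0]):
        total += values[i] * values[i]
    return total


# Hypothetical toy input; the dataset mentioned above is far larger.
data = np.arange(1_000, dtype=np.float64)

print(sum_of_squares_loop(data))
print(sum_of_squares_numpy(data))
print(sum_of_squares_numba(data))
```

All three functions return the same value. The difference is where the loop executes: in the Python interpreter, inside NumPy's compiled routines, or in machine code generated by Numba. Timing them on a larger array (for example with `timeit`) is a simple way to see the gap on your own machine.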