Data Analysis with Python, Pandas, and Numpy Training Course

Overview

Pandas is a Python package that provides data structures for working with structured (tabular, multidimensional, potentially heterogeneous) and time series data.

Requirements

Basic Python and data analysis skills

Course Outline

Day 1

Data Analysis with pandas

  • Using vectorized data in pandas
  • Data wrangling
  • Sorting and filtering data
  • Aggregate operations
  • Analyzing time series

Data visualisation

  • Plotting diagrams with matplotlib
  • Using matplotlib from within pandas
  • Creating quality diagrams
  • Visualizing data in Jupyter notebooks
  • Other visualization libraries in Python

Day 2

Vectorizing Data in Numpy

  • Creating Numpy arrays
  • Common operations on matrices
  • Using ufuncs
  • Views and broadcasting on Numpy arrays
  • Optimizing performance by avoiding loops
  • Optimizing performance with cProfile

Other Python libraries for data analysis

  • scikit-learn
  • Scipy
  • statsmodel
  • RPy2

Leave a Reply

Your email address will not be published. Required fields are marked *