Pandas is a Python package for the handling and analysis of spreadsheet data. The Pandas course deals with loading, cleaning, manipulating, merging, visualizing  spreadsheet like data. This course assumes familiarity with Python. It should be interesting, for example, for people who work a lot with Excel and want to automatize repetitive tasks or analyze more complicated data. The course consists of about 60%-70% exercises with a trainer per 5 to 9 participants helping individually. At the end of the course participants will have a very thorough knowledge and practical experience with Pandas. They will know all the core tools and capabilities of Pandas. The topics of the course: 

  • Numpy
  • Series (creating, working with, interoperating with Numpy)
  • DataFrames (create, manipulate, add/delete rows/columns, slice, side effects, ...)
  • Statistical functions on Series / DataFrames
  • Dealing with missing values
  • Reading / writing DataFrames from/ to csv-files, Excel files, ...
  • Group-by, Split-Calculate-Combine method
  • Merging, concatenating, inner-join, outer-join several Spreadsheets (i.e.DataFrames)
  • Visualizations (with matplotlib, DataFrame.plot()-method and seaborn)

Each of the listed topics has one or more exercise units. The course duration is 5 days.