Training course "Data analysis and data mining with Python"
(Course no. 83/2014 in the training programme of Forschungszentrum Jülich)
Target audience: | Scientists who wnat to use Python for data analysis |
Contents: | |
Prerequisites: | Basic Experience with Python such as provided by the course Introduction to Pythonis assumed and required for the hands-on sessions. |
Language: | This course is given in English. |
Duration: | 3 days |
Date: | 17-19 November 2014, 9:00 - 16:30 |
Venue: | Jülich Supercomputing Centre, Ausbildungsraum 1, building 16.3, room 021 |
Number of participants: | minimum 5 |
Instructors: | Dr. Jan Meinke, Dr. Olav Zimmermann, JSC |
Contact: | Dr. Jan Meinke Phone: +49 2461 61-2315 E-mail: j.meinke@fz-juelich.de |
Registration: | Please register with Dr. Jan Meinke until 31 October 2014. |
Pandas, matplotlib, and scikit-learn make Python a powerful tool for data analysis, data mining, and visualization. All of these packages and many more can be combined with IPython to provide an interactive extensible environment.
In this course, we will explore matplotlib for visualization, pandas for time series analysis, and scikit-learn for data mining. We will use IPython to show how these and other tools can be used to facilitate interactive data analysis and exploration.
Day 1: Basic data analysis and visualization
- Introduction to IPython for interactive data analysis.
- pandas
- NumPy
- matplotlib
Day 2: Advanced data analysis visualization
- pandas
- Statsmodels
- Mayavi2
Day 3: Advanced topics
- Portable data formats
- scikit-learn
- PyMAFIA
This course is aimed at scientists who wish to explore the productivity gains made possible by Python for data analysis.