  • Covariance analysis
  • Dimensionality reduction techniques (PCA, t-SNE)
  • k-means, hierarchical clustering, heat maps
  • Linear discriminant analysis and neural networks for classification
  • ROC curves
  • Resampling methods
  • Big data processing: which analysis to choose when

Programming will be performed in Python.

General competences

The student:
- is capable of applying linear algebra to variance, covariance and correlation structures and understands the geometrical equivalents of basic multivariate reasoning
- is capable of representing real data from large datasets in a comprehensible manner
- is capable of carrying out inference about multivariate means
- understands and applies basic ordination, discrimination and classification methodologies: Principal Components Analysis, Discriminant Analysis and Cluster Analysis
- makes use of existing software packages in R to analyse data


Students are evaluated on a project in which they analyse a 'big' data set using methods covered in the course. The written report counts for 50% of the final grade. The remaining 50% comes from an individual oral exam, in which the student presents the project and the presentation serves as a basis for questions about the course content.

