Aarhus University Seal

Introduction to multivariate data analysis - chemometrics

ECTS credits:
3


Course parameters:

Language: English
Level of course
: PhD course
Time of year
: April 2021
Hours:

3 days of lectures and exercises (24 h)
1 day of working with own data (8 h)
Preparation by reading selected book chapters and articles (20 h)
Writing report and prepare presentation (20 h)
Follow up 1 day presenting data analysis (8 h)
Course fee: 750 DKK for PhD and master students and AU-FOOD staff (covers 1 year software license), 4000 DKK for others (covers 1 year software non-academic license).
Capacity limits
: max. 30, min 8 participants. PhD students have highest priority. The teaching will be carried out as online teaching through Brightspace and Zoom.     

 

Objectives of the course:
The purpose of the course is to give an introduction to some of the common methods in multivariate data analysis, and give the students tools and knowledge to understand and perform PCA and PLS data analysis on their own data.

 

Learning outcomes and competences:
At the end of the course, the student should be able to:

  • Arrange data in a matrix appropriate for PCA and PLS.
  • Apply PCA (exploration) and PLS (regression) on new data and analyze the results.
  • Compare and contrast the methods for a given data analysis situation considering the benefits and the pitfalls of the methods.
  • Apply the most common standardization methods appropriately.
  • Examine relevant plots for outliers in PCA and PLS and thereby classify severe outliers, consider borderline cases and argue for the classification.
  • Apply appropriate validation of PLS models and consider the number of PLS components.
  • Outline the most common preprocessing methods.
  • Outline classification methods such as SIMCA and discriminant PLS.
  • Interpret PCA and PLS models described in scientific literature and describe your own results in a scientific way.
  • Critically evaluate other students work based on model parameters and knowledge of model characteristics.


Compulsory programme:
Attendance for a minimum of 80% of the theoretical and practical lessons is required to obtain the course diploma. Approved report.


Course contents:

Multivariate data analysis (chemometrics) can be used to solve problems involving large amounts of multivariate data generated by e.g. spectroscopy, chromatography or time series of many variables. In chemometrics informative patterns are found and interpreted instead of looking at classical, and often inadequate, univariate measures. Chemometrics is widely used in science and in scientific papers. It is important to know what features to use, how to use them correctly and how to interpret plots. Chemometrics include hypothesis generating methods, but can also be used for classification and prediction.

The course will give a thorough introduction to the chemometric methods, Principal Component Analysis (PCA) and Partial Least Squares (PLS) regression, including common data pre-processing.

Some mathematical and statistical expressions will be used in the course and a variety of data (e.g. chemical, sensory and spectroscopic data) will be used as examples.


Prerequisites:
Enrolled in a science based PhD programme. Master students can participate as a part their master project in agreement with the supervisor.

 

Name of lecturers:
Assistant professor Ulrik Kræmer Sundekilde, Department of Food Science, Aarhus University, Denmark
PhD student Katrine O. Poulsen, Department of Food Science, Aarhus University, Denmark

 

Type of course/teaching methods:
Lectures, computer exercises, data analysis of your own data, writing and presenting report.


Literature:
Selected chapters and papers will be announced later.
 

Course homepage:
Brightspace

 

Course assessment:
Passed/not passed assessment based on written report, presentation and discussion of results considering the learning outcomes.

 

Provider:
Department of Food Science 

Special comments on this course:
The course is organized in combination with the PhD course ‘Introduction to metabolomics’ and it is possible to follow both courses, although the courses can be taken individually if necessary.

Time:

Chemometrics:

  • 12. April: Lectures
  • 13. April: Lectures
  • 15. April: Lectures
  • 16. April: Workshop: Working with own data
  • 23. April: Deadline for handing in report
  • 28. April: Examination seminar (peer-feedback, teacher-feedback)

 

Metabolomics:

  • 8. April: Lectures
  • 9. April: Lectures
  • 16. April: Data preparation, workshop
  • 23. April: Deadline for handing in report
  • 29. April: Notice of assessment

 

Place:
Online on Brightspace and Zoom.    

 

Registration:
Deadline for registration is 21 March 2021. Information regarding admission will be sent out no later than two workdays after registration deadline.

For registration: Opens later

If you have any questions, please contact Ulrik Sundekilde, e-mail: uksundekilde@food.au.dk  

 

Course fee:
Metabolomics only (covering bread/coffee/fruit and folder):
PhD students and master students enrolled at Danish Universities and AU staff: 0 DKK
Others: 1500 DKK

Chemometrics only (covering bread/coffee/fruit, folder and LatentiX license):
PhD students and master students enrolled at Danish Universities and AU staff: 750 DKK
Others: 4000 DKK


Metabolomics and Chemometrics (covering bread/coffee/fruit, folder and LatentiX license):
PhD students and master students enrolled at Danish Universities and AU staff: 750 DKK
Others: 4000 DKK

19647 / i43