Skip to main content

Multivariate Statistics

Study Course Description

Course Description Statuss:Approved
Course Description Version:4.00
Study Course Accepted:14.03.2024 11:36:29
Study Course Information
Course Code:SL_119LQF level:Level 7
Credit Points:2.00ECTS:3.00
Branch of Science:Mathematics; Theory of Probability and Mathematical StatisticsTarget Audience:Life Science
Study Course Supervisor
Course Supervisor:Andrejs Ivanovs
Study Course Implementer
Structural Unit:Statistics Unit
The Head of Structural Unit:
Contacts:23 Kapselu street, 2nd floor, Riga, statistikaatrsu[pnkts]lv, +371 67060897
Study Course Planning
Full-Time - Semester No.1
Lectures (count)6Lecture Length (academic hours)2Total Contact Hours of Lectures12
Classes (count)6Class Length (academic hours)2Total Contact Hours of Classes12
Total Contact Hours24
Part-Time - Semester No.1
Lectures (count)6Lecture Length (academic hours)1Total Contact Hours of Lectures6
Classes (count)6Class Length (academic hours)2Total Contact Hours of Classes12
Total Contact Hours18
Study course description
Preliminary Knowledge:
Higher mathematics, probability, statistics, linear models, basic knowledge of R programming.
Objective:
The aim of the course is to introduce the tools and concepts of multivariate data analysis with a strong focus on applications with R program.
Topic Layout (Full-Time)
No.TopicType of ImplementationNumberVenue
1Introduction to multivariate analysis, multivariate dataset examples, Covariance, correlation, multivariate normal distribution. Repetition of basic linear algebra elements: determinants, inverse, eigenvalues and eigenvectors, decompositions and quadratic forms.Lectures1.00auditorium
2Visualizing of multivariate data with R. Practicing matrix algebra calculations in R.Classes1.00computer room
3Principal components (PC) analysis. A geometrical approach to reducing data matrix dimension. Definition, interpretation and inference on principal components. Normalized principal components.Lectures1.00auditorium
4Real-world data example analysis in R: calculating PCs, determining statistical significance, drawing plots to interpreting PCs.Classes1.00computer room
5Factor analysis. Orthogonal factor model. Interpretation of factors. Test for the number of common factors. Comparison with PC analysis.Lectures1.00auditorium
6Factor analysis in R: estimating the factor model, testing the number of factors, drawing plots to interpret factors.Classes1.00computer room
7Discriminant analysis. Classes, labels and classification accuracy measures. Linear and quadratic discriminant analysis.Lectures1.00auditorium
8Discriminant analysis in R.: estimation, interpretation, comparison of methods.Classes1.00computer room
9Cluster analysis. Proximity between objects, distance functions. Various clusterization algorithms.Lectures1.00auditorium
10Cluster analysis in R: realizing and comparing various clusterization algorithms. Determining the optimal number of clusters.Classes1.00computer room
11Multivariable linear regression. Multivariate normality tests.Lectures1.00auditorium
12Multivariable linear regression in R: estimation, testing and interpretation.Classes1.00computer room
Topic Layout (Part-Time)
No.TopicType of ImplementationNumberVenue
1Introduction to multivariate analysis, multivariate dataset examples, Covariance, correlation, multivariate normal distribution. Repetition of basic linear algebra elements: determinants, inverse, eigenvalues and eigenvectors, decompositions and quadratic forms.Lectures1.00auditorium
2Visualizing of multivariate data with R. Practicing matrix algebra calculations in R.Classes1.00computer room
3Principal components (PC) analysis. A geometrical approach to reducing data matrix dimension. Definition, interpretation and inference on principal components. Normalized principal components.Lectures1.00auditorium
4Real-world data example analysis in R: calculating PCs, determining statistical significance, drawing plots to interpreting PCs.Classes1.00computer room
5Factor analysis. Orthogonal factor model. Interpretation of factors. Test for the number of common factors. Comparison with PC analysis.Lectures1.00auditorium
6Factor analysis in R: estimating the factor model, testing the number of factors, drawing plots to interpret factors.Classes1.00computer room
7Discriminant analysis. Classes, labels and classification accuracy measures. Linear and quadratic discriminant analysis.Lectures1.00auditorium
8Discriminant analysis in R.: estimation, interpretation, comparison of methods.Classes1.00computer room
9Cluster analysis. Proximity between objects, distance functions. Various clusterization algorithms.Lectures1.00auditorium
10Cluster analysis in R: realizing and comparing various clusterization algorithms. Determining the optimal number of clusters.Classes1.00computer room
11Multivariable linear regression. Multivariate normality tests.Lectures1.00auditorium
12Multivariable linear regression in R: estimation, testing and interpretation.Classes1.00computer room
Assessment
Unaided Work:
1. Review of compulsory and additional literature to expand the knowledge acquired in lectures and classes. 2. Students will be expected to prepare five R based home assignments related to each of the topics: a. Principal components analysis (2nd practical class). b. Factor analysis (3rd practical class). c. Discriminant analysis (4th practical class). d. Cluster analysis (5th practical class). e. Multivariable linear regression (6th practical class).
Assessment Criteria:
Assessment on the 10-point scale according to the RSU Educational Order: • 5 home assignments to be handed in – 70%. • Written Exam – 30%.
Final Examination (Full-Time):Exam (Written)
Final Examination (Part-Time):Exam (Written)
Learning Outcomes
Knowledge:Student: • has gained an in-depth knowledge of the theoretic probabilistic concepts related to multivariate analysis. • illustrates the visualization techniques describing the multivariate data. • assesses the most important multivariate techniques such as principal components analysis, factor analysis, cluster analysis and discriminant analysis.
Skills:• Implements appropriate multivariate data visualizations in R programme. • Can independently apply multivariate data analysis techniques in R programme, to carry out research activities or highly qualified professional functions.
Competencies:• Can compare and understand the aims of various multivariate data analysis methods and choose the most appropriate for the analysis of the data set. • Can generate hypothesis and make analysis-based decisions related to multivariate data.
Bibliography
No.Reference
Required Reading
1W. K. Haerdle Härdle, L. Simar, Applied Multivariate Statistical Analysis. Springer. 2015
2D. Zelterman. Applied Multivariate Statistics with R. Springer, Statistics for biology and health series, 2015
Additional Reading
1R. A. Johnson, D.W. Wickern, Applied Multivariate Statistical Analysis, 6th edition. Prentice & Hall, 2007
2T. Hothorn, B. Everitt, An Introduction to Applied Multivariate Analysis with R. Springer, Use R! series, 2011