Abstract

The purpose of this paper is to develop the theory of principal components analysis succinctly from the fundamentals of matrix algebra and multivariate statistics. Principal components analysis is sometimes used as a descriptive technique to explain the variance-covariance or correlation structure of a dataset. However, most often, it is used as a dimensionality reduction technique to visualize a high dimensional dataset in a lower dimensional space. Principal components analysis accomplishes this by using the first few principal components, provided that they account for a substantial proportion of variation in the original dataset. In the same way, the first few principal components can be used as inputs into a cluster analysis in order to combat the curse of dimensionality and optimize the runtime for large datasets. The application portion of this paper will apply these methods to a US Crime 2018 dataset extracted from the Uniform Crime Reports on the FBI’s website.

Advisor

Galkande (Iresha) Premarathna

Committee Member

Mezbahur Rahman

Committee Member

Deepak Sanjel

Date of Degree

2020

Language

english

Document Type

Thesis

Degree

Master of Science (MS)

Department

Mathematics and Statistics

College

Science, Engineering and Technology

Share

COinS
 

Rights Statement

In Copyright