Download this data set and load it into your R workspace: final2026.RData

Two objects are contained in this data set, which you can find in the top-right Environment panel.

GeneExp: A data frame of gene expression levels. Each row represents a cancer patient, and each column a gene. Rows are named Sub1, Sub2, … (“Sub” means “subject”), and columns are named Gene1, Gene2, …. For example, the expression level of Gene1 in Sub2 is 9.782938.

CancerType: A factor that gives the cancer type of each patient. There are 3 types of cancer in this data set: COAD (colon adenocarcinoma), KIRC (kidney renal clear cell carcinoma), and PRAD (prostate adenocarcinoma). The order of subjects in GeneExp is the same as that in CancerType. For example, Sub2 has cancer type PRAD.

There are five problems, marked by:
[Problem I: Basic Multivariate Analysis]
[Problem II: Advanced Multivariate Analysis]
[Problem III: k-means]
[Problem IV: Regression]
[Problem V: Hierarchical Clustering]
Each problem has multiple sub-questions.
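As a quick sanity check after downloading, the loading step and the two examples given above can be sketched as follows. This is a minimal sketch, assuming final2026.RData has been saved in the current working directory; the variable names data_file and loaded are illustrative and not part of the data set.

```r
# Assumption: final2026.RData sits in the current working directory.
data_file <- "final2026.RData"

if (file.exists(data_file)) {
  # load() restores the saved objects and returns their names.
  loaded <- load(data_file)
  print(loaded)

  # Rows are subjects, columns are genes.
  print(dim(GeneExp))

  # Expression level of Gene1 in Sub2 (the text above gives 9.782938).
  print(GeneExp["Sub2", "Gene1"])

  # Number of patients per cancer type (COAD, KIRC, PRAD).
  print(table(CancerType))

  # CancerType is in the same order as the rows of GeneExp,
  # so row names can be used to look up a subject's type
  # (the text above gives PRAD for Sub2).
  print(CancerType[rownames(GeneExp) == "Sub2"])

  # The two objects should describe the same set of subjects.
  stopifnot(nrow(GeneExp) == length(CancerType))
} else {
  message("File not found: ", data_file, " (check getwd())")
}
```

If the file is stored elsewhere, either pass its full path to load() or change the working directory with setwd() first.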