Course Notes:

  1.   STOR881-08-22-2017:   Organizational Matters, OODA Book, What is OODA?,  Taste of OODA Examples, Visualization, Scatterplot Matrix Views, Principal Component Analysis (PCA)
  2.   STOR881-08-24-2017:   OODA Basics, Data Object Determination & Representation, Object & Descriptor Spaces, 2-d toy Example, Curves as Data 10-d & 50-d, RNAseq Data, Revisit Mortality Data
  3.   STOR881-08-29-2017:   Correlation PCA, Limitations of PCA, NCI60 data, Marginal Distribution Plots, Start Drug Discovery Data
  4.   STOR881-08-31-2017:   Continue Drug Discovery Data, Marron’s Matlab Software, DiProPerm Hypothesis Test
  5.   STOR881-09-05-2017:   Melanoma Data, Transformations,  Revisit Drug Discovery Data,  Yeast Cell Cycle Data & Fourier Subspace
  6.   STOR881-09-07-2017:   Review of Linear Algebra & Multivariate Probability, PCA as Optimization, Redistribution of Energy
  7.   STOR881-09-12-2017:   Different Views of PCA, Data Representation –  Simulation – Visualization, Dual PCA & Mortality Data, Cornea Data & Robustness
  8.   STOR881-09-14-2017:   Cornea Data, Robustness: Center & PCA, Spherical PCA, Elliptical PCA
  9.   STOR881-09-19-2017:   GWAS Analysis, Classification: Fisher Linear Discrimination, Gaussian Likelihood Ratio, Mean Difference
  10.   STOR881-09-21-2017-part1, STOR881-09-21-2017-part2, STOR881-09-21-2017-part3:   HDLSS Discrimination, Maximal Data Piling
  11.   STOR881-09-26-2017:    Kernel Embedding, Support Vector Machine, Distance Weighted Discrimination, Faces Data
  12.   STOR881-09-28-2017:    DWD Simulations, Batch Adjustment, HDLSS Asymptotics – Jonathan Williams {Bayesian HMM}, Ruibin Ma {Generalized Cylindrical Surface Deformation}
  13.   STOR881-10-03-2017:    Why DWD for Batch Adjustment, HDLSS Asymptotics – Yunxiao Liu {Integrated Volatility Functionals}
  14.   STOR881-10-05-2017:     HDLSS Asymptotics – Jack Prothero {Image Textures}
  15.   STOR881-10-10-2017:    Meilei Jiang {Angle Based Joint & Individual Variation Explained}
  16.   STOR881-10-12-2017:    University Day – No Class
  17.   STOR881-10-17-2017:    Radial DWD, Melanoma Data & ROC curves, Introduction to Clustering – Zhenlin Xu {Introduction to 3D deep learning}, Dylan Glotzer {Extreme Ship Motions}
  18.   STOR881-10-19-2017:    Fall Break
  19.   STOR893-10-24-2017:    Statistical Smoothing – Brendan Brown, Chen Shen, Wesley Hamilton {Topological Data Analysis}
  20.   STOR881-10-26-2017:    SiZer for Inference and Analysis of Mass Flux & Cell Cycle Data, Clustering, K-means, SWISS – Duyeol Lee {PCA in Credit Risk Modelling}
  21.   STOR881-10-31-2017:    Hierarchical Clustering, SigClust, QQ Plots, QQ Envelope – Kevin Donovan {Non-parametric inference for immune response thresholds of risk in vaccine studies}, Matt Jansen {Text Mining}
  22.   STOR881-11-02-2017:    SigClust, Shapes as Data Objects – Aniish Sridhar {Analytics Competition}, Aditya Balaram {Single Pass PCA}
  23.   STOR881-11-07-2017:    Landmark Based Shape, Equivalence Relations, Quotient Spaces, Shape Representations, Male Pelvis Data & S-Reps – Gang Li {Boosting Methods}, Peiyao Wang {Sparse gradient learning}, Michael Conroy {Regularized PCA}
  24.   STOR881-11-09-2017:     Manifold Data Analysis, Principal Nested Spheres, Backwards PCA – Mark He {Commuting networks amongst US counties}, Adam Waterbury {Reproducing Kernels for FDA}
  25.   STOR881-11-14-2017:    Backwards PCA, Nonnegative Matrix Factorization – Aman Barot {Introduction to Deep learning}, Pooja Saha {LASSO regression}, Yue Jiang {CART}
  26.   STOR881-11-16-2017:    Nested Constraints, Principal Nested Submanifolds – Shengjie Chai {Cancer Metastasis}, Di Qin {Kernel PCA}, Yaoyu Chen {Introduction to Generative Adversarial Networks}
  27.   STOR881-11-21-2017:    Curve Registration, Fisher Rao Approach – Xi Yang {Multi-View Weighted Network}, Hang Yu {Introduction to multiple kernel learning}, Zhipeng Ding {Fast Predictive Simple Geodesic Regression}
  28.   STOR881-11-23-2017:    Thanksgiving
  29.   STOR881-11-28-2017:    Curve Registration, TIC Data, PNS Approach, Juggling Data – Yumeng Wang {Efficacy Analysis}, Jiawei Xu {Childbirth and breast cancer risk}
  30.   STOR881-11-30-2017:    Probability Distributions as Data Objects, Random Matrix Theory – Zhengling Qi {Classification in personalized medicine}, Zhiyuan Liu {CPNS Visualization in Pablo}, Fuhui Fang {DiProPerm Analysis of OsteoArthritis Data}
  31.   STOR881-12-05-2017-part1, STOR881-12-05-2017-part2:    Tree Structured Data Objects


Link to Marron’s Matlab Software (.zip file; expand it into its 4 directories and add those to the Matlab path)

LungCancer2011.m for Analysis of 2011 RNAseq Lung Cancer Data (you need to remove the suffix “.txt” from the file name)

counts, for 2011 RNAseq Lung Cancer Data

exonsMarron, for 2011 RNAseq Lung Cancer Data

Single .zip file with above 3, plus generated graphics


