Research entry
Genomics PCA Dashboard
2021 · Academic Archive
The original interactive dashboard for the genomics PCA work, built as the first web interface for exploring population structure in the 1000 Genomes dataset.
About
Built as a college project to explore population genetics using principal component analysis. The dashboard lets users interactively explore how genome-wide SNPs cluster by population ancestry.
This page is kept as the original dashboard version of the genomics work. The broader public-facing evolution of the project now lives in the newer explorer app.
Features
- Scatter matrix — all PCs plotted against each other, coloured by country
- 3D PCA plot — PC1/PC2/PC3 in 3D space, coloured by superpopulation
- Choropleth map — geographic distribution of sample ancestry
- Scree plot — cumulative variance explained by each principal component
- Dropdown filter — filter by superpopulation (AFR, AMR, EAS, EUR, SAS)
Current Public App
Currently hosted on Railway. The cleaner long-term presentation is to map this to a dedicated genomics subdomain on the main portfolio domain.
Tech Stack
Python · Dash · Plotly · Pandas · NumPy · Dash Bootstrap Components · Railway (hosting)