Computational Biology
What I do
I design scalable, reproducible computational frameworks for microbiome multi-omics—optimized for high-performance computing (HPC) systems and large-scale ecological datasets.
Core Areas
Workflow Automation and Pipelines Snakemake • SLURM • Shell scripting
- Snakemake-driven workflow architecture
- SLURM job scheduling and resource-aware pipeline design
- Fully automated multi-step bioinformatics pipelines
- Reproducible multi-omics workflows across large datasets
- HPC optimization (CPU/memory tuning, job arrays, logging strategies)
HPC, Cloud & Environment Management SLURM • Palmetto Cluster • Conda • Modules
- SLURM-based HPC workflow management
- Experience with Palmetto Cluster and shared computing environments
- Conda / mamba environment creation and dependency isolation
- HPC module systems and software stack troubleshooting
- Resource scaling for genome-resolved and multi-omics projects
Machine Learning & Statistical Computing scikit-learn • Linear models • Multivariate ecology
- scikit-learn for applied classification and regression workflows
- Linear and mixed-effects models (when appropriate)
- Multivariate ecological statistics in R (vegan and related tools)
- Bray–Curtis dissimilarity, PERMANOVA, RDA, db-RDA
- PCA / PCoA, clustering, ordination, ecological modeling
Version Control & Reproducibility Git • GitHub • Structured project design
- Git-based version control for research workflows
- GitHub repositories for transparent and collaborative development
- Branch-based workflow management
- Structured README documentation and reproducible project organization
Visualization & Scientific Figure Design R • Python • Illustrator • Inkscape
- ggplot2 for publication-grade statistical graphics
- Python matplotlib (and seaborn when appropriate)
- High-quality scientific figure production
- Final figure polishing and layout design (Adobe Illustrator, Inkscape)
Operating Systems Linux • macOS • Windows
- Linux-based research environments (primary development platform)
- macOS and Windows workflow compatibility
- Cross-platform troubleshooting and reproducible environment setup