About This Blog
Welcome to Daily Bioinformatics from Jojy’s Desk — my living notebook of daily bioinformatics work, microbial ecology, MAGs, MTX/MG, HPC troubleshooting, and coding.
- Metagenome & metatranscriptome analysis
- MAGs, viruses, CAZymes, energy metabolism markers
- Functional redundancy (FRed) modeling
- Hybrid-assembly & whole-genome workflows
- Machine-learning MAG binning (GPU/CPU)
- Daily troubleshooting, R/Python tips, figures
I post short updates daily. Use the folders below to browse ↓
📚 Series
- amplicon (7 posts)
- Applied Statistics for Microbiome Data (8 posts)
- From Laptop to HPC: Scaling Computational Biology Workflows (6 posts)
- Metagenome Analysis Series (11 posts)
- Metatranscriptomics: From Raw RNA to Ecological Interpretation (5 posts)
- mycobiome (1 posts)
- Size-Fractionated Microbiome Series (8 posts)
- viromics (6 posts)
- Visualization Series (5 posts)
- Whole Genome Analysis Series (5 posts)
🗂 Topics
- Mt (2 posts)
- R (1 posts)
- Databases (2 posts)
- Genomics (13 posts)
- Linux (5 posts)
- Machine learning (2 posts)
- Pangenomics (3 posts)
-
-
Your Code Is a Publication: How to Make Bioinformatics Workflows Citable with Zenodo
A peer-reviewed paper is not the only thing that counts as a scholarly contribution. The workflow you spent six months developing, the tutorial series that helped 200 students understand QIIME2, the reproducible pipeline that three other labs are now using — these are real contributions to science. They should be citable, findable, and part of your academic record. This post covers how to archive bioinformatics workflows on Zenodo, get a permanent DOI, add resources to your ORCID, CV, and website, and why this matters for faculty applications, grants, and building long-term scholarly impact.
-
Excel Skills Every Bioinformatician Should Know (And Isn't Embarrassed to Use)
Let's be honest. Every bioinformatician uses Excel. Metadata lives in Excel. Collaborators send Excel files. You do a quick check in Excel before you write the R code. This post covers the Excel skills I reach for every single day — filters, freeze panes, conditional formatting, pivot tables, VLOOKUP, XLOOKUP, COUNTIF, SUMIF, text functions — with real examples from MAG tables, taxonomy summaries, differential abundance results, and sample metadata. Plus: an honest answer to Excel vs R vs Python, because pretending you never use spreadsheets is just not true.
-
Why Every Bioinformatician Should Learn Git (And How to Set It Up Today)
You spend weeks writing a pipeline. It works. You make a small change. It breaks. You cannot remember what you changed. This is the scenario Git was built to prevent. This post explains what Git and GitHub are, why they matter for bioinformatics specifically, and walks through a complete setup guide — installing Git, connecting to GitHub with HTTPS authentication, cloning lab repositories, and the daily push-pull-commit workflow. Includes a lab organization setup, common authentication problems, and an optional SSH configuration for password-free access.
-
Re-working Figure 5: Visualizing Functional Redundancy (FRed) Across Bays, Seasons, and Salinity
Final version of my R-based figure for the FRed manuscript revision, including data cleaning, outlier filtering, and multi-layered ggplot styling.
-
My Journey into Microbiology, the Deep Sea, and Bioinformatics
From traditional medicine to deep-sea expeditions to becoming a computational microbial ecologist — this is my story.