Detecting differential changes

Bioinformatics

• 09 / 01 / 23

Share

Detecting differential changes between conditions

AgustÍn Gonzalez-Reymundez

A common end goal of an RNA-Seq experiment is to identify what genes have responded to a treatment. For example, has a newly developed drug increased the expression of a target or has a knockdown worked to decrease expression? In order to answer these questions we perform a type of analysis called differential expression (DE) analysis.

A DE analysis is a statistical procedure that identifies differentially up or downregulated genes between two or more conditions or samples. It involves comparing the expression levels of each gene in one group of samples (e.g., disease samples) to the expression levels in another (e.g., healthy samples) to identify genes that have changed across conditions. DE analysis can have a significant impact by identifying disease-associated genes (which can be used as potential drug development targets), and identify biomarkers that can be used for diagnosis, prognosis, or monitoring of disease progression.

DE analysis is typically performed using specialized software to maximize our ability to identify differences when the number of replicates by the condition is small (e.g., 2 or 3 technical repetitions) while accounting for differences in library size and false discovery rate due to multiple tests conducted all at once. At Eclipsebio, we use a powerful tool called DESeq2 to identify differentially expressed genes. One way that we use DESeq2 is with our eRibo service, where we can detect changes in ribosome-associated and total transcriptome counts between different conditions.

DESeq2 uses information across all genes in the experiment to produce a robust estimate of the variability (dispersion) between samples for each gene in a way that considers the logarithmic nature of read count data. It then uses these dispersions to divide the log2 fold changes between conditions and calculate a statistical test called the “Wald” test. This test helps us determine whether our observed differences are likely real or just due to chance, and provides robust lists of DE genes to support answering specific scientific questions.

The same framework that is used to identify differentially expressed genes can also be applied different data modalities. For example, a similar analysis can be performed with eCLIP peaks to determine if a region has differential enrichment following a treatment. In the case of eCLIP, to account for the presence of an input we compare the ratio of fold changes rather than the observed counts in the immunoprecipitated libraries alone.

Creating the right framework for an accurate differential analysis can take a lot of effort. At Eclipsebio we have experts with extensive experience in statistical methods to take out the guesswork of identifying differential genes or peaks. Contact us today to see how we can help you examine differentials in your experiment.

Latest eBlogs

Charting a New Era: How RNA Is Unlocking N‑of‑1 Cures

RNA is unlocking the ability to develop life‑saving therapies at unprecedented speed. The recent success with baby KJ demonstrates how RNA medicine can move a personalized treatment from diagnosis to clinic in less than a year.

Therapeutics

•

06 / 10 / 25

The three pillars of AI in RNA biology: why data is the hardest to get right

Artificial intelligence (AI) is transforming how we approach RNA research and drug discovery. In this eBlog we review how data is one of the key pillars for the successful use of AI.

AI

•

05 / 20 / 25

Contact us today to learn how our team can help you

Contact us