Gene Metrics Table

The following table shows important quality control metrics for each sample.

Gene Length (bp): The total number of base pairs (bp) in the gene annotation.

Total reads: The total number of reads mapping to the gene of interest in the database across all 6 replicates.

Mean Coverage: The mean read depth computed across all gene bases.

Coverage Coefficient of Variation (CV): The ratio of the standard deviation of read depth to the mean read depth, a measure of relative variability of coverage across all gene bases.

Coverage uniformity: The percentage of gene bases with more than 20% of the mean coverage.

% Bases with coverage > XXXx: The percentage of gene bases with coverage > 1000x/500x/300x/160x. Note that parts of the selected isoform may not be expressed in the sample, therby falsely lowering these scores. Also check the gene coverage plot.

FPKM: The number of fragments mapping to the gene of interest, per kilobase of gene, per million.

Gene	Gene Length (bp)	Total Reads	Mean Coverage	Coverage CV	Coverage Uniformity	% Bases with coverage > 1000x	% Bases with coverage > 500x	% Bases with coverage > 300x	% Bases with coverage > 160x	FPKM
GENEX	1,285	516,729	7,323	0.16	100.0%	100.0%	100.0%	100.0%	100.0%	1,288.08

Coverage Plot: invitro GENEX

The following plots show coverage in the eSHAPE db-K562 for the gene of interest. A sudden drop in coverage, especially at the 3’ and 5’ UTRs, might indicate expression of a shorter isoform in the sample. Dips in coverage, for the most part, are at exon edges.

Mutation Rate Plots: invitro GENEX

The following plots show mutation line plots across the gene of interest. Corresponding spikes in mutation, found in NAI and DMSO samples, might indicate a SNP in the sample compared to the reference genome.

The following box plot shows mutation rate distributions for DMSO and NAI samples for the gene of interest.

Reactivity Plot: invitro GENEX

The following plot shows SHAPE reactivity scores across the gene of interest at each position with >300x coverage. Higher reactivity scores correlate with a higher likelihood of being unpaired.

Data Files Table: invitro GENEX

The following table outlines the data files delivered with this eSHAPE db gene package.

File	Description
GENEX.bed	Gene annotation coordinate file used to compute gene metrics.
GENEX.fa	Gene annotation sequence fasta file to be used as input into the RNAStructure algorithm with the .shape file above for fold prediction.
eSHAPE_db-K562_invitro_DMSO_GENEX.bam	eSHAPE db-K562 invitro DMSO (untreated control) reads aligned on the gene. *
eSHAPE_db-K562_invitro_NAI_GENEX.bam	eSHAPE db-K562 invitro NAI (treated) reads aligned on the gene. *
eSHAPE_db-K562_invitro_reactivity_GENEX.bedgraph	Reactivity score at each position across the gene with >300x coverage. *
eSHAPE_db-K562_invitro_reactivity_GENEX.shape	Reactivity values formatted for use as input into the RNAStructure algorithm to guide the fold prediction.

Each .bam file also has a corresponding .bai index file to facilitate visualization in a genome viewer.

* Uses hg38 reference genome.

eSHAPE db-K562 Report

invitro GENEX

TABLE OF CONTENTS

Gene Metrics Table

Coverage Plot: invitro GENEX

Mutation Rate Plots: invitro GENEX

Reactivity Plot: invitro GENEX

Data Files Table: invitro GENEX