eSHAPE db-K562 Report

invitro GENEX


TABLE OF CONTENTS


Gene Metrics Table



The following table shows important quality control metrics for each sample.

Gene Length (bp): The total number of base pairs (bp) in the gene annotation.

Total reads: The total number of reads mapping to the gene of interest in the database across all 6 replicates.

Mean Coverage: The mean read depth computed across all gene bases.

Coverage Coefficient of Variation (CV): The ratio of the standard deviation of read depth to the mean read depth, a measure of relative variability of coverage across all gene bases.

Coverage uniformity: The percentage of gene bases with more than 20% of the mean coverage.

% Bases with coverage > XXXx: The percentage of gene bases with coverage > 1000x/500x/300x/160x. Note that parts of the selected isoform may not be expressed in the sample, therby falsely lowering these scores. Also check the gene coverage plot.

FPKM: The number of fragments mapping to the gene of interest, per kilobase of gene, per million.


Gene Gene Length (bp) Total Reads Mean Coverage Coverage CV Coverage Uniformity % Bases with coverage > 1000x % Bases with coverage > 500x % Bases with coverage > 300x % Bases with coverage > 160x FPKM
GENEX1,285516,7297,3230.16100.0%100.0%100.0%100.0%100.0%1,288.08

Coverage Plot: invitro GENEX


The following plots show coverage in the eSHAPE db-K562 for the gene of interest. A sudden drop in coverage, especially at the 3’ and 5’ UTRs, might indicate expression of a shorter isoform in the sample. Dips in coverage, for the most part, are at exon edges.

Mutation Rate Plots: invitro GENEX



The following plots show mutation line plots across the gene of interest. Corresponding spikes in mutation, found in NAI and DMSO samples, might indicate a SNP in the sample compared to the reference genome.


The following box plot shows mutation rate distributions for DMSO and NAI samples for the gene of interest.



Reactivity Plot: invitro GENEX


The following plot shows SHAPE reactivity scores across the gene of interest at each position with >300x coverage. Higher reactivity scores correlate with a higher likelihood of being unpaired.



Data Files Table: invitro GENEX



The following table outlines the data files delivered with this eSHAPE db gene package.


File Description
GENEX.bedGene annotation coordinate file used to compute gene metrics.
GENEX.faGene annotation sequence fasta file to be used as input into the RNAStructure algorithm with the .shape file above for fold prediction.
eSHAPE_db-K562_invitro_DMSO_GENEX.bameSHAPE db-K562 invitro DMSO (untreated control) reads aligned on the gene. *
eSHAPE_db-K562_invitro_NAI_GENEX.bameSHAPE db-K562 invitro NAI (treated) reads aligned on the gene. *
eSHAPE_db-K562_invitro_reactivity_GENEX.bedgraphReactivity score at each position across the gene with >300x coverage. *
eSHAPE_db-K562_invitro_reactivity_GENEX.shapeReactivity values formatted for use as input into the RNAStructure algorithm to guide the fold prediction.



Each .bam file also has a corresponding .bai index file to facilitate visualization in a genome viewer.

* Uses hg38 reference genome.