DDX21


This analysis report summarizes DDX21 eCLIP data from ENCODE. The current tab provides an overview of the results, and the other tabs list identified peaks and motifs.

Start your customizable DDX21 eCLIP today:

Experiment Summary


Experiment ID File ID Sample Antibody source Antibody ID Antibody lot Release date Laboratory Number of peaks
ENCSR040QLVENCFF378OYAK562MBLIRN090PW0012018-08-30Gene Yeo, UCSD73

DDX21 Peak Assignments


The following plots show the assignment of peaks into different genic features:

Introns contain other subdivisions, please see the subsequent plot for a breakdown of those features.

This plot is interactive. You can hover over the bars to see the percentage and number of identified peaks. You can select which features to show by toggling the region names in the legend, and you can download the plot by moving your mouse to the top of the plot and clicking on the camera icon.

DDX21 Intron Assignments


The following plots show the assignment of peaks into different intronic features:

  •  5’ splice site: the peak is within the first 100 bp of an intron (5’ to 3’ direction
  •  3’ splice site: the peak is within the last 100 bp of an intron (5’ to 3’ direction)
  •  Proximal intron: the peak is within 100 bp to 500 bp of the nearest exon
  •  Distal intron: greater than 500 bp away from the nearest exon

This plot is interactive. You can hover over the bars to see the percentage and number of identified peaks. You can select which features to show by toggling the region names in the legend, and you can download the plot by moving your mouse to the top of the plot and clicking on the camera icon.

DDX21 Motifs


The following plot shows a summary of the top five motifs by significance in each experiment. Motifs were detected with varying length parameters, you may observe a similar motif with different number of bases included multiple times. A motif present in more peaks and with a higher significance will be in the upper right quadrant of the plot, while the lower left quadrant has motifs that are less significant and in fewer peaks.

This plot is interactive. You can hover over the bars to see the percentage and number of identified peaks. You can select which features to show by toggling the region names in the legend, and you can download the plot by moving your mouse to the top of the plot and clicking on the camera icon.



The following table contains called DDX21 peaks. To identify peaks, clusters (regions of read enrichment) in the immunoprecipitated (IP) samples were found using the peak calling tool CLIPper. To account for background signal, a cluster was identified as a peak if the log2 fold enrichment over input was ≥ 3 and the p-value ≤ 0.001.

Peaks were annotated using transcript information from GENCODE. Each annotated peak is labeled with specific annotation feature types, first split by coding and non-coding transcripts, then by transcript regions, and then by intron/exon proximity regions. For overlapping transcript regions, the following hierarchy is used to label the region: coding sequence (CDS), 5’ or 3’ untranslated region (UTR), intron, non-coding exon, then non-coding intron. For example, if a peak is in the 5’ UTR of one transcript that overlaps an intron of another transcript, the peak region will be labeled as 5’ UTR. All gene annotations are from GENCODE release v41.

The following columns are included in the table:

  •  Chromosome: chromosome where the peak is located.
  •  Peak start: start coordinate for the peak.
  •  Peak stop: stop coordinate for the peak.
  •  Strand: strand of DNA where the peak was called.
  •  -log10(p-value): significance of the cluster call by CLIPper. Values are log transformed so a larger number indicates a greater significance.
  •  log2(fold change): log transformed fold enrichment of the IP signal over a matched input.
  •  Gene name: name (symbol) of the gene that overlaps with the peak.
  •  Ensembl ID: unique identifer of the gene that overlaps with the peak.
  •  Feature: which part of the listed gene overlaps with the peak.

Please note, this table is interactive. You can search for specific genes or features, and can sort the table by any of the columns.

ENCSR040QLV (K562) Peaks


Chromosome Peak start Peak stop Strand -log10(p-value) log2(fold change) Gene name Ensembl ID Feature
chr2238,486,18038,486,285-7.2963.211DDX17ENSG00000100201.23CDS
chr9133,350,899133,350,915+77.8283.390SNORD36CENSG00000252542.1ncRNA
chr9133,350,857133,350,899+87.8853.298SNORD36CENSG00000252542.1ncRNA
chr772,897,87972,897,935+3.8104.585POM121ENSG00000196313.13Distal intron
chr3198,174,489198,174,542+6.1375.522FAM157AENSG00000236438.7ncRNA
chr8141,415,772141,415,828+5.5275.529PTP4A3ENSG00000184489.13Distal intron
chr933,952,77533,952,835-7.2953.633SNORD121AENSG00000238886.1ncRNA
chr1538,484,49138,484,541+4.5614.515FAM98BENSG00000171262.11CDS
chr218,401,8228,401,837+173.61710.880ENSG00000280441ENSG00000280441.3ncRNA
chr218,401,8378,401,858+91.53110.090ENSG00000280441ENSG00000280441.3ncRNA
chr218,401,8188,401,822+177.6439.914ENSG00000280441ENSG00000280441.3ncRNA
chr218,401,7938,401,818+400.0009.446ENSG00000280441ENSG00000280441.3ncRNA
chr218,401,7898,401,793+172.4858.212ENSG00000280441ENSG00000280441.3ncRNA
chr218,401,7858,401,789+59.2546.911ENSG00000280441ENSG00000280441.3ncRNA
chr126,510,5246,510,537+31.6843.318SCARNA10ENSG00000239002.3ncRNA
chr126,510,5436,510,546+25.6443.145SCARNA10ENSG00000239002.3ncRNA
chr1719,062,07819,062,140+35.6977.114SNORD3B-1ENSG00000265185.6ncRNA
chr12131,917,329131,917,400+4.7063.207ULK1ENSG00000177169.103' splice site
chr1759,648,28559,648,340+4.7083.514CLTCENSG00000141367.13CDS
chr9123,267,759123,267,798-6.1375.841STRBPENSG00000165209.19Distal intron
chr1946,197,94746,197,999-14.3183.421IGFL2-AS1ENSG00000268621.6ncRNA
chr218,213,9318,213,979+10.7835.661ENSG00000278996ENSG00000278996.1ncRNA
chr1117,074,72817,074,739-10.5823.835SNORD14AENSG00000272034.1ncRNA
chr1193,721,68593,721,720+57.2723.479SCARNA9ENSG00000254911.3ncRNA
chr1162,854,61962,854,646-29.7033.198SNHG1|SNORD28ENSG00000274544.1||ENSG00000255717.9ncRNA
chrX149,940,294149,940,341+6.7505.729EOLA2-DTENSG00000235703.7ncRNA
chrM14,46414,492-25.5233.241MT-ND6ENSG00000198695.2CDS
chr1689,489,34489,489,396-6.3823.676ANKRD11ENSG00000167522.17Distal intron
chr7152,148,084152,148,130-12.4735.513KMT2CENSG00000055609.21CDS
chr191,409,2551,409,282+4.7083.374DAZAP1ENSG00000071626.17Distal intron
chr11123,058,907123,058,910-68.8033.652HSPA8ENSG00000109971.145' UTR
chr11123,058,934123,058,973-400.0003.356HSPA8ENSG00000109971.145' UTR
chr11123,058,911123,058,921-61.5863.212HSPA8ENSG00000109971.145' UTR
chr11123,058,921123,058,934-64.6933.139HSPA8ENSG00000109971.145' UTR
chr218,218,7608,218,838+400.0006.808ENSG00000278996ENSG00000278996.1ncRNA
chr218,218,7418,218,760+400.0006.554ENSG00000278996ENSG00000278996.1ncRNA
chr2026,209,16226,209,184-12.6126.276MIR663AHGENSG00000227195.11ncRNA
chr1117,074,70317,074,721-66.8793.930SNORD14AENSG00000272034.1ncRNA
chr198,485,8868,485,960+10.5063.600HNRNPMENSG00000099783.12CDS
chr1913,151,52613,151,557+4.5614.341IER2ENSG00000160888.7Distal intron
chr1117,075,83617,075,851-35.7963.521SNORD14BENSG00000201403.1ncRNA
chr1117,075,77617,075,816-400.0003.355SNORD14BENSG00000201403.1ncRNA
chr1117,075,81617,075,836-50.1653.166SNORD14BENSG00000201403.1ncRNA
chr218,217,9938,218,065+29.1575.207ENSG00000278996ENSG00000278996.1ncRNA
chr89,772,1009,772,175+7.6624.890TNKSENSG00000173273.16Proximal intron
chr685,643,10785,643,168-6.5373.981SYNCRIPENSG00000135316.19Proximal intron
chr616,266,16816,266,181+38.3314.997GMPRENSG00000137198.10Distal intron
chr616,266,18116,266,209+85.6234.675GMPRENSG00000137198.10Distal intron
chr616,266,20916,266,252+73.9404.125GMPRENSG00000137198.10Distal intron
chr11126,276,325126,276,366+5.1585.515FOXRED1ENSG00000110074.123' splice site
chr5112,161,580112,161,618+60.2463.085SNORA13|EPB41L4A-AS1ENSG00000238363.1||ENSG00000224032.9ncRNA
chr1782,272,83082,272,881-7.0953.779CSNK1DENSG00000141551.15Proximal intron
chr20381,613381,669+4.0663.109TRIB3ENSG00000101255.12Proximal intron
chr1108,746,758108,746,785+4.4764.320STXBP3ENSG00000116266.11CDS
chr296,894,30996,894,359-46.3645.747FAM178BENSG00000168754.15Proximal intron
chr296,894,22896,894,309-54.5944.124FAM178BENSG00000168754.15Proximal intron
chr16397,740397,777+5.5275.718NME4ENSG00000103202.135' UTR
chr198,486,0268,486,098+5.7453.205HNRNPMENSG00000099783.12CDS
chr482,895,85282,895,924-25.6103.279THAP9-AS1ENSG00000251022.6ncRNA
chr2026,209,31826,209,328-42.9325.892MIR663AHGENSG00000227195.11ncRNA
chr2026,209,32826,209,389-400.0005.650MIR663AHGENSG00000227195.11ncRNA
chr11123,059,370123,059,373-400.0003.285HSPA8ENSG00000109971.145' UTR
chr11123,059,373123,059,415-400.0003.267HSPA8ENSG00000109971.145' UTR
chr11123,059,332123,059,370-400.0003.212HSPA8ENSG00000109971.145' UTR
chr9122,880,220122,880,261-14.6143.074SNORD90ENSG00000212447.1ncRNA
chr218,208,0988,208,145+4.8533.155hsa-mir-3648-1MI0016048miRNA
chrX119,787,365119,787,399-32.5433.008SNORA69ENSG00000206622.1ncRNA
chrM1,5301,543+400.0003.852MT-RNR1ENSG00000211459.2ncRNA
chrM1,5501,559+400.0003.799MT-RNR1ENSG00000211459.2ncRNA
chr540,832,65640,832,727-25.5233.021SNORD72ENSG00000212296.1ncRNA
chr1117,074,65117,074,689-44.7203.159SNORD14AENSG00000272034.1ncRNA
chr1173,866,305173,866,311-25.3783.926GAS5ENSG00000234741.10ncRNA
chr1173,866,311173,866,330-25.6313.824GAS5ENSG00000234741.10ncRNA

The following table contains identified motifs, where a motif is a short sequence of nucleotides that is significantly enriched in called peaks. Motifs were identified using HOMER's findMotifsGenome.pl tool. Motifs were called at multiple lengths, and results from all examined lengths are aggregated in the table.

The following columns are included in the table:

  •  Motif: logo plot for the identified motif.
  •  IUPAC: sequence of motif in IUPAC code.
  •  Motif length: length of the called motif.
  •  P-value: Significance of the called motif.
  •  -log10(p-value): log transformed significance, a larger number indicates a greater significance.
  •  % Peaks: percent of examined peaks with the identified motif.
  •  % Background: percent of non-peak background regions with the motif.

Please note, this table is interactive. You can search for specific sequences, and can sort the table by any of the columns. The search buttons at thr bottom of the table allow for specific filtering by sequence or motif length.

ENCSR040QLV (K562) Motifs


Motif IUPAC sequence Motif length P-value -log10(p-value) % Peaks % Background
ACAUUCGUAG101e-1228.545.970.0