SF3B1
This analysis report summarizes SF3B1 eCLIP data from ENCODE. The current tab provides an overview of the results, and the other tabs list identified peaks and motifs.
Start your customizable SF3B1 eCLIP today:
Experiment Summary
| Experiment ID | File ID | Sample | Antibody source | Antibody ID | Antibody lot | Release date | Laboratory | Number of peaks | 
|---|---|---|---|---|---|---|---|---|
| ENCSR133QEA | ENCFF887ARJ | K562 | Abcam | ab66774 | GR216813-1 | 2016-10-31 | Gene Yeo, UCSD | 35 | 
SF3B1 Peak Assignments
The following plots show the assignment of peaks into different genic features:
Introns contain other subdivisions, please see the subsequent plot for a breakdown of those features.
This plot is interactive. You can hover over the bars to see the percentage and number of identified peaks. You can select which features to show by toggling the region names in the legend, and you can download the plot by moving your mouse to the top of the plot and clicking on the camera icon.
SF3B1 Intron Assignments
The following plots show the assignment of peaks into different intronic features:
- 5’ splice site: the peak is within the first 100 bp of an intron (5’ to 3’ direction
- 3’ splice site: the peak is within the last 100 bp of an intron (5’ to 3’ direction)
- Proximal intron: the peak is within 100 bp to 500 bp of the nearest exon
- Distal intron: greater than 500 bp away from the nearest exon
This plot is interactive. You can hover over the bars to see the percentage and number of identified peaks. You can select which features to show by toggling the region names in the legend, and you can download the plot by moving your mouse to the top of the plot and clicking on the camera icon.
SF3B1 Motifs
The following plot shows a summary of the top five motifs by significance in each experiment. Motifs were detected with varying length parameters, you may observe a similar motif with different number of bases included multiple times. A motif present in more peaks and with a higher significance will be in the upper right quadrant of the plot, while the lower left quadrant has motifs that are less significant and in fewer peaks.
This plot is interactive. You can hover over the bars to see the percentage and number of identified peaks. You can select which features to show by toggling the region names in the legend, and you can download the plot by moving your mouse to the top of the plot and clicking on the camera icon.
The following table contains called SF3B1 peaks. To identify peaks, clusters (regions of read enrichment) in the immunoprecipitated (IP) samples were found using the peak calling tool CLIPper. To account for background signal, a cluster was identified as a peak if the log2 fold enrichment over input was ≥ 3 and the p-value ≤ 0.001.
Peaks were annotated using transcript information from GENCODE. Each annotated peak is labeled with specific annotation feature types, first split by coding and non-coding transcripts, then by transcript regions, and then by intron/exon proximity regions. For overlapping transcript regions, the following hierarchy is used to label the region: coding sequence (CDS), 5’ or 3’ untranslated region (UTR), intron, non-coding exon, then non-coding intron. For example, if a peak is in the 5’ UTR of one transcript that overlaps an intron of another transcript, the peak region will be labeled as 5’ UTR. All gene annotations are from GENCODE release v41.
The following columns are included in the table:
- Chromosome: chromosome where the peak is located.
- Peak start: start coordinate for the peak.
- Peak stop: stop coordinate for the peak.
- Strand: strand of DNA where the peak was called.
- -log10(p-value): significance of the cluster call by CLIPper. Values are log transformed so a larger number indicates a greater significance.
- log2(fold change): log transformed fold enrichment of the IP signal over a matched input.
- Gene name: name (symbol) of the gene that overlaps with the peak.
- Ensembl ID: unique identifer of the gene that overlaps with the peak.
- Feature: which part of the listed gene overlaps with the peak.
Please note, this table is interactive. You can search for specific genes or features, and can sort the table by any of the columns.
ENCSR133QEA (K562) Peaks
| Chromosome | Peak start | Peak stop | Strand | -log10(p-value) | log2(fold change) | Gene name | Ensembl ID | Feature | 
|---|---|---|---|---|---|---|---|---|
| chr8 | 23,571,198 | 23,571,230 | + | 6.276 | 4.743 | SLC25A37 | ENSG00000147454.14 | Proximal intron | 
| chr19 | 893,560 | 893,589 | + | 33.143 | 5.342 | RNU6-9 | ENSG00000207507.1 | ncRNA | 
| chr22 | 42,615,300 | 42,615,302 | + | 400.000 | 4.485 | ENSG00000270022|RNU12 | ENSG00000270022.3||ENSG00000276027.1 | ncRNA | 
| chr22 | 42,615,353 | 42,615,358 | + | 400.000 | 4.220 | RNU12|ENSG00000270022 | ENSG00000270022.3||ENSG00000276027.1 | ncRNA | 
| chr22 | 42,615,306 | 42,615,347 | + | 400.000 | 3.714 | ENSG00000270022|RNU12 | ENSG00000276027.1||ENSG00000270022.3 | ncRNA | 
| chr22 | 42,615,347 | 42,615,353 | + | 400.000 | 3.714 | ENSG00000270022|RNU12 | ENSG00000276027.1||ENSG00000270022.3 | ncRNA | 
| chr16 | 2,770,289 | 2,770,378 | + | 43.646 | 5.106 | SRRM2 | ENSG00000167978.17 | CDS | 
| chr16 | 2,770,254 | 2,770,289 | + | 14.389 | 4.736 | SRRM2 | ENSG00000167978.17 | 3' splice site | 
| chr2 | 96,894,291 | 96,894,370 | - | 14.870 | 4.676 | FAM178B | ENSG00000168754.15 | Proximal intron | 
| chr2 | 96,894,229 | 96,894,290 | - | 15.996 | 4.297 | FAM178B | ENSG00000168754.15 | Proximal intron | 
| chr19 | 49,104,576 | 49,104,620 | + | 22.644 | 5.246 | SNRNP70 | ENSG00000104852.15 | 3' splice site | 
| chr19 | 49,104,520 | 49,104,576 | + | 24.123 | 5.062 | SNRNP70 | ENSG00000104852.15 | 3' splice site | 
| chr1 | 26,474,014 | 26,474,072 | + | 16.005 | 5.257 | HMGN2 | ENSG00000198830.11 | 3' splice site | 
| chr6 | 43,778,196 | 43,778,243 | + | 400.000 | 4.804 | VEGFA | ENSG00000112715.26 | Proximal intron | 
| chr6 | 43,778,133 | 43,778,196 | + | 400.000 | 4.046 | VEGFA | ENSG00000112715.26 | Proximal intron | 
| chr6 | 43,778,418 | 43,778,463 | + | 26.355 | 3.956 | VEGFA | ENSG00000112715.26 | CDS | 
| chr16 | 2,760,203 | 2,760,298 | + | 23.812 | 3.274 | SRRM2 | ENSG00000167978.17 | 3' splice site | 
| chr19 | 54,461,135 | 54,461,242 | + | 38.596 | 3.376 | LENG8 | ENSG00000167615.17 | CDS | 
| chr11 | 62,841,687 | 62,841,721 | - | 400.000 | 7.105 | WDR74 | ENSG00000133316.16 | 5' UTR | 
| chr11 | 62,841,666 | 62,841,687 | - | 55.420 | 5.882 | WDR74 | ENSG00000133316.16 | 5' UTR | 
| chr8 | 23,568,299 | 23,568,353 | + | 15.819 | 3.882 | SLC25A37 | ENSG00000147454.14 | CDS | 
| chr17 | 43,387,332 | 43,387,392 | - | 24.712 | 9.133 | LINC00910|RNU2-4P | ENSG00000188825.16||ENSG00000277084.1 | ncRNA | 
| chr6 | 43,778,269 | 43,778,294 | + | 16.075 | 6.306 | VEGFA | ENSG00000112715.26 | Proximal intron | 
| chr11 | 62,841,758 | 62,841,780 | - | 46.106 | 10.159 | WDR74 | ENSG00000133316.16 | 5' UTR | 
| chr11 | 62,841,736 | 62,841,758 | - | 108.284 | 8.838 | WDR74 | ENSG00000133316.16 | 5' UTR | 
| chr11 | 62,841,721 | 62,841,736 | - | 142.320 | 8.256 | WDR74 | ENSG00000133316.16 | 5' UTR | 
| chr21 | 8,401,805 | 8,401,829 | + | 15.443 | 7.339 | ENSG00000280441 | ENSG00000280441.3 | ncRNA | 
| chr21 | 8,401,785 | 8,401,805 | + | 15.443 | 7.228 | ENSG00000280441 | ENSG00000280441.3 | ncRNA | 
| chr21 | 8,401,829 | 8,401,851 | + | 5.086 | 7.085 | ENSG00000280441 | ENSG00000280441.3 | ncRNA | 
| chr22 | 42,615,265 | 42,615,288 | + | 400.000 | 4.951 | ENSG00000270022|RNU12 | ENSG00000270022.3||ENSG00000276027.1 | ncRNA | 
| chr22 | 42,615,288 | 42,615,300 | + | 400.000 | 4.791 | RNU12|ENSG00000270022 | ENSG00000276027.1||ENSG00000270022.3 | ncRNA | 
| chr21 | 8,218,741 | 8,218,769 | + | 44.152 | 6.232 | ENSG00000278996 | ENSG00000278996.1 | ncRNA | 
| chr21 | 8,218,769 | 8,218,791 | + | 48.661 | 6.079 | ENSG00000278996 | ENSG00000278996.1 | ncRNA | 
| chr22 | 42,615,254 | 42,615,261 | + | 400.000 | 5.018 | RNU12|ENSG00000270022 | ENSG00000276027.1||ENSG00000270022.3 | ncRNA | 
| chr11 | 17,077,496 | 17,077,551 | - | 13.599 | 6.394 | RPS13 | ENSG00000110700.7 | 5' splice site | 
The following table contains identified motifs, where a motif is a short sequence of nucleotides that is significantly enriched in called peaks. Motifs were identified using HOMER's findMotifsGenome.pl tool. Motifs were called at multiple lengths, and results from all examined lengths are aggregated in the table.
The following columns are included in the table:
- Motif: logo plot for the identified motif.
- IUPAC: sequence of motif in IUPAC code.
- Motif length: length of the called motif.
- P-value: Significance of the called motif.
- -log10(p-value): log transformed significance, a larger number indicates a greater significance.
- % Peaks: percent of examined peaks with the identified motif.
- % Background: percent of non-peak background regions with the motif.
Please note, this table is interactive. You can search for specific sequences, and can sort the table by any of the columns. The search buttons at thr bottom of the table allow for specific filtering by sequence or motif length.