tsrPicker
The tsrPicker
tool identifies transcription start regions from PRO-Cap, PRO-Seq, and related sequencing experiments.
Usage
Usage:
PolTools tsrPicker [-h] [-r [radius]] seq_file min_seq_depth
Required Arguments |
Description |
---|---|
Sequencing File |
Bed formatted file from a sequencing experiment. |
Min Seq Depth |
The minimum number of 5’ reads to be considered as a TSR. |
Optional Arguments |
Description |
---|---|
-r, –radius |
Number of base pairs to expand the TSR from. Default is 5. For example, a value of 5 generates 11 bp TSRs as 5 base pairs are added on each side. |
Behavior
tsrPicker
generates a file named the sequencing file plus the minimum sequencing depth and -TSR.bed.
This bed formatted file contains TSRs with the number of 5’ ends at the max TSS. tsrPicker
selects the
base with the largest number of 5’ ends and generates a TSR of size 1 + (2 * radius) centered on that base. This process
is repeated with all bases that are not within a TSR that meet the minimum sequencing depth.
For example:
$ head seq_file.bed
chr1 11981 12023 A00876:119:HW5F5DRXX:1:2168:2248:1407 255 -
chr1 13099 13117 A00876:119:HW5F5DRXX:1:2203:31403:26757 255 -
chr1 13356 13423 A00876:119:HW5F5DRXX:1:2151:15808:7827 255 -
chr1 13435 13477 A00876:119:HW5F5DRXX:1:2273:15781:19241 255 -
chr1 13739 13772 A00876:119:HW5F5DRXX:1:2256:29966:10520 255 -
chr1 13741 13773 A00876:119:HW5F5DRXX:1:2235:4101:11882 255 -
chr1 14178 14203 A00876:119:HW5F5DRXX:1:2115:8241:31422 255 -
chr1 14734 14768 A00876:119:HW5F5DRXX:1:2165:23764:2440 255 -
chr1 14988 15012 A00876:119:HW5F5DRXX:1:2219:16134:32784 255 -
chr1 18337 18362 A00876:119:HW5F5DRXX:1:2149:32054:31328 255 -
$ PolTools tsrPicker seq_file.bed 200
$ head seq_file_min_200-TSR.tab
chr1 156216434 156216445 TSR740 88595 +
chr1 149832651 149832662 TSR741 24039 +
chr1 203305513 203305524 TSR742 18139 +
chr1 146376801 146376812 TSR743 16896 +
chr1 16740510 16740521 TSR744 16140 +
chr1 149851056 149851067 TSR745 15735 +
chr1 16895974 16895985 TSR746 15223 +
chr1 28648594 28648605 TSR747 15157 +
chr1 85580755 85580766 TSR748 14528 +
chr1 148522595 148522606 TSR749 13905 +