Pausing Distance Distribution from Max TSS

The pausing_distance_distribution_from_maxTSS tool reports the number of transcripts for each transcript length starting at the center of the regions provided (typically the 5’ end at the maxTSS).

Usage

Usage:

PolTools pausing_distance_distribution_from_maxTSS
     [-h] [-t [threads]]
     regions_filename sequencing_files [sequencing_files ...]

Required Arguments

Description

Regions filename

Bed formatted file containing all the regions you want to quantify (must be centered on +1 nt). This file can be generated from the make_regions_file_centered_on_max_tss program

Sequencing Files

Bed formatted file from a sequencing experiment.

Optional Arguments

Description

-m, –max_transcript_length

Longest transcript length in the output. Default is 100.

-t, –threads

Maximum number of threads. Default is the number of threads on the system. This program will not use more threads than the number of sequencing files provided

Sequencing Files

Additional sequencing files can be provided.

Behavior

pausing_distance_distribution_from_maxTSS will report the transcript length and the number of transcripts that length that begin at the center of the regions (typically +1 nt)

For example:

_images/pausing_distance_distribution_from_maxTSS.png

$ head seq_file.bed
chr1    11981   12023   A00876:119:HW5F5DRXX:1:2168:2248:1407   255     -
chr1    13099   13117   A00876:119:HW5F5DRXX:1:2203:31403:26757 255     -
chr1    13356   13423   A00876:119:HW5F5DRXX:1:2151:15808:7827  255     -
chr1    13435   13477   A00876:119:HW5F5DRXX:1:2273:15781:19241 255     -
chr1    13739   13772   A00876:119:HW5F5DRXX:1:2256:29966:10520 255     -
chr1    13741   13773   A00876:119:HW5F5DRXX:1:2235:4101:11882  255     -
chr1    14178   14203   A00876:119:HW5F5DRXX:1:2115:8241:31422  255     -
chr1    14734   14768   A00876:119:HW5F5DRXX:1:2165:23764:2440  255     -
chr1    14988   15012   A00876:119:HW5F5DRXX:1:2219:16134:32784 255     -
chr1    18337   18362   A00876:119:HW5F5DRXX:1:2149:32054:31328 255     -

$ head regions_centered_on_max_tss.bed
chr1    959251  959261  NOC2L   46      -
chr1    960627  960637  KLHL17  27      +
chr1    966516  966526  PLEKHN1 8       +
chr1    1000092 1000102 HES4    87      -
chr1    1000290 1000300 ISG15   12      +
chr1    1020114 1020124 AGRN    35      +
chr1    1074302 1074312 RNF223  10      -
chr1    1116102 1116112 C1orf159        9       -
chr1    1231967 1231977 SDF4    321     -
chr1    1232237 1232247 B3GALT6 174     +

$ PolTools pausing_distance_distribution_from_maxTSS regions_centered_on_max_tss.bed seq_file.bed -m 30
Transcript Length       seq_file.bed
0       0
1       0
2       0
3       0
4       0
5       0
6       0
7       0
8       0
9       0
10      0
11      0
12      0
13      0
14      0
15      0
16      0
17      2292
18      2183
19      2286
20      2665
21      2996
22      3558
23      3416
24      3925
25      4998
26      5632
27      6737
28      8640
29      11155
30      15272