Inputs

ORFribo inputs#

ORFribo takes as inputs several files:

A fasta file containing the nucleotide sequences of the chromosomes or contigs of a genome
The original annotation of the genome in a gff file (see the GFF3 documentation for more details on the gff3 format).
The gff file containing the ORFs to be analyzed. We strongly recommend using the gff file that contains the ORF annotation of the input genome according to ORFtrack definition and parameters (ORF length and overlap). That said, the user can provide his/her own ORF annotation file. In this case, the ORF category of each annotated ORF (even if there is only one category, it must be named explicitly) must be indicated in the 3rd column of the gff file (see the ORFtrack output as example in ORFmine/examples/database/mapping_orf_Scer.gff)
A fasta file with the sequences you do not want to treat and that you want to remove from the mapping, usually rRNA sequences (Ex : Scer_rRNA.fa)
Ribosome Profiling data (i.e. the fastq(.gz) files).

An example of inputs can be found in the ORFmine/examples/ directory.