When running GeneFinding, the sequences of the predicted genes are named automatically. The first part of each sequence identifier comes from the name of the genome reference sequence (de novo assembly), and the suffix _orfx is then appended, where x is a number. Sometimes this name is not useful for downstream analysis or for comparing results with other experiments. Is there any way
Sometimes databases provide the whole genome and the GFF or GTF files but not the exon or CDS FASTA files. With OmicsBox/Blast2GO it is possible to load the genome FASTA sequences and extract the exons or the CDS from them using the GFF file.
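For readers who want to see what this extraction involves, here is a minimal standalone sketch in plain Python. It is not how OmicsBox/Blast2GO implements the feature. The function names (`parse_fasta`, `extract_features`) are hypothetical. The sketch assumes a GFF3-style file with nine tab-separated columns, 1-based inclusive coordinates, and a single `Parent=` attribute per feature; GTF attributes and features with multiple parents are not handled.

```python
from collections import defaultdict

# Translation table used to complement DNA bases (N stays N).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def parse_fasta(lines):
    """Return {sequence_id: sequence} from an iterable of FASTA lines."""
    records, name = {}, None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token as the identifier.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(chunks) for key, chunks in records.items()}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_features(genome, gff_lines, feature_type="CDS"):
    """Join all features of one type that share a Parent attribute.

    genome: dict from parse_fasta. gff_lines: iterable of GFF3 lines.
    Returns {parent_id: concatenated sequence}.
    """
    segments = defaultdict(list)
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != feature_type:
            continue
        seqid, start, end, strand = cols[0], int(cols[3]), int(cols[4]), cols[6]
        attrs = dict(f.split("=", 1) for f in cols[8].split(";") if "=" in f)
        # Fall back to the feature's own ID when it has no Parent.
        parent = attrs.get("Parent") or attrs.get("ID")
        if parent is None:
            continue
        segments[parent].append((start, end, strand, seqid))

    result = {}
    for parent, segs in segments.items():
        segs.sort()  # ascending genomic order
        # GFF coordinates are 1-based and inclusive, hence start - 1.
        seq = "".join(genome[seqid][start - 1:end] for start, end, _, seqid in segs)
        if segs[0][2] == "-":
            # Minus strand: reverse-complement the joined sequence.
            seq = reverse_complement(seq)
        result[parent] = seq
    return result


if __name__ == "__main__":
    # Toy data for illustration only.
    genome = parse_fasta([">chr1 toy contig", "TTATGAAACC", "GGGTAGTT"])
    gff = [
        "##gff-version 3",
        "chr1\tdemo\tCDS\t3\t8\t.\t+\t0\tID=cds1;Parent=tx1",
        "chr1\tdemo\tCDS\t11\t16\t.\t+\t0\tID=cds2;Parent=tx1",
    ]
    for parent, seq in extract_features(genome, gff).items():
        print(f">{parent}\n{seq}")
```

Passing `feature_type="exon"` instead of the default `"CDS"` would join exon features in the same way.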
Tips And Tricks
Helpful Features, Tips and Tricks
Use Cases, Reviews, Tutorials
Product Tutorial, Quickstarts, New Features, etc.