When running GeneFinding the sequences receive a name with the predicted genes. The first part of the sequence identifier comes from the genome reference sequence name (de-novo assembly) and then a _orfx is appended, where x is a number. Sometimes this name is not useful to proceed with downstream analysis or compare results from other experiments. Is there any way
Introduction You have: Newly aligned genome of a bacterial non-model organism. You want: Perform functional annotation and analysis of its potential proteins. You need: Predict all potential genes or coding regions before proceeding to the functional annotation: Gene-Finding How can this be done? Use Glimmer, a set of algorithms which uses interpolated Markov models to distinguish coding from non-coding DNA in bacteria, archaea, and viruses. Glimmer
Tips And Tricks
Helpful Features, Tips and Tricks
Use Cases, Reviews, Tutorials
Product Tutorial, Quickstarts, New Features, etc.