.gff file you find further information about the coverage of mapping reads mapped by SSAHA. The coverage of perfect mapping reads before and after the correction, and a pileup representation of three rows upstream and downstream of the called correction. It is a representation of multiple alignments over this region. An example of an entry is:
6. When iCORN finishes
iCORN stops after the set amount of iteration, or if it does find no more errors.
After the last iteration, all the Reference.X..gff files are joined to All..gff.
The files Stats.Mapping.csv and Stats.Correction.csv are generated.
Beside the ssaha coverage plots, also perfect mapping plots are generated in the directory plots/. The plots are generated for the original sequence and the "best corrected" reference (Final.corrected.fa to symbolic link to Reference.X).
7. How to read the iCORN results
First the Stats.Mapping.csv file should be consulted. Depending on the quality of the fastq file, around 90% of the reads should map. Depending on the amount of repeats the amount of uniquely mapping reads (read pairs) should be 60-80%. If mapping to a draft genome, this number might be around 40% as most mate pairs are on different contigs.
In Stats.Correction. csv for each "correction tag" the amount of found events are reported. If the reference is a multiple fasta file, this file is difficult to read.
Next the results can be loaded in Artemis:
* Start Artemis
* Load the sequence or EMBL file. ( + )
* Load feature file All.Reference.gff file. (If your reference was in multi-fasta file, you will need to open each sequence on his own. The sequence will be in Sequence.1/ (original file) and in Sequence.best/ (for the best correction).
* Load the graph from the plot directory. ( + )
* Then you can analyse the correction.
* A good tutorial for Artemis can be found here.
7. Optional transformation of results
Transform results to Gap4 - sorry program still beta version:
You will further need the padded consensus of the "reference" sequence. The contigs/chromosomes must have the same name in the padded and unpadded version.
Call: icorn.ToGap4.pl < <(cat All.*.gff)>
A TAG file with the name Resultname.gap4 is generated that can be loaded in GAP4 or GAP5. The information are equivalent to the All..gff files.