Analysis | Globe Artichoke Genome Database

SSR mining
Perfect, imperfect and compound SSRs were mined in silico using the SSR-search module of SciRoKo (http://kofler.or.at/bioinformatics/SciRoKo). A minimum of four repeats and a minimum length of 15 nt were required, so a sequence was considered a perfect SSR when its motif was repeated at least 15 times (1-nt motif), eight times (2-nt), five times (3-nt) or four times (4- to 6-nt), allowing for only one mismatch. For compound repeats, the maximum interruption (spacer) length was set to the default of 100 bp.
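The thresholds above can be illustrated with a minimal Python sketch. This is not SciRoKo's algorithm: it is a simplified, regex-based scan that reports only strictly perfect repeats (no mismatch tolerance, no compound SSRs) and returns non-overlapping matches per motif length. The names `MIN_REPEATS`, `is_primitive` and `find_perfect_ssrs` are hypothetical and chosen for illustration only.

```python
import re

# Minimum repeat counts per motif length, taken from the thresholds in the text.
MIN_REPEATS = {1: 15, 2: 8, 3: 5, 4: 4, 5: 4, 6: 4}


def is_primitive(motif):
    """True if the motif is not a repetition of a shorter unit (e.g. 'ATAT' is not)."""
    n = len(motif)
    return not any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_perfect_ssrs(seq):
    """Return sorted (start, end, motif, repeats) tuples; 0-based, end exclusive."""
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # A motif of `size` bases followed by at least (min_rep - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip e.g. 'AAA' or 'ATAT', which belong to a shorter motif class.
            if is_primitive(motif):
                repeats = (m.end() - m.start()) // size
                hits.append((m.start(), m.end(), motif, repeats))
    return sorted(hits)


if __name__ == "__main__":
    demo = "CC" + "AT" * 8 + "GG" + "A" * 15 + "C" + "ACG" * 4
    for hit in find_perfect_ssrs(demo):
        print(hit)
```

In the demo sequence, the `(ACG)4` stretch is not reported because tri-nucleotide motifs need at least five repeats under these settings.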

SSR motif frequency and distribution
In total, about 177,200 perfect SSR motifs were identified, including 37,748 compound SSRs. The SSR loci were classified by repeat motif and by their distribution over linkage groups (Table 1), as well as by the number of repeat units (Table 2). Di-nucleotides are the most frequent (73.0%), followed by tri- (11.2%), tetra- (6.1%) and mono-nucleotides (4.7%); penta- and hexa-nucleotides are rare (2.4% and 2.5%, respectively). More than 224,000 imperfect SSR motifs were also identified.
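As a rough illustration of how such class shares can be derived, the sketch below tallies SSR loci by motif length and converts the counts to percentages. The function name `class_shares` and the toy input are hypothetical; they are not the database's data or code.

```python
from collections import Counter

# Motif length -> class name used in the text.
CLASS_NAMES = {1: "mono", 2: "di", 3: "tri", 4: "tetra", 5: "penta", 6: "hexa"}


def class_shares(motifs):
    """Percentage of SSR loci per motif-length class, rounded to one decimal."""
    counts = Counter(CLASS_NAMES[len(m)] for m in motifs)
    total = sum(counts.values())
    return {
        name: round(100.0 * counts[name] / total, 1)
        for name in CLASS_NAMES.values()
        if counts[name]
    }


if __name__ == "__main__":
    # Toy input for illustration only: one motif per detected SSR locus.
    print(class_shares(["AT", "AG", "A", "AAG"]))
```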

Table 1 – Chromosome-wise distribution of perfect SSRs

Circos plot of perfect SSRs. From outside to inside: all SSRs, mono-, di-, tri-, tetra-, penta- and hexa-nucleotides. The central histogram represents the overall microsatellite distribution.

Table 2 – Frequency of the main identified SSR motifs (considering sequence complementarity)
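Counting motifs "considering sequence complementarity" usually means that a motif, its cyclic shifts and its reverse complement are pooled into one class (for example AG, GA, CT and TC). The page does not state the exact grouping rule it used, so the following Python sketch shows one common convention under that assumption; `canonical_motif` is a hypothetical helper name.

```python
# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical_motif(motif):
    """Return the alphabetically smallest form among all cyclic shifts
    of the motif and of its reverse complement."""
    motif = motif.upper()
    revcomp = motif.translate(_COMPLEMENT)[::-1]
    candidates = []
    for s in (motif, revcomp):
        # All rotations of s, e.g. 'TTC' -> 'TTC', 'TCT', 'CTT'.
        candidates.extend(s[i:] + s[:i] for i in range(len(s)))
    return min(candidates)


if __name__ == "__main__":
    for m in ("CT", "GA", "TC", "TTC"):
        print(m, "->", canonical_motif(m))
```

With this convention, motif frequencies can be tallied on the canonical form so that each equivalence class is counted once.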
