Perfect, imperfect and compound SSRs were in-silico mined using the SciRoKo SSR-search module (http://kofler.or.at/bioinformatics/SciRoKo). A minimum of four repetitions together with a minimum length of 15nt was requested; so any sequence was considered as a perfect SSR where a motif was repeated at least 15 times (1nt motif), eight times (2nt), five times (3nt) or four times (4-6nt), allowing for only one mismatch. For compound repeats, the maximum default interruption (spacer) length was set at 100bp.
SSR motif frequency and distribution
On the whole, about 177,200 perfect SSR motifs were identified, including 37,748 compound SSRs. The SSR loci identified were classified on the basis of the repeat motif and their distribution over linkage groups (Table 1) as well as the number of repeat units (Table 2). Di-nucleotides are the most frequent (73.0%), followed by tri- (11.2%), mono (4.7%) and tetra-nucleotides (6.1%); penta- & hexa-nucleotides are rare (2.4 and 2.5% respectively). The imperfect SSR motifs identified were more than 224,000.