A segregating F2 population was developed using 3096A (female parent) as the sterile line and 866R (male parent) as the restorer. This study mapped the candidate fertility restoration gene to a 1.35 Mb region of chrD05 and identified 20 candidate genes for the first time, suggesting that the fertility restoration genes of 104-7A lines may differ from those of Gossypium harknessii. Moreover, 42 InDel markers were detected by whole-genome resequencing. These results provide important information for further study of CMS restoration genes in cotton.
Background: Cytoplasmic male sterility (CMS) is a maternally inherited trait in which plants fail to produce functional pollen. It plays a pivotal role in the exploitation of crop heterosis. Specific locus amplified fragment sequencing (SLAF-seq), a high-resolution strategy for the large-scale identification of new SNPs, is increasingly applied to functional gene mining. The current study combined bulked segregant analysis (BSA) with SLAF-seq to identify candidate genes associated with the fertility restorer gene (Rf) in CMS cotton.
Methods: The parents were systematically investigated by Illumina sequencing. A segregating population comprising 30 + 30 F2 individuals was developed using 3096A (female parent) as the sterile line and 866R (male parent) as the restorer. The raw data obtained by dual-index sequencing were processed to obtain the reads of each sample, which were then compared with the reference genome to identify SLAF tags that were polymorphic between the parental lines and SNPs with read-associated coverage. The hot regions were annotated based on SLAF tags, SNP-index analysis, Euclidean distance (ED) correlation analysis, and whole-genome resequencing.
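The two association statistics named above can be illustrated with a minimal Python sketch. It assumes the commonly used definitions: the SNP-index as the fraction of reads at a site that carry the non-reference allele, the delta SNP-index as the difference between the two bulks, and ED as the Euclidean distance between the base-frequency vectors of the two bulks. The function names and read counts are hypothetical, and the exact formulation and filtering used in this study may differ.

```python
from math import sqrt

BASES = ("A", "C", "G", "T")


def snp_index(alt_depth, total_depth):
    """Fraction of reads at a site that carry the non-reference allele."""
    if total_depth <= 0:
        raise ValueError("site has no read coverage")
    return alt_depth / total_depth


def delta_snp_index(bulk_a, bulk_b):
    """Difference in SNP-index between two bulks.

    Each bulk is an (alt_depth, total_depth) pair for the same site.
    """
    return snp_index(*bulk_a) - snp_index(*bulk_b)


def euclidean_distance(counts_a, counts_b):
    """ED between the base-frequency vectors of two bulks at one site.

    Each argument maps a base (A, C, G, T) to its read count in that bulk.
    """
    total_a = sum(counts_a.get(base, 0) for base in BASES)
    total_b = sum(counts_b.get(base, 0) for base in BASES)
    if total_a == 0 or total_b == 0:
        raise ValueError("site has no read coverage in one of the bulks")
    return sqrt(
        sum(
            (counts_a.get(base, 0) / total_a - counts_b.get(base, 0) / total_b) ** 2
            for base in BASES
        )
    )


# Hypothetical read counts at one site (not data from this study).
fertile_bulk = {"A": 5, "G": 45}   # G is the non-reference allele
sterile_bulk = {"A": 45, "G": 5}

delta = delta_snp_index((45, 50), (5, 50))
ed = euclidean_distance(fertile_bulk, sterile_bulk)
```

Sites linked to the trait are expected to show a large absolute delta SNP-index and a large ED, whereas unlinked sites should give values near zero in both statistics.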
Results: A total of 165,007 high-quality SLAF tags were sequenced, with an average depth of 47.90× in the parents and 50.78× in F2 individuals. In addition, a total of 137,741 SNPs were detected: 113,311 and 98,861 SNPs in the male and female parents, respectively. A correlation analysis by SNP-index and ED initially located the candidate gene to a 1.35 Mb region of chrD05, and 20 candidate genes were identified. The genetic variations in these genes included single-base mutations, insertions, and deletions. Moreover, 42 InDel markers were also detected by whole-genome resequencing.
Conclusions: The associated markers identified by super-BSA in this study could accelerate the study of CMS in cotton as well as in other crops. The preliminary characteristics of some of the 20 genes provide useful information for further studies on CMS crops.
Fig. 1 Distribution of SLAF tags and SNP markers on the chromosomes. Note: The abscissa is the length of the chromosome. Each yellow band represents a chromosome. The genome is divided into windows of 1 Mbp. The more SLAF tags a window contains, the darker its color; the fewer it contains, the lighter its color. The darker areas in the figure are the regions where the SLAF tags are concentrated. The left panel shows the distribution of SLAF tags, and the right panel shows the distribution of SNPs.
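The window-based density shown in the figure can be sketched in Python as follows. This is an illustrative assumption about how such a plot is typically prepared, not the plotting code used in this study: tag positions are taken to be 1-based coordinates on a single chromosome, and the function name and example values are hypothetical.

```python
WINDOW_SIZE = 1_000_000  # 1 Mbp windows, as described in the caption


def window_counts(positions, chrom_length, window=WINDOW_SIZE):
    """Count tags per fixed-size window along one chromosome.

    positions: iterable of 1-based tag coordinates.
    Returns a list with one count per window, in chromosome order.
    """
    n_windows = -(-chrom_length // window)  # ceiling division
    counts = [0] * n_windows
    for pos in positions:
        if not 1 <= pos <= chrom_length:
            raise ValueError(f"position {pos} lies outside the chromosome")
        counts[(pos - 1) // window] += 1
    return counts


# Hypothetical tag positions on a 3 Mbp chromosome (not data from this study).
example = window_counts([1, 999_999, 1_000_000, 1_000_001, 2_500_000], 3_000_000)
```

Each count would then be mapped to a color intensity, so that windows with more tags are drawn darker.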