African Journal of
Biotechnology

  • Abbreviation: Afr. J. Biotechnol.
  • Language: English
  • ISSN: 1684-5315
  • DOI: 10.5897/AJB
  • Start Year: 2002
  • Published Articles: 12501

Full Length Research Paper

Genetic diversity and population structure of common bean (Phaseolus vulgaris L) germplasm of Ethiopia as revealed by microsatellite markers

Zelalem Fisseha
  • Zelalem Fisseha
  • Addis Ababa University, Department of Microbial, Cellular, and Molecular Biology, P.O.Box 1176, Addis Ababa, Ethiopia.
  • Google Scholar
Kassahun Tesfaye
  • Kassahun Tesfaye
  • Addis Ababa University, Department of Microbial, Cellular, and Molecular Biology, P.O.Box 1176, Addis Ababa, Ethiopia.
  • Google Scholar
Kifle Dagne
  • Kifle Dagne
  • Addis Ababa University, Department of Microbial, Cellular, and Molecular Biology, P.O.Box 1176, Addis Ababa, Ethiopia.
  • Google Scholar
Matthew W. Blair
  • Matthew W. Blair
  • Department of Agricultural and Environmental Sciences, Tennessee State University, Nashville, TN 37209, United States of America.
  • Google Scholar
Jagger Harvey
  • Jagger Harvey
  • Biosciences Eastern and Central Africa (BecA), ILRI, Nairobi, Kenya.
  • Google Scholar
Martina Kyallo
  • Martina Kyallo
  • Biosciences Eastern and Central Africa (BecA), ILRI, Nairobi, Kenya.
  • Google Scholar
Paul Gepts
  • Paul Gepts
  • Department of Plant Sciences / MS1, University of California, 1 Shields Avenue, Davis, CA 95616, USA.
  • Google Scholar


  •  Received: 13 May 2016
  •  Accepted: 09 December 2016
  •  Published: 28 December 2016

 ABSTRACT

The Ethiopian genetic center is considered to be one of the secondary centers of diversity for the common bean. This study was conducted to characterize the distribution of genetic diversity between and within ecological/geographical regions of Ethiopia. A germplasm sample of 116 landrace accessions was developed, which represented different common bean production ecologies and seed types common in the country. This sample was then analyzed with 24 simple sequence repeat (SSR) markers to assess the genetic diversity within and between common bean landraces, classifying them based on SSR clustering, and determining relationships between genetic and agroecological diversity. Representatives of both Andean and Mesoamerican gene pools were identified by STRUCTURE software analysis, as well as a high proportion of hybrid accessions as evidenced by a STRUCTURE K = 2 preset. At the optimum K = 5 preset value, mixed membership of Andean and Mesoamerican genotypes in some of the clusters was also seen, which supported previous findings. Cluster analyses, principal coordinate analysis, and analysis of molecular variance all indicated clustering of accessions from different collection sites, accompanied by high gene flow levels, highlighting the significant exchange of planting materials among farmers in different growing regions in the country. Values of allelic diversity were comparable to those reported in previous similar studies, showcasing the high genetic diversity in the landrace germplasm studied. Moreover, the distribution of genetic diversity across various bean-growing population groups in contrasting geographical/ecological population groups suggests elevated but underutilized potential of Ethiopian germplasm in common bean breeding. In summary, this study demonstrated the geographical, as well as gene pool diversity in common bean germplasm of Ethiopia. This substantial diversity, in turn, should be utilized in future common bean breeding and conservation endeavors in the nation.

 

Key words: Hybridity, simple sequence repeat, microsatellite, structure, seed exchange, gene flow.


 INTRODUCTION

Common bean is the most widely consumed legume species of the genus Phaseolus (Freytag and Debouck, 2002).  It is a pulse crop used since pre-Columbian times in the Americas and, since the 16th century, in other regions of the world (Gepts et al., 2008). It is a true diploid (2n = 2x = 22) with a small genome (580 Mbp; Broughton et al., 2003). Originating in the Neotropics, common bean was domesticated in Mesoamerica and the Andes (Gepts and Bliss, 1986; Gepts, 1988). The crop has high diversity that is broadly classified into six or seven domesticated races distributed into two gene pools (Singh et al., 1991a, b, c; Blair et al., 2007, 2010b; Pallottini et al., 2004; Kwak and Gepts, 2009; Kwak et al., 2012). The crop is a major legume in Eastern and Southern Africa, occupying more than 4 million ha annually and providing food for ≥100 million people (Wortmann et al., 1998; Fisseha, 2015).  Of the total production in sub-Saharan Africa of over 3.5 million MT, 62% is in Eastern and Central African countries (Wortmann et al., 1998; Fisseha, 2015). Common bean became established with the African-European trade, even before the widespread era of colonization (Allen and Edje, 1990; Asfaw et al., 2009). Historical accounts show that common bean was introduced to Ethiopia in the 16th century by Portuguese traders and rapidly became an important component of the diet there (Assefa, 1985; Fisseha, 2015). Ever since the introduction of common bean into Ethiopia, farmers have developed farming practices adapted to local conditions by preservation and exploitation of useful alleles, which have resulted in a range of morphologically diverse landraces (Sperling, 2001). Moreover, recent efforts of the national bean-breeding program in Ethiopia have targeted improvement of on-farm common bean productivity and have benefited since the 1980’s from continuous introduction of new germplasm from different parts of the world (Fisseha, 2015).

Today, Ethiopia is among the major bean producers in Sub-Saharan Africa (Wortmann et al., 1998). However, the national bean yield still lags behind the global average (Fisseha, 2015). This can be attributed partially to the low yielding capacity of cultivars under use (Assefa, 1990; Fisseha, 2015). To this end, it is essential to tap the potential of landrace genetic resources in order to introgress novel genes of adaptation, resistance to diseases and pests, and tolerance to abiotic stresses. According to Hornakova et al. (2003), landraces grown by small farmers are rich sources of valuable genes.

East Africa is a secondary center of diversity for common beans, due to the wide range of landraces there (Martin and Adams, 1987; Asfaw et al., 2009, Blair et  al.,2010b). Understanding the patterns and levels of genetic diversity of bean landraces and cultivars can shed light on potential adaptation and direction and level of gene flow, and eventually help bean breeding and conservation in Ethiopia. Hence, this research project was undertaken with the principal goal of evaluating the genetic diversity within and between common bean landraces, to classify genotypes based on clustering and to understand the distribution of genetic diversity between and within ecological/geographical regions of Ethiopia.


 MATERIALS AND METHODS

Plant materials

A total of 116 landrace accessions collected from a range of common bean production agro-ecologies in Ethiopia, four Ethiopian cultivars, three Kenyan cultivars, and two other cultivars, used as control genotypes for the Andean and Mesoamerican gene pools, respectively, were grown in August, 2012, in a greenhouse in the Biosciences Eastern and Central Africa (BecA-ILRI) hub in Nairobi, Kenya, for DNA extraction and analysis. The Ethiopian accessions were sampled from potential bean growing areas in the country (Supplementary Table 1 and Figure 1). The seeds of the control and commercial cultivars were acquired from the Ethiopian National Bean Research Project, based at Melkassa Agricultural Research Center, Adama, Ethiopia. The landrace accessions were provided by the Gene Bank of the Ethiopian Biodiversity Institute (EIB). A total of ten plants per each accession were planted in a single row in the screen house of BecA-ILRI hub, Nairobi, Kenya in August, 2012 for DNA extraction.

 

 

Genomic DNA extraction

For the molecular diversity assessment, total genomic DNA for each accession was isolated from a bulked leaf tissue sample of one week old plants from five randomly selected plants per accession using cetyltrimethylammonium bromide (CTAB) method (Doyle and Doyle, 1987) with some minor modifications, as described in supplementary part 1. However, 47 accessions did not produce enough genomic DNA, probably due to poor leaf sample qualities, which, in turn, imposed the need to  repeat DNA extraction from the same, using the Zymoplant seed DNA extraction kit (descriptions on the protocol are presented in Supplementary Part 2).

 

Microsatellite amplification

Twenty-four (24) microsatellite markers from all the 11 linkage groups were selected based on their Polymorphic Information Content (PIC) values and dispersed map locations (Yu et al., 2000; Pedrosa-Harand et al., 2008; Kwak and Gepts, 2009). Out of the 24 SSR markers, 15 were genomic, and the remaining nine were non-genomic (genic) markers (Supplementary Table 2). Markers were PCR amplified with 6-FAM, NED, PET or VIC 5’-labeled forward primers and unlabeled reverse primers. The primers were run in multiplexes, based on their fluorescence dye and allele size using BIONEER ACCUPOWER® Multiplex PCR Premix Kits (Supplementary part 3). Out of the 24 SSR markers, seven were dropped after preliminary evaluation, because they either produced no amplification (BM172 and BMd1) or were monomorphic (BM188, BM183, BMd16, PV-AG001, and PV-AT001). PCR products were run on an ABI PRISM 3730xl fragment analyzer (Applied Biosystems, Foster City, CA, USA) at the BecA-ILRI hub (Sequencing, genotyping, and Oligo unit, SegoLip), and allele sizes were determined by comparing with Genescan LIZ500 size standard using GeneMapper v. 3.7.3.7 software. The observed allele sizes were then adjusted for the discrete allele size using the AlleloBin software (http://test1.icrisat.org/gt-bt/download_allelobin.htm).

 

SSR genetic diversity analysis

Genalex 6.5b3 (Peakall and Smouse, 2012; http://biology.anu.edu.au/GenAlEx/) was used to calculate genetic diversity parameters, such as genetic distance, number of alleles (Na); number of effective alleles (Ne); number of private alleles (Npa); observed heterozygosity (Ho); expected heterozygosity (He); Shannon’s information index (I); analysis of molecular variance (AMoVA); and principal coordinate analysis (PCoA). Genetic associations were determined using the neighbor-joining coefficient with Darwin V. 5.0 (http://darwin.cirad.fr/darwin). Genepop V.4 (Rousset, 2008) and Popgene32 (Yeh et al., 1999) programs were also used to determine genetic diversity, polymorphic loci, gene flow, levels of heterozygosity, fixation index, and F-values,. Finally, PowerMarker v. 3.25 (Liu and Muse, 2005) was used to estimate the number of alleles, polymorphic information content (PIC) values, genetic distance matrices, observed heterozygosity (Ho); and  expected heterozygosity (He) for each marker across all genotypes and then across genotypes within and between gene pools.

 

Analysis of population structure

The software program STRUCTURE was run for K values ranging from 2 to 8. Each run was performed using the admixture model and 5,000 replicates for burn-in and 50,000 during the analysis (Pritchard et al., 2000). Evanno et al. (2005) test was performed after 10 simulations per K value. The repeated simulations were conducted for every subpopulation number from K = 2 to K=8 using 5,000 replicates for burn-in and 50,000 replicates according to previous suggestions (Rosenberg et al., 2002; Evanno et al., 2005; Ehrich, 2006).  The Δ statistic showed that K = 5 was the optimal number of subpopulations in this analysis (Supplementary Figure 1). This ideal K value presented the highest peak for change in value from and to the previous and subsequent numbers of subpopulations, respectively. This showed a gain in precision from subdividing the genotypes into five subpopulations versus any lower or higher numbers of subpopulations. The K=2 analysis was done with a particular interest of distinguishing between Andean and Mesoamerican accessions (Koenig and Gepts, 1989; Kwak and Gepts, 2009). To this end, five independent runs were performed with the admixture model and 5,000 replicates for burn-in and 50,000 replicates during analysis. The clustering in different runs was almost identical (similarity coefficient 0.9914). The run with the lowest likelihood value was selected among the five runs, and the accessions with more than 80% posterior assignment probability in the Mesoamerican cluster were assigned to the Mesoamerican gene pool (and vice versa for the Andean gene pool) (Supplementary Table 3). Lower posterior assignment probability values (that is, between 50 and 80%) may actually indicate hybrids or admixed accessions rather than ‘‘pure’’ accessions (Kwak and Gepts, 2009). Nonetheless, such accessions were included in the K=2 analysis, as they are important in future studies towards shedding light on the population structure of the common bean in Ethiopia, and as baseline information in breeding/improvement programs.

 


 RESULTS

Population structure into the Andean and Mesoamerican gene pools in the common bean germplasm

The     population     subdivision    (as     determined     by STRUCTURE) (Figure 2), the NJ tree (Figure 3), and the PCoA (Figure 4), showed significant Andean–Mesoamerican gene pool divergence as well as racial differentiation within gene pools. The accessions were assigned to the respective gene pools of origin, as per the methods explained in the “Materials and Methods” for K=2. Consequently, 78 accessions out of the total 125 fell into the Mesoamerican group, whereas the remaining 47 were classified into the Andean group. This classification was based on posterior assignment probabilities p>0.5. This split was generally maintained from K=2 to 3, but broke down for K = 4 and 5 (Figure 2; Supplementary Table 3). The analysis for K = 2 populations showed individual genotypes distributed between the two gene pools, which was congruent with the neighbor-joining and PCoA analyses, which clearly separated the Mesoamerican and Andean gene pools. At K=3, looking jointly into the bar-graphs produced and membership coefficient values, the Mesoamerican gene pool genotypes further separated into two sub-groups but no meaningful interpretation of population structure could be made, while the Andean gene pool genotypes did not show any separation. At K=4, the Mesoamerican accessions further subdivided into two groups with a mild level of admixture but no meaningful interpretation of population structure could be made. At K = 5, the Andean accessions further subdivided into three groups withsome admixture level, whereas the Mesoamerican accessions did not subdivide further. In the following section, we describe in further details the five groups of K = 5.

 

 

Genetic diversity among accessions and cluster groups in STRUCTURE preset K=5

For K=5, the groups were identified as Andean Cluster 1 (K4); Andean Cluster 2 (K5); Andean control (K1); Mesoamerican Cluster 1 (K2) and Mesoamerican control (K3). On average, Fst values for Andean populations (K1, K4, and K5) were lower (0.213) compared to those of Mesoamerican populations (K2, and K3) (0.451) (Table 1). We also quantified population admixture for each accession (Figure 2; Supplementary Table 3). The Andean gene pool had a higher proportion of non-hybrid accessions than the Mesoamerican gene pool (51and 35% at the 0.8 cutoff, respectively; Table 2). The proportion of non-hybrid accessions in each K group ranged from 28% (Mesoamerican Controls K3) to 54% (Andean Cluster 2 K5) at the 0.8 cutoff values (Table 2).

The proportions of polymorphic loci were 100% in the Andean Cluster 1 (K4) genotypes; 94% in the Andean cluster 2 (K5), Andean control (K1), and the Mesoamerican cluster 1 (K2); 76% in the Mesoamerican control (K3) (Table 3). On average, the Andean groups had a higher number of alleles (Na), number of effective alleles (Ne); Shannon Index (I), observed heterozygosity, expected heterozygosity, fixation index, percent of polymorphic loci; genetic distance; and number of private alleles. On the other hand, the Mesoamerican groups had higher hybridity rates than the Andean groups. The highest number of alleles, genetic distance (GD), observed heterozygosity (Ho), hybridity rate (t), and percent of polymorphic loci was recorded for the Andean cluster 1 (K5). The Andean control cluster had the highest Shannon index (I), fixation index (F), number of private alleles (Npa); and number of effective alleles (Ne).

 

 

Analysis of Molecular Variance (AMoVA) among accessions and cluster groups in STRUCTURE preset K=5

The AMOVA results showed that 50% of allelic diversity was attributed to individuals within gene pool (P<0.001), 31% among individuals in the total population, and the remaining19% was attributed to the diversity among populations (Figure 5). A highly significant genetic differentiation among subpopulations (0.186, P<0.01) was observed. Some lower level of gene flow between different cluster of accessions was also reported (that is, 1.1), with higher values among accessions from different Andean gene pool clusters (that is, 1.6) values observed among different Mesoamerican clusters (i.e. 0.3) (Table 4). The average Nei’s unbiased genetic distance was higher within each gene pool (0.8), but slightly lower between the Andean and Mesoamerican gene pools (0.7). Within gene pool, the Mesoamerican representatives presented lower genetic distances (0.7) than the Andean gene pool representatives (0.8) (Table 4).

 

 

Genetic associations among accessions

Genetic associations among accessions from different populations in Ethiopia with respect to Andean and Mesoamerican control genotypes were identified using variation for fluorescent microsatellite markers (Figures 3 and 4). Both the PCoA and Neighbor-Joining graphs indicated the clustering of the bean genotypes into either of the Andean or Mesoamerican control genotypes. In the context of the geographical sample collection sites (Supplementary Table 1), genotypes from the same collection site were often in different clusters and likewise accessions from different collection sites often clustered together (Figure 6), indicating the possibility of gene flow by seeds between sites and regions within Ethiopia.

A principal coordinate analysis (PCoA) was conducted using five populations identified by STRUCTURE. The overall variation explained by the PCoA was 64% with dimensions 1, 2 and 3 explaining 26, 21and 19%, respectively. PCoA separated the bean genotypes into their corresponding centers of domestication (Andean/ Mesoamerican) along the first axis (Figure 4). Exceptions were Andean Cluster 4 genotypes in the second quadrant (four in number) and one genotype of the Andean Control cluster (quadrant III), which showed mixed cluster membership with the Mesoamerican Cluster. The mixed membership of Andean Cluster 1 (K4) was consistent between the STRUCTURE and neighbor-joining analysis results. However, the mixed clustering of Andean Control Cluster (K1) with the Mesoamerican groups was exhibited only in the PCoA and neighbor-joining tree.

 

 

Microsatellite diversity of Ethiopian common bean landrace accessions with respect to collections sites

Allelic patterns/diversity

A total of 149 alleles were identified, giving an average of 8.8 alleles per locus for the 17 microsatellites evaluated, of which 12 were genomic markers and 5 were genic (gene-based) (Supplementary Table 2). The range in allele number was 4 to 15, with the marker BM143 having the highest number of alleles, followed by GATS91, GATS54 and BM140, with 14, 13, and 13 alleles, respectively. All these markers were genomic. The highest number of alleles found for a gene-based microsatellite was for BMd53 with 9 alleles, followed by BMd36 and BMd42 having 6 alleles each. The mean number of alleles for genomic microsatellites was 1.5 times more than that of genic microsatellites. The observed heterozygosity on average was 0.51 across all the 17 markers evaluated. The markers with the highest levels of observed heterozygosity were GATS91 (0.68) and BM143 (0.67), whereas the genic marker PV-CCTT001 had the lowest value, 0.01. With respect to the values recorded for expected heterozygosity (He), the SSR markers had an average of 0.564, with the highest being the genomic SSR, GATS91 (0.817), and the lowest for the genic SSR marker, PV-CCTT001 (0.011).

On the other hand, the allelic patterns across the studied populations are presented in Supplementary Figure 2. The figure also depicts the number of alleles, number of effective alleles, Shannon’s diversity index, number of private alleles, and number of less common alleles in bars of different colors. The line above the bars indicates pattern of variation in expected heterozygosity among the different groups of accessions. The ‘Amhara’ and Southern Nations, Nationalities, and People (SNNP) had the highest expected heterozygosity. Nonetheless, the overall variation observed in accessions from different populations (collection sites) vis-à-vis the expected heterozygosity values was moderate. The calculated values for each of the aforementioned allelic measures are given in Table 5. The table corroborates the patterns depicted by Supplementary Figure 2. According to Table 5, accessions from Oromiya and SNNP had the highest average number of alleles (6.9and 6.4, respectively) (Table 5). On the other hand, accessions from ‘Amhara’ and the released varieties’ group had the highest number of alleles with frequencies ≥5% (measurement taken to alleviate the sampling error associated with the sampling of race or distinct alleles, that is, with frequencies ≤ 5%), (Na Freq. ≥5%), whereas accessions from ‘Amhara’ and  SNNP had the highest number of effective alleles (Ne) (Table 5). From the perspective of this study, the ‘Amhara’ and SNNP regions may be the most important population of accessions owing to the higher number of alleles with frequencies ≥5% (excluding rare alleles) and number of effective alleles. Similar to the observations regarding the number of effective alleles (Ne), ‘Amhara’ and SNNP had the highest genetic diversity measures (Shannon’s index=I) (Table 5). This may further strengthen the argument made above regarding the two populations, namely that the ‘Amhara’ and SNNP regions contain the highest level of bean diversity. Furthermore, accessions from ‘Oromiya’ and SNNP had the highest numbers of private alleles (0.82 and 0.59, respectively), and fewer common alleles with frequencies less than 50% (1.77and 1.60, respectively). This may imply that, upon further determination of what functional characters, if any, these private/less common alleles encode for or which genome region they mark, it may be possible to harness the potential of accessions in the population in future common bean breeding/improvement and genetic conservation endeavors in Ethiopia. Finally, the highest values for both expected and unbiased expected heterozygosity were recorded for accessions from ‘Amhara’, SNNP, and the released varieties’ group.

 

 

Analysis of Molecular Variance (AMOVA) in the ecological/geographic population groups

Results of AMOVA are presented in  Table 6  and  Figure7. Figure 7 shows that 58% of the total variation was attributed to genetic diversity prevalent within individuals from different populations, whereas 40% was due to variation among individuals within the same population. In contrast, a smaller portion (2%) of the total variation differentiated populations. In comparison, when the cluster groups identified at STRUCTURE preset K=5 (discussed below) were considered, AMOVA showed that 50% of allelic diversity was attributed to individuals within each of the groups (P<0.001); 31% among individuals in the total population; and the rest 19% was attributed to the diversity among populations. Moreover, highly-significant genetic differentiation among subpopulations (0.186, P<0.01) was observed.

In view of the F-statistics values (Table 7), the extent of genetic differentiation among the six populations in terms of allele frequencies measured was small (Fst=0.015*). Furthermore, the pair-wise Nm values among the six populations studied indicate that the highest values for putative gene flow were recorded for the following pairs of populations: BenishangulGumuz and SNNP (Nm=63); BenishangulGumuz and Kenya (Nm=55); and Oromiya and SNNP (Nm=31) (Supplementary Table 4).

 

 

Cluster analysis and Principal Coordinate Analysis (PCoA)

Cluster analysis with respect to populations (collection sites) was performed on the allelic frequency data using the Neighbor-joining method as implemented in the Darwin 5 and PowerMarker V3.25 software programs. Figure 8 shows the dendrogram clustering pattern for individual accessions in different populations (collection sites). As can be seen from the dendrogram, five different groups were identified. Furthermore, accessions from  different populations (collection sites) clustered together. On the other hand, Supplementary Figure 3 shows the results of the cluster analysis done based on Nei’s average unbiased genetic distance (Nei, 1983) among the accessions studied. Based on these results, four groups of populations were identified among the common bean landrace accessions from six different populations. Group 1 belonged to accessions from the Amhara region/population; the second group comprised accessions from the southern Ethiopia and Oromiya regions/populations. Another neighbor-joining dendrogram was constructed based on the shared-allele frequency genetic distances measured (Supplementary Figure 4). In comparison, this dendrogram identified five groups (compared to the four groups identified in the Nei’s genetic distance NJ dendrogram), with Oromiya and Southern regions in the farthest end and Benishangul-Gumuz and Amhara, being group 3 and 4. Finally, yet importantly, the shared-allele frequency NJ dendrogram, similarly with the Nei’s NJ dendrogram, clustered accessions from Kenya and the released varieties’ group.On the other hand, the first three axes of the PCoA accounted together for 65 % of the total variation, with 27, 21 and 17% explained by PC axis 1, 2, and 3, respectively. Results of the PCoA are displayed in Figure 6. It can be seen  from  this  figure  that  accessions  from different collection sites often clustered together.

 

 

 

 

 


 DISCUSSION

The hierarchical classification scheme into subpopulations comprised of Andean and Mesoamerican genotypes obtained here was in agreement with that reported for common bean germplasm in various studies (Singh et al., 1991a; Gepts, 1998; Diaz and Blair, 2006; Blair et al., 2007, 2010a, 2011; Okii et al., 2014b). Moreover, the moderate to mostly large differentiation among subpopulations (Fst values) were higher than was reported in other studies (Asfaw et al., 2009; Okii et al., 2014b). On the other hand, the higher differentiation recorded in the present study among Mesoamerican subpopulations compared to their Andean counterparts was in contrast with the results of Asfaw et al. (2009) and Okii et al. (2014b). The separation of bean accessions into the two gene pools was also evidenced in the NJ and PCoA analyses. Five cluster groups were identified, which supports the findings reported in previous studies (Kwak and Gepts, 2009; Burle et al., 2011). Furthermore, the presence of moderate admixture level agrees with previous reports (Asfaw et al., 2009; Blair et al., 2010b; Okii   et   al.,   2014b).   Similarly,    the   concurrence    ofSTRUCTURE results with that of PCoA and NJ analyses was in agreement with that reported by Asfaw et al. (2009) and Okii et al. (2014b). 

Five subpopulations with moderate admixture level and some switching of membership were observed in the present study. In line with this, Okii et al. (2014b) noted that the high level presence of admixture is indicative of the considerable mixing of common bean germplasm in planting and consumption and in hybridization in breeding. The considerable presence of admixture and switching of membership in some instances was supported by the PCoA and NJ analyses. These agree with some previous reports (Asfaw et al., 2009; Blair et al., 2010b; Burle et al., 2011; Okii et al., 2014b). In addition, the PCoA and NJ analyses in terms of geographic/ecological sampling of accessions indicated that accessions from different collection sites clustered together, which implied there was significant exchange of planting materials among farmers in different growing regions in the country. Moreover, the analysis of hybrid/ non-hybrid accessions indicated the Mesoamerican genotypes had higher instances of hybridity than the Andean counterparts. This observation supported previous results (Asfaw et al., 2009; Kwak and Gepts, 2009). On the other hand, from a population differentiation viewpoint, Andean genotypes were more differentiated than those from the Mesoamerican gene pool, which concurs with the findings reported previously for East African common bean germplasm (Asfaw et al., 2009; Okii et al., 2014b). 

Accessions with Andean origin had higher allelic parameter values (Na, Ne, PIC, etc.) than Mesoamerican accessions. This contrasted with results from other related studies (Asfaw et al., 2009; Burle et al., 2011; Okii et al., 2014 a,b). Such differences may be attributed to differences in genetic samples and respective sampling methods employed. On the other hand, the hybridity values recorded in our study were much higher than those reported previously (Blair et al., 2010b; Okii et al., 2014a). This might be explained by the fact that most of the accessions (>90%) were acquired from the National Gene Bank, which, in turn, had collected these accessions from subsistence farmers with a culture of keeping mixed seeds for consumption and subsequent planting seasons. Moreover, the higher allelic values of genomic than genic markers were comparable to those reported in some previous studies (Asfaw et al., 2009; Blair et al., 2010b; Okii et al., 2014b). The high genetic diversity of the Ethiopian common bean landraces was also evident when considering their ecological or geographical distribution. Such presence of high diversity in terms of both gene pools and the existence of ecologically- or geographically differentiation populations can have potential applications prospective common bean breeding programs in Ethiopia.

The presence of higher levels of gene flow within each gene pool than that found between gene pools observed in our study agrees with the result of Asfaw et al. (2009). This may be explained, in part, due to the lack of flowering synchronization, which could reduce inter-gene pool gene flow. A larger proportion of the accessions (i.e., 58%) were introgressions, which contradicts the report of Asfaw et al. (2009) about the lower level of introgression with Ethiopian and Kenyan bean landraces/cultivars. This, in turn, negates the assumption of the aforementioned authors implying that the genetic divergence in Ethiopian bean germplasm could be mainly due to the original differences in introduced germplasm from the primary centers of origin. Rather, the presence of a higher number of introgressions may be partially explained by the fact that the accessions were gene bank collections from farmers’ fields often characterized by a higher level of mixtures. The common practice of subsistence farmers in the country who cultivate for consumption and save segregant genotypes, resulting from any natural hybridization, as planting materials for subsequent generations, could result in such type of introgressions (Blair et al., 2010b; Worthington et al. 2012). A final noteworthy remark may be the fact that inter-gene pool introgressions are often endowed with useful combination of traits, including enhanced adaptation to environmental stresses, higher resistance to diseases and pests, and higher nutritional quality; hence, the introgressions identified in this study are of considerable importance in future bean breeding and conservation endeavors in Ethiopia. These merits of hybrids were evidenced in Islam et al. (2005) and Blair et al. (2010b), who reported that introgressions had higher mineral compositions than their respective non-hybrid parents. Consequently, it may be essential to tap into the useful genetic diversity found in such types of inter-gene pool introgressions, to be harnessed in further common bean breeding, improvement, and genetic conservation programs of beans in Ethiopia.


 CONCLUSION

This study formulates new insights about the pattern and extent of genetic diversity and population structure of common bean landrace germplasm in Ethiopia. The results in the context of both the two gene pools of origin and ecological/geographic populations shed light on the presence of adequate genetic diversity organized into the Andean and Mesoamerican gene pools, and distributed across various ecological/geographic populations. This in turn should be strengthened by identifying the cluster groups identified by STRUCTURE via integrating molecular marker evaluations with phenotypic data.  


 CONFLICTS OF INTEREST

The authors have not declared any conflict of interest.


 ACKNOWLEDGMENTS

We are grateful to the financial and material support provided by the Rural Capacity Building Project (RCBP)-Ministry of Agriculture, Ethiopia; the African Biosciences Challenge Fund (ABCF) at the BecA-ILRI hub, Nairobi, Kenya; and Addis Ababa University. Moreover, we are thankful to the Ethiopian Biodiversity Institute, Ethiopia, and the National Common Bean Research Project based at Melkassa Agricultural Research Center, Ethiopia for availing the plant materials for our study.


 SUPPLEMENTARY TEXT 1: GENOMIC DNA EXTRACTION.

For the molecular diversity assessment, total genomic DNA for each accession was isolated from a bulked leaf tissue sample of five randomly selected, one-week-old plants per accession using cetyltriethylammonium bromide (CTAB) method (Doyle and Doyle, 1990) with some minor modifications, as described in the following sections.

About 200 mg of fresh leaf tissue samples/leaf were placed in a 2 ml autoclaved and labeled Eppendorf tubes, covered by paraffin paper with a small slot at one side for air circulation, and freeze-dried for two days at-80⁰C. Subsequently, a drop of polyvinyl polypyrrolidone (PVPP) was added to the Eppendorf tubes. Then, 500 μl of 1× CTAB was added to each tube to break open cells and soluble cellular contents. Next, the contents in each tube were mixed using a Vortex, and kept in a gently-shaking water bath for 1 hour at 65°C. After the samples were taken out of the water bath, they were centrifuged at 14,000 rpm for 30 min, using an Eppendorf centrifuge (5417R). Afterwards, the supernatant suspension was transferred into new Eppendorf tubes, and 250 μl of potassium acetate was added. A total of 400 μl of ice-cold isopropanol was added to the supernatant solution harvested, after centrifuging the samples at 14,000 rpm for 30 min. At this point, the samples were left at -20°C overnight. The following day, the samples were removed from the -20°C freezer; centrifuged at 14,000 rpm for 30 min at -4°C. The supernatant was then poured off and the pellet dried. In order to remove the remaining isopropanol drops, the tubes were placed upside down on a paper towel. The pellets were air-dried for 30 min at room temperature.

Subsequently, 200 μl of TE and 3 μl of RNAse were added to each tube, which were then left in a water bath at 37°C. Following this, chlorophyll and some denatured proteins were removed by dissolving in a 200 μl mixture of phenol, chloroform, and isoamyl alcohol at a ratio of 25:24:1, which was mixed with manual inversions from 5 to 10 times. Next, the samples were incubated at room temperature for 10 min. Subsequently, a fixed volume of supernatant (180 μl) was harvested from each tube into new sets of 1.5 ml Eppendorf tubes. Three hundred μl of ice-cold 100% ethanol plus 15 μl of sodium acetate (at pH 5.2) was added to each. Following incubation at -80°C for 5 min, centrifugation was performed at 14,000 rpm for 30 min; the supernatant was poured off and the inside of each tube was washed with 200 μl 70% ethanol, and another centrifuging was applied at 14,000 rpm for 30 min at -4°C. Following this, DNA pellets were air-dried for an hour, and re-suspended with 30 μl of low salt buffer. DNA quality and quantity were measured by gel electrophoresis (using 1% agarose gel for 1 hour using λ-DNA as a size marker) (Figure 11).

 



 REFERENCES

Allen DJ, Edje OT (1990). Common bean in Africa Farming System. In: 1990 Progress in Improvement of Common Bean in Eastern and Southern Africa; Smithson JB. Ed. Bean Research, CIAT Africa Workshop Series. 5(20):32.

 

Asfaw A, Blair MW, Almekinders C (2009). Genetic diversity and population structure of common bean (Phaseolus vulgaris L.) landraces from the East African highlands. Theor. Appl. Genet. 120:1-12.
Crossref

 
 

Blair MW, Díaz LM, Buendía HF, Duque MC (2010a). Genetic diversity, seed size associations and population structure of a core collection of common beans (Phaseolus vulgaris L.). Theor. Appl. Genet. 119:955-73
Crossref

 
 

Blair MW, Díaz LM, Gill-Langarica HR, Rosales-Serna R, Mayek-Perez N, Acosta-Gallegos JA (2011). Genetic relatedness of Mexican common bean cultivars as revealed by microsatellite markers. Crop Sci. 51: 1–13.
Crossref

 
 

Blair MW, Giraldo MC, Buendia HF, Tovar E, Duque MC, Beebe SE (2007). Microsatellite marker diversity in common bean (Phaseolus vulgaris L.). Theor. Appl. Genet. 113:100-109.
Crossref

 
 

Blair MW, Gonzalez LF, Kimani PM, Butare L (2010b). Genetic diversity, inter-gene pool introgression and nutritional quality of common beans (Phaseolus vulgaris L.) from Central Africa. Theor. Appl. Genet. 121:237-248.
Crossref

 
 

Broughton WJ, Hernandez G, Blair MW, Beebe SE, Gepts P, Vanderleyden J (2003). Beans (Phaseolus spp.) model food legumes. Plant Soil 252:55-128.
Crossref

 
 

Burle L, Fonseca JR, Peloso M, Melo L, Temple SR, Gepts P (2011). Integrating phenotypic evaluations with a molecular diversity assessment of a Brazilian collection of common bean landraces. Crop Sci. 51:2668-2680.
Crossref

 
 

Diaz LM, Blair MW (2006). Race structure within the Mesoamerican gene pool of common bean (Phaseolus vulgaris L.) as determined by microsatellite markers. Theor. App. Genet. 114:143-154.
Crossref

 
 

Doyle JJ, Doyle JL (1987). A rapid DNA isolation procedure from small quantities of fresh leaf tissue. Phytochem. Bull. 19:11-15.

 
 

Ehrich D (2006). AFLPDAT: a collection of R functions for convenient handling of AFLP data. Mol. Ecol. Notes 6:603-604.
Crossref

 
 

Evanno G, Regnaut S, Goudet J (2005). Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14:2611-2620.
Crossref

 
 

Freytag GF, Debouck DG (2002). Taxonomy, Distribution, and Ecology of the Genus Phaseolus (Leguminosae-Papilionoideae) in North America, Mexico and Central America. Bot. Res. Inst. Of Texas, Brit. Press, Ft. Worth, TX.

 
 

Gepts P (1988). A Middle American and an Andean Common Bean Gene Pool. In. Genetic resources of Phaseolus beans; Gepts, P. Ed. Kluwer, Dordrecht, The Netherlands. Pp. 375-390.
Crossref

 
 

Gepts P (1998). Origin and evolution of common bean: Past events and recent trends. HortScience 33:1124-1130.

 
 

Gepts P, Aragão FJL, Barros E, Blair MW, Broughton RBW, Galasso I, Hernández G, Kami J, Lariguet P, McClean P, Melotto M, Miklas PN, Pauls P, Pedrosa-Harand A, Porch T, Sánchez F, Sparvoli F, Yu K (2008). Genomics of Phaseolus Beans, a Major Source of Dietary Protein and Micronutrients in the Tropics. In. Genomics of Tropical
Crossref

 
 

Gepts P, Bliss FA (1986). Phaseolin variability among wild and cultivated common beans (Phaseolus vulgaris) from Colombia. Econ. Bot. 40:469-478.
Crossref

 
 

Habtu A (1990). Research Highlights on Haricot Bean in Ethiopia: An Assessment of the Progress, Priorities, and Strategies. In. Proceedings of National Workshop; 1-3 October, 1990, Addis Ababa, Ethiopia. pp. 50-53.

 
 

Hornakova O, Zavodna M, Zakova M, Kraic J, Debre F (2003). Diversity of common bean landraces collected in the Western and Eastern Carpatien. Czech. J. Genet. Plant Breed. 39(3):73-83.

 
 

Imru A (1985). Bean production in Ethiopia. In. Regional Workshop on Potential for Field Beans in West Asia and North Africa. Allepo, Syria, 10-13 October 1983, pp. 15-38.

 
 

Islam FM, Basford KE, Jara C, Redden RJ, Beebe S (2005). Seed compositional and disease resistance differences among gene pools in cultivated common bean. Genet. Res. Crop. Evol. 49:285-293.
Crossref

 
 

Koenig R, Gepts P (1989). Allozyme diversity in wild Phaseolus vulgaris: further evidence for two major centers of diversity. Theor. Appl. Genet. 78:809-817.
Crossref

 
 

Kwak M, Gepts P (2009). Structure of genetic diversity in the two major gene pools of common bean (Phaseolus vulgaris L., Fabaceae). Theor. Appl. Genet. 118:979-992.
Crossref

 
 

Kwak M, Toro O, Debouck D, Gepts P (2012). Multiple origins of the determinate growth habit in domesticated common bean (Phaseolus vulgaris L.). Ann. Bot. 110:1573-1580.
Crossref

 
 

Liu KJ, Muse SV (2005). PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 21:2128-2129.
Crossref

 
 

Martin GB, Adams MW (1987). Landraces of Phaseolus vulgaris (Fabaceae) in Northern Malawi: Generation and maintenance of variability. Econ. Bot. 41:204-215.
Crossref

 
 

Okii D, Tukamuhabwa P, Kami J, Namayanja A, Paparu P, Ugen M, Gepts P (2014b). The genetic diversity and population structure of common bean (Phaseolus vulgaris L.) germplasm in Uganda. Afr. J. Biotechnol. 29:2935-2949.

 
 

Okii D, Tukamuhabwa P, Namayanja A, Mukabaranga J, Paparu P, Gepts P (2014a). Morphological diversity of tropical common bean germplasm. Afr. Crop Sci. J. 22:59-67.

 
 

Pallottini L, Garcia E, Kami J, Barcaccia G, Gepts P (2004). The genetic anatomy of a patented yellow bean. Crop Sci. 44:968-977.
Crossref

 
 

Peakall R, Smouse PE (2012). GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—an update. Bioinformatics 28:2537-2539.
Crossref

 
 

Pedrosa-Harand A, Porch T, Gepts P (2008). Standard nomenclature for common bean chromosomes and linkage groups. Bean Improvement Cooperative, East Lansing, MI, USA. 

View.

 
 

Pritchard JK, Stephens M, Donnelly P (2000). Inference of population structure using multilocus genotype data. Genetics 155:945-959.

 
 

Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, Feldman MW (2002). Genetic structure of human populations. Science 298:2381-2385.
Crossref

 
 

Rousset F (2008). Genepop'007: a complete re-implementation of the genepop software for Windows and Linux. Mol. Ecol. Resour. 8:103-106.
Crossref

 
 

Singh SP, Gepts P, Debouck DG (1991a). Races of common bean (Phaseolus vulgaris L., Fabaceae). Econ. Bot. 45:379-396.
Crossref

 
 

Singh SP, Gutierrez JA, Molina A, Urea C Gepts P (1991b). Genetic diversity in cultivated common bean: II. Marker-based analysis of morphological and agronomic traits. Crop Sci. 31:23-29.
Crossref

 
 

Singh SP, Nodari R, Gepts P (1991c). Genetic diversity in cultivated common bean. I. Allozymes. Crop Sci. 31: 19–23.
Crossref

 
 

Sperling L (2001). The effect of the civil war on Rwanda's bean seed systems and unusual bean diversity. Biodivers. Conserv. 10:989-1009.
Crossref

 
 

Worthington M, Soleri D, Aragón-Cuevas F, Gepts P (2012). Genetic composition and spatial distribution of farmer-managed bean plantings: an example from a village in Oaxaca, Mexico. Crop Sci. 52:1721-1735.
Crossref

 
 

Wortmann CS, Kirkby RA, Eledu CA, Allen DJ (1998). Atlas of common bean (Phaseolus vulgaris L.) production in Africa. CIAT Pan-African Bean Research Alliance, Vol. 133.

 
 

Yeh FC, Yang R, Boyle T (1999). POPGENE VERSION 1.31-Microsoft Window-based Freeware for Population Genetic Analysis. University of Alberta Edmonton, AB Canada T6G 2H1.

 
 

Yu K, Park S, Poysa V, Gepts P (2000). Integration of simple sequence repeat (SSR) markers into a molecular linkage map of common bean (Phaseolus vulgaris L.). J. Hered. 91:429-434.
Crossref

 
 

Fisseha Z (2015). Genetic diversity and population structure of common bean (Phaseolus vulgaris, L.) germplasm from Ethiopia. PhD Thesis. Addis Ababa University, Addis Ababa, Ethiopia.

 

 




          */?>