The Pitfalls of Heterosis Coefficients

Heterosis (hybrid vigour) is a universal phenomenon of crucial agro-economic and evolutionary importance. We show that the most common heterosis coefficients do not properly measure deviation from additivity because they include both a component accounting for “real” heterosis and a term that is not related to heterosis, since it is derived solely from parental values. Therefore, these coefficients are inadequate whenever the aim of the study is to compare heterosis levels between different traits, environments, genetic backgrounds, or developmental stages, as these factors may affect not only the level of non-additivity, but also parental values. The only relevant coefficient for such comparisons is the so-called “potence ratio”. Because most heterosis studies consider several traits/stages/environmental conditions, our observations support the use of the potence ratio, at least in non-agronomic contexts, because it is the only non-ambiguous heterosis coefficient.


Introduction
Non-linear processes are extremely common in biology. In particular, genotype-phenotype or phenotype-phenotype relationships frequently display concave behaviours, resulting in the dominance of "high" over "low" alleles [1] and in positive heterosis for a wide range of polygenic traits [2,3]. Properly quantifying the degree of non-additivity is an essential prerequisite for interpreting and comparing genetic studies and for making predictions in plant and animal breeding. In this commentary paper, we first recap the different ways non-additivity is measured in genetics. We then analyse the formal relationships between the different heterosis coefficients and provide examples drawn from experimental studies in maize and cotton. Finally, we show the extent to which the most commonly used heterosis coefficients may lead to interpretation errors.
For a monogenic trait with alleles $A_1$ and $A_2$, Wright's dominance coefficient [1] is
$$D_W = \frac{z_2 - z_{12}}{z_2 - z_1}$$
and Falconer's dominance coefficient [5] is
$$D_F = \frac{z_{12} - \bar{z}}{(z_2 - z_1)/2}$$
where $z_1$ and $z_2$ are the phenotypic values of the two parental homozygotes, $z_{12}$ is the heterozygote value, and $\bar{z} = \frac{z_1 + z_2}{2}$. $D_F$ varies in the opposite direction to $D_W$: its value is 1 if $z_{12} = z_2$ (complete dominance of $A_2$ over $A_1$), −1 if $z_{12} = z_1$ ($A_2$ is fully recessive with respect to $A_1$) and 0 if there is additivity. In the case of overdominance, $D_W < 0$ and $D_F > 1$, and in the case of underdominance, $D_W > 1$ and $D_F < −1$ (Table 1).
Table 1. Dominance and heterosis coefficients. $D_W$: Wright's dominance coefficient [1]. $D_F$: Falconer's dominance coefficient [5]. $H_{mp}$, $H_{MP}$, $H_{PR}$, $H_{bp}$ and $H_{BP}$: heterosis coefficients. Subscripts: mp or MP, mid-parent; PR, potence ratio; bp or BP, best-parent. $z_1$ (resp. $z_2$): the phenotypic value of parental homozygote 1 or of parent 1 (resp. 2). $z_{12}$: the heterozygote or hybrid value. $\bar{z}$: the mean parental value. By convention, $z_2 > z_1$.

[Table 1 body not recoverable. Columns: Reference; Coefficient; Coefficient scales with their characteristic values. Regions marked on the scales: negative mid-parent heterosis, additivity, positive mid-parent heterosis, best-parent heterosis.]

The $D_W$ and $D_F$ coefficients are linearly related: $D_F = 1 - 2 D_W$. Thus, dominance can be quantified with either coefficient, since both of them give the position of the heterozygote relative to the parental homozygotes.
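The linear relationship between the two dominance coefficients can be checked numerically. The following is a minimal sketch, not part of the original analysis: it assumes that Wright's coefficient takes the form $(z_2 - z_{12})/(z_2 - z_1)$, which is the form consistent with the characteristic values quoted in the text, and the function name and the phenotypic values are purely illustrative.

```python
def dominance_coefficients(z1, z2, z12):
    """Return (D_W, D_F) for homozygote values z1 < z2 and heterozygote value z12.

    D_W is taken here as (z2 - z12) / (z2 - z1) (assumed form).
    D_F is (z12 - z_bar) / ((z2 - z1) / 2), with z_bar the mean homozygote value.
    """
    if z2 <= z1:
        raise ValueError("convention: z2 must be greater than z1")
    z_bar = (z1 + z2) / 2
    d_w = (z2 - z12) / (z2 - z1)
    d_f = (z12 - z_bar) / ((z2 - z1) / 2)
    return d_w, d_f


# Illustrative heterozygote values: recessive, additive, dominant,
# overdominant and underdominant cases for homozygotes at 10 and 20.
for z12 in (10.0, 15.0, 20.0, 24.0, 8.0):
    d_w, d_f = dominance_coefficients(10.0, 20.0, z12)
    # The two coefficients carry the same information: D_F = 1 - 2 * D_W.
    assert abs(d_f - (1 - 2 * d_w)) < 1e-12
    print(f"z12={z12:5.1f}  D_W={d_w:5.2f}  D_F={d_f:5.2f}")
```

Because the two coefficients are related by an affine map, either one locates the heterozygote between the homozygotes; only the direction and origin of the scale differ.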
For polygenic traits, either coefficient could be used to quantify non-additivity, i.e., "real" heterosis, without any ambiguity. In practice, one finds five heterosis coefficients in the literature (see their characteristic values in Table 1).
The two most popular coefficients are the best-parent (BP) and mid-parent (MP) heterosis coefficients (e.g., [6,7]):
$$H_{BP} = \frac{z_{12} - z_2}{z_2}, \qquad H_{MP} = \frac{z_{12} - \bar{z}}{\bar{z}}$$
where $z_2$, $z_{12}$ and $\bar{z}$ are, respectively, the phenotypic values of parent 2 (with $z_2 > z_1$), of the parent 1 × parent 2 hybrid and of the parental mean.
In some instances, the authors do not normalize the difference between the hybrid and the best- or mid-parent value. Fonseca & Patterson [8] proposed:
$$H_{bp} = z_{12} - z_2$$
and Falconer [5]:
$$H_{mp} = z_{12} - \bar{z}$$
Finally, the so-called "potence ratio" [9] has the same expression as Falconer's dominance coefficient ($D_F$):
$$H_{PR} = \frac{z_{12} - \bar{z}}{(z_2 - z_1)/2}$$
A value of 0 indicates additivity, 1 indicates $z_{12} = z_2$ (hybrid value = best-parent value), −1 indicates $z_{12} = z_1$ (hybrid value = worst-parent value), and > 1 (resp. < −1) indicates best-parent (resp. worst-parent) heterosis (Table 1). $H_{PR}$ explicitly includes the values of the three genotypes, whereas the other coefficients lack one of the parental values ($H_{BP}$ and $H_{bp}$) or both ($H_{MP}$ and $H_{mp}$, since a given mean can correspond to an infinity of parental values). From a genetic point of view, $H_{PR}$ is explicitly expressed in terms of the five genetic effects contributing to heterosis (Supplementary Table S1). Thus, the potence ratio, which is still by far the least used heterosis coefficient, is the only one that informs us of the exact position of the hybrid value relative to the parental values. Wright's dominance coefficient has the same property, but its inverse direction of variation makes comparisons less easy, which probably explains why it is not used in this context.
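The five coefficients can be computed side by side from the three genotypic values. The sketch below is illustrative only: it assumes the usual expressions (hybrid minus mid-parent or best-parent value, either raw or divided by that parental value, and the potence ratio as the deviation from the mid-parent divided by half the parental difference), and the function name and numbers are hypothetical.

```python
def heterosis_coefficients(z1, z2, z12):
    """Return the five heterosis coefficients for parents z1 < z2 and hybrid z12.

    Assumes positive trait values, so that dividing by the mean parental
    value or by the best-parent value is meaningful.
    """
    if z2 <= z1:
        raise ValueError("convention: z2 must be greater than z1")
    z_bar = (z1 + z2) / 2
    return {
        "H_mp": z12 - z_bar,                        # raw deviation from mid-parent
        "H_MP": (z12 - z_bar) / z_bar,              # normalized by mid-parent
        "H_bp": z12 - z2,                           # raw deviation from best parent
        "H_BP": (z12 - z2) / z2,                    # normalized by best parent
        "H_PR": (z12 - z_bar) / ((z2 - z1) / 2),    # potence ratio
    }


# Hypothetical cross: parents at 80 and 100, hybrid at 110.
for name, value in heterosis_coefficients(80.0, 100.0, 110.0).items():
    print(f"{name} = {value:.3f}")
```

Only the last entry uses all three genotypic values; the other four each drop at least one parental value, which is the point made in the paragraph above.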

Relationship between the Potence Ratio and the other Heterosis Coefficients
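It is easy to show that the relationship between $H_{PR}$ and the other coefficients is (with $z_2 > z_1$):
$$H_{MP} = H_{PR}\, z_m \qquad (1)$$
$$H_{BP} = \tfrac{1}{2}\,(H_{PR} - 1)\, z_b \qquad (2)$$
$$H_{mp} = \tfrac{1}{2}\, H_{PR}\,(z_2 - z_1) \qquad (3)$$
$$H_{bp} = \tfrac{1}{2}\,(H_{PR} - 1)\,(z_2 - z_1) \qquad (4)$$
where $z_m = \frac{z_2 - z_1}{z_1 + z_2}$ (the difference between the parental values normalized by their sum) and $z_b = \frac{z_2 - z_1}{z_2}$ (the difference between the parental values normalized by the best parental value). For a given $H_{PR}$ value, the coefficients $H_{MP}$ and $H_{BP}$ are linearly related to $z_m$ and $z_b$, respectively, i.e., they depend on the scale of the parental values. More specifically, the relationship between $H_{MP}$ and $z_m$ is negative when $H_{PR} < 0$ and positive when $H_{PR} > 0$, while the relationship between $H_{BP}$ and $z_b$ is negative when $H_{PR} < 1$ and positive when $H_{PR} > 1$. As $z_m$ and $z_b$ are positive, we see from Equation (1) that $H_{MP} = 0$ when $H_{PR} = 0$ and that $H_{MP}$ has the same sign as $H_{PR}$. Regarding $H_{BP}$, we see from Equation (2) that $H_{BP} = 0$ when $H_{PR} = 1$ and that $H_{BP}$ has the same sign as $H_{PR} - 1$. Beyond their sign, however, the values of $H_{MP}$ and $H_{BP}$ depend on $z_m$ and $z_b$ (Supplementary Figure S1). For instance, $H_{MP} ≈ 0.4$ can correspond to both mid-parent heterosis ($H_{PR} = 0.5$, $z_m ≈ 0.8$) and best-parent heterosis ($H_{PR} = 2$, $z_m ≈ 0.21$).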
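The ambiguity of the mid-parent coefficient can be reproduced with two hypothetical crosses. This is a minimal sketch under assumed values: the parental and hybrid values are invented so that the potence ratio is 0.5 in one cross and 2 in the other, and the second cross uses a normalized parental difference of exactly 0.2 (rather than the ≈ 0.21 quoted in the text) so that the arithmetic stays exact.

```python
def potence_and_mid_parent(z1, z2, z12):
    """Return (H_PR, H_MP, z_m) for parents z1 < z2 and hybrid z12."""
    z_bar = (z1 + z2) / 2
    h_pr = (z12 - z_bar) / ((z2 - z1) / 2)   # potence ratio
    h_MP = (z12 - z_bar) / z_bar             # mid-parent heterosis
    z_m = (z2 - z1) / (z1 + z2)              # normalized parental difference
    return h_pr, h_MP, z_m


# Hypothetical crosses, named only for this illustration.
crosses = {
    "distant parents": (1.0, 9.0, 7.0),   # hybrid between mid- and best parent
    "close parents": (4.0, 6.0, 7.0),     # hybrid above the best parent
}

for label, (z1, z2, z12) in crosses.items():
    h_pr, h_MP, z_m = potence_and_mid_parent(z1, z2, z12)
    # Mid-parent heterosis is the potence ratio scaled by z_m.
    assert abs(h_MP - h_pr * z_m) < 1e-12
    print(f"{label}: H_PR={h_pr:.2f}  z_m={z_m:.2f}  H_MP={h_MP:.2f}")
```

Both crosses give the same mid-parent heterosis although the hybrid sits in very different positions relative to its parents, so the mid-parent value alone cannot distinguish the two situations.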
We illustrate this by using experimental data from maize. We measured six traits (flowering time, plant height, ear height, grain yield, thousand-kernel weight, and kernel moisture) in four crosses (B73 × F252, F2 × EP1, F252 × EP1, F2 × F252) grown in three different environments in France (Saint-Martin-de-Hinx in 2014, Jargeau in 2015, and Rhodon in 2015). We computed $H_{PR}$, $H_{MP}$ and $H_{BP}$ for the 72 trait-cross-environment combinations. Figure 1A,B shows that the relationship between $H_{PR}$ and the other two coefficients is very loose, if there is one at all. A given $H_{PR}$ value can correspond to a wide range of $H_{MP}$ or $H_{BP}$ values, and vice versa. We performed the same analyses using the data published by Shang et al. [10], who measured five traits in two crosses of cotton grown in three environments. The same loose relationship between $H_{PR}$ and either heterosis coefficient was observed (Figure 1C,D). This means that the normalized differences between the parents, which are not related to heterosis, since they do not include values from the hybrids, markedly affect $H_{MP}$ and $H_{BP}$.
The $H_{mp}$ and $H_{bp}$ coefficients, which are not dimensionless, only provide the direction of heterosis. For a given $H_{PR}$ value, $H_{mp}$ can vary from −∞ to 0 when $H_{PR} < 0$ and from 0 to +∞ when $H_{PR} > 0$, and $H_{bp}$ can vary from −∞ to 0 when $H_{PR} < 1$ and from 0 to +∞ when $H_{PR} > 1$ (Equations (3) and (4)).
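The relationships referred to as Equations (1) to (4) can be recovered in a few lines. The derivation below is a sketch that assumes the usual definitions of the coefficients (deviation of the hybrid from the mid-parent or best-parent value, raw or normalized by that parental value); it only rearranges the definition of the potence ratio.

```latex
% Notation from the text: z_1 < z_2 are the parental values, z_{12} the
% hybrid value, \bar{z} = (z_1 + z_2)/2, z_m = (z_2 - z_1)/(z_1 + z_2),
% z_b = (z_2 - z_1)/z_2.
\begin{align*}
H_{PR} = \frac{z_{12} - \bar{z}}{(z_2 - z_1)/2}
  \;&\Longrightarrow\;
  H_{mp} = z_{12} - \bar{z} = \tfrac{1}{2}\, H_{PR}\,(z_2 - z_1), \\
H_{MP} &= \frac{z_{12} - \bar{z}}{\bar{z}}
  = H_{PR}\,\frac{z_2 - z_1}{z_1 + z_2} = H_{PR}\, z_m, \\
H_{bp} = z_{12} - z_2 &= (z_{12} - \bar{z}) - \tfrac{1}{2}(z_2 - z_1)
  = \tfrac{1}{2}\,(H_{PR} - 1)\,(z_2 - z_1), \\
H_{BP} &= \frac{z_{12} - z_2}{z_2} = \tfrac{1}{2}\,(H_{PR} - 1)\, z_b .
\end{align*}
```

Since $z_2 - z_1$ can take any positive value for a fixed $H_{PR}$, the raw coefficients $H_{mp}$ and $H_{bp}$ span a whole half-line, which is why they convey only the sign of $H_{PR}$ and of $H_{PR} - 1$, respectively.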
Let us examine the possible interpretation errors that may result from the use of $H_{MP}$ and $H_{BP}$.

The Pitfalls of the Most Commonly Used Heterosis Coefficients
The non-univocal relationship between $H_{PR}$ and the most commonly used heterosis coefficients has two consequences. (i) Comparing the coefficient values for a given trait in different crosses and/or environments and/or developmental stages leads to erroneous conclusions whenever these factors have an effect on the scale of the trait and/or on the difference between the parental values (i.e., on $z_m$ or $z_b$). Possible differences in deviations from additivity between these conditions cannot be detected. (ii) This problem is even more pronounced when studying different traits, because each trait has its own scale of variation, making $H_{MP}$ and $H_{BP}$ (and to a greater extent $H_{mp}$ and $H_{bp}$) useless for comparing the real levels of heterosis of these traits.
These pitfalls can easily be illustrated from our maize dataset. Figure 2A shows [...] (Figure 2B). Finally, the effect of the environment on heterosis also reveals obvious discrepancies between $H_{PR}$ on the one hand and $H_{MP}$ or $H_{BP}$ on the other hand (Figure 2C).
It is also informative to compare the profiles of heterosis coefficients for a trait measured during development or growth. A Hill function of time $x$, with constants $a$ and $b$ and Hill coefficient $n$, was used to fit the percentage of flowering individuals over time in the W117 × F192 and W117 × F252 hybrids and their parents. We then computed the heterosis coefficients over time for the percentage of flowering individuals estimated from the fitted curves (Figure 3). Again, $H_{PR}$ tells a different story when compared to $H_{MP}$ and $H_{BP}$. Because hybrid and parental values converge as flowering progresses, both $H_{MP}$ and $H_{BP}$ inevitably decrease when flowering nears 100%. The evolution of $H_{PR}$, which quantifies the "real" heterosis, is clearly different, with a monotonic increase in the W117 × F192 hybrid and a fluctuating profile in the W117 × F252 hybrid.
Similar results were observed in a simulation describing the increase in population size of a unicellular organism, which exhibits logistic growth. In the logistic model used, $y$ is the size of the population, $K$ the carrying capacity, $a$ a constant, $r$ the growth rate, and $θ$ the time. We assumed that the parents only differed in their growth rate $r$ and that there was additivity for this parameter. The results show that the $H_{MP}$ and $H_{BP}$ profiles for population size over time are clearly not congruent with that of $H_{PR}$ (Supplementary Figure S2).
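A simulation of this kind can be sketched in a few lines. The code below is not the authors' simulation: it assumes the common logistic form $y = K / (1 + a\,e^{-rθ})$, and the parameter values, function names and time points are all hypothetical. The only features taken from the text are that the parents differ in their growth rate alone and that the hybrid growth rate is the parental mean.

```python
import math


def logistic(theta, K, a, r):
    """Population size at time theta, assuming y = K / (1 + a * exp(-r * theta))."""
    return K / (1.0 + a * math.exp(-r * theta))


def coefficients(z1, z2, z12):
    """Return (H_PR, H_MP, H_BP) for parental values z1 < z2 and hybrid value z12."""
    z_bar = (z1 + z2) / 2
    h_pr = (z12 - z_bar) / ((z2 - z1) / 2)
    h_MP = (z12 - z_bar) / z_bar
    h_BP = (z12 - z2) / z2
    return h_pr, h_MP, h_BP


def simulate(thetas, K=1.0, a=99.0, r1=0.5, r2=1.0):
    """Heterosis coefficients for population size at each time in thetas (theta > 0)."""
    r12 = (r1 + r2) / 2  # additivity for the growth rate (assumption from the text)
    rows = []
    for theta in thetas:
        # Parent 2 grows faster, so z2 > z1 for every theta > 0.
        z1, z2, z12 = (logistic(theta, K, a, r) for r in (r1, r2, r12))
        rows.append((theta, *coefficients(z1, z2, z12)))
    return rows


for theta, h_pr, h_MP, h_BP in simulate([1, 5, 10, 15, 20, 30]):
    print(f"theta={theta:3d}  H_PR={h_pr:7.3f}  H_MP={h_MP:9.5f}  H_BP={h_BP:9.5f}")
```

Under these assumed parameters, the potence ratio starts negative and rises towards 1 as the populations approach the carrying capacity, whereas the mid-parent and best-parent coefficients shrink towards 0 because all three populations converge on the same size. Time 0 is excluded because the parents are then identical and the potence ratio is undefined.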

Discussion
Using simple theoretical considerations and relying on experimental data and simulations, we showed that the most commonly used heterosis coefficients, i.e., $H_{MP}$ and $H_{BP}$ (and their non-normalized forms $H_{mp}$ and $H_{bp}$), cannot and should not be used if the heterosis levels are to be compared between different traits, environments, genetic backgrounds, or developmental stages. Because their expression does not explicitly include the two parental values in addition to the hybrid value, these coefficients, unlike the potence ratio $H_{PR}$, do not quantify the deviation from additivity but only the normalized distance between the hybrid value and either the best or the mean parental value. The extent to which erroneous conclusions can be drawn when performing comparisons using these coefficients was illustrated with data from maize and cotton, and from population growth simulation in a micro-organism.
If $H_{MP}$, $H_{BP}$, $H_{mp}$, and $H_{bp}$ do not provide reliable information on non-additivity, why are they so commonly used? There are probably both historical and technical reasons: (i) the first scientists who quantified heterosis were plant breeders [11,12]. From an economic perspective, the goal was, and still is, to develop hybrids that are "better" than the best- or mid-parent values for the desired agronomic traits, and not to know where the hybrid value is relative to the parental values. Heterosis coefficients have been defined accordingly and the habit has remained; (ii) the coefficients giving the right non-additivity values, $H_{PR}$ for heterosis and $D_W$ or $D_F$ for dominance, can take on high to very high values when the parents are close, due to the small difference $z_2 − z_1$ in the denominator of the fractions. This can produce extreme values that are not easy to represent and manipulate for statistical treatments. Nevertheless, such values are biological realities that precisely convey the inheritance of the traits under study, something that $H_{MP}$, $H_{BP}$, $H_{mp}$, and $H_{bp}$ do not. Note that the two dominance coefficients used for monogenic traits have the same property, which does not prevent their use to the exclusion of any other. In addition, from a practical point of view, a single coefficient is sufficient to know the position of the hybrid relative to the mid- or best-parent, whereas in a number of studies the authors compute and comment on both $H_{MP}$ and $H_{BP}$ (or $H_{mp}$ and $H_{bp}$).
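The behaviour described in point (ii) is easy to reproduce. In the hypothetical sketch below, the mean parental value and the hybrid value are held fixed while the parents are brought closer together; all numbers and function names are invented for illustration.

```python
def h_pr(z1, z2, z12):
    """Potence ratio for parental values z1 < z2 and hybrid value z12."""
    z_bar = (z1 + z2) / 2
    return (z12 - z_bar) / ((z2 - z1) / 2)


def h_MP(z1, z2, z12):
    """Mid-parent heterosis for parental values z1 < z2 and hybrid value z12."""
    z_bar = (z1 + z2) / 2
    return (z12 - z_bar) / z_bar


mean_parent, hybrid = 100.0, 110.0   # held fixed (illustrative values)
for half_gap in (10.0, 1.0, 0.5, 0.25):
    z1, z2 = mean_parent - half_gap, mean_parent + half_gap
    print(
        f"z2 - z1 = {z2 - z1:5.2f}  "
        f"H_PR = {h_pr(z1, z2, hybrid):6.2f}  "
        f"H_MP = {h_MP(z1, z2, hybrid):.2f}"
    )
```

The mid-parent coefficient stays the same throughout, while the potence ratio grows as the parental difference shrinks: the hybrid lies further and further outside an increasingly narrow parental range, which is the biological reality that the large values express.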
In conclusion, to compare the amplitude of heterosis between traits, developmental stages, crosses, or environmental conditions, there is no other choice but to use $H_{PR}$, the only heterosis coefficient that is not affected by the scale of the parental values and that accounts for the position of the hybrid in the parental range.
Supplementary Materials: The following are available online at http://www.mdpi.com/2223-7747/9/7/875/s1. Table S1: Heterosis coefficients expressed as functions of genetic effects. Figure S1: Influence of the scale of the parental values on $H_{MP}$ and $H_{BP}$ for different values of the potence ratio $H_{PR}$. Figure S2: Heterosis for population size (simulations).

Funding:
The experiments in maize were funded by the French Agence Nationale de la Recherche (Amaizing project ANR-10-BTBR-01).