DEA Performance Measurements in Cotton Production of Harran Plain, Turkey: A Single and Double Bootstrap Truncated Regression Approaches

A single and a double bootstrap of data envelopment analysis examines Harran Plain cotton farming in Turkey. The single bootstrap technique was employed to derive the bias-corrected efficiency values under both constant returns to scale (CRS) and versus variable returns to scale (VRS) technologies while discriminating between the two technologies using a smoothed bootstrap test statistic. Results indicated that the farms operated under VRS technology. Given that VRS technology prevailed across Harran Plain cotton farmers sampled, we then determined factors affecting the bias-corrected technical efficiencies using the double bootstrap technique. Another important finding in the single bootstrap analysis is that cotton farmers in the region have a U-shaped technical efficiency based on the input and output scale. Thus, small-scale farmers tend to use their resources more efficiently in cotton farming than that of both mediumand large-scale farmers. Interestingly, the medium-scale farmers with resource inefficiency are at the forefront of the other two types of farmers (i.e., small-scale and large-scale) on the Harran Plain in Turkey. The results also showed that most of the farm and farmer specific as well as economic factors play a significant role in explaining the technical efficiency values.


Introduction
Cotton has strategic importance in Turkish agriculture, industry, and trade. Turkey is one of the top cotton-producing countries and produced 2,450,000 MT in 2017 [1], while the textile industry represents one of the leading sectors in the Turkish economy and accounted for 16% of total export value in 2017. Exports of ready-to-wear cotton items were worth $17 billion and textiles were valued at $8 billion in 2017 [2].
Rapid economic development and a changing demographic structure in Turkey has led to a fast increase in domestic demand for textile products. Specifically, due to the speedy increase in the number of textile and clothing stores and shopping centers throughout the country, domestic textile sales have increased significantly in recent years. Turkey's growing young population, migration to urban areas, refugee influx from Syria and other countries, and the increase in tourism have contributed to analyze the technical efficiency of cotton production on the Harran Plain using an up-to-date data enveloping analysis, DEA, (including the application of the double bootstrap technique) and seeks to close the current knowledge gap by providing empirical evidence on resource utilization efficiency. In this context, the available double bootstrap DEA techniques developed by Simar and Wilson [8], hereafter SW, were applied to the data. SW have empirically shown that the traditional two-stage DEA method (TTS-DEA) involves severe limitations. First, the TTS-DEA method is incompatible with the underlying data generation process (DGP) to produce meaningful statistical properties (e.g., unbiasedness) to describe technical efficiency scores. Additionally, SW have shown the correlation of environmental factors with error term since input and output variables interact with environmental factors. Secondly, the DEA efficiency scores are serially correlated invalidating statistical inferences. To overcome the constraints, SW used a double bootstrap procedure with consistent inferences to explain and predict efficiency scores with valid standard errors and confidence intervals [9].
Another feature that distinguishes our study is that earlier studies have implemented a double bootstrap technique by choosing the technical efficiency of the farmers (e.g., constant returns to scale, CRS, versus variable returns to scale, VRS) based on verbal assumption without empirically testing, whereas this study empirically presents the existence of the farm technical efficiency by using a single bootstrap technique. After the return to scale parameters for the sampled farms, assuming either CRS or VRS are determined by conducting a statistical test based on a single bootstrap technique, a double bootstrap method was then applied to determine both the efficiency amount and non-discretionary factors playing roles in the efficiency of cotton production on the Harran Plain. Therefore, the current study is novel in this respect.
Included in the subsequent sections is the outline of methods used in the analysis and data description. Model comparison together with a specification test and discussion of the effects of variables on the DEA efficiency scores are given in Section 3. The final part presents conclusions along with some policy implications.

Survey Design and Variable Selection for the Empirical Specification
First, 51 villages inhabited by 1029 cotton farmers were purposely selected, thought to be representative of the study area. Next, a total of 126 cotton farmers were selected using a stratified random sampling design introduced by Yamane [10] with an allowable percentage error margin of 5%. To assure the representativeness of the cotton farmer population, the selected farmers were divided into four size segments leading to a stratified distribution yielding 49, 49, 21, and 7 cotton farmers in each stratum. Farmer participation was voluntary. Farmers were supplied with diaries at the beginning of the 2012 production season. For the production season of 2012, we made a comparison between the variations in cotton yields obtained from the participant and non-participant farmers and no significant difference existed, indicating the Hawthorne effect was not present in our data set. In addition, we ascertain that our data set would not run into the problem of self-selection bias reasoning that our participatory farmers were randomly selected and even if the effects of both the self-selection bias and the Hawthorn are possible these effects would be offset thanks to the bootstrap technique used in this analysis. Meanwhile, in a standard DEA context, homogeneity in DMU's refers to the condition where all DMUs under investigation are subject to similar conditions in terms of the topography, climate, and commonly applied farming techniques. When we apply this perspective, the homogeneity requirement in DMU's in our data set is attained because all our farmers operate in the same region (the Harran Plains), face the same climatic conditions, and, therefore, apply similar farming techniques. Additionally, our output and input sets are constructed by taking representative (identical) measures into consideration. For example, while the farmers had a choice of using different fertilizer types, we converted those usages into the net nitrogen and phosphorous amounts. Similarly, in the calculation of family and hired labor, we converted the total number of hours and used the net Agriculture 2020, 10, 108 4 of 17 man-hour equivalent instead. Thus, we met the homogeneity requirement across all DMUs for every input used and output produced by the cotton sector of the Harran Plain region.
The recording of the information was controlled by 10-20 visits to farmers throughout the season depending on the village location. Such an approach built a trusted relationship between the survey workers and the farmers, which better motivated farmers, minimizing the risk of recording false entries. Farmers received payments for recording financial and production information after harvest. Each farmer was also interviewed on matters related to (1) production characteristics, including measures such as size of the operation, ownership type, yields, and land characteristics; and (2) farmer characteristics such as gender, age, and education (Table 1). After deleting an outlier value in one of our inputs in DEA, the remaining 125 farms formed our working sample. Table 1 lists definitions of variables used in the empirical analyses including units of measurement. Five variables (e.g., seed (SEED), the amounts of nitrogen and phosphorus (FRTLZR), family and hired labor (LABOR), herbicide and insecticide value (PESTICIDE), and value of working capital (OTHRCAP) capture the inputs used in cotton production per decare (YIELD). Farm and farmer characteristics used as explanatory variables affecting DEA efficiency scores include mutually exclusive multiple dummy variables representing farmer experience (Table 1), household size, education level, location, and farm size along with single dummy variables indicating off-farm work and land ownership. Other performance-related determinants that are discrete in nature include farmer age in years, the number of parcels owned or rented, and the irrigation frequency (the time irrigation recurs on the field). The only performance-related determinant measured in continuous fashion is the percentage share of family labor input in the total labor force, ranging from zero to one.
Location dummies indicate the municipal division of the Sanliurfa province and are used to identify the impact of location on the farm performance. It is hypothesized that farmers located in the central district are more efficient than those located in the Harran and Akcakale districts because farmers in the central district may have easier access to information. To avoid the dummy identification problem, one of the location dummies is used as a reference variable. Similarly, land ownership could have an ambiguous impact on efficiency. For example, land ownership could create an incentive to use soil-improving techniques in favor of efficiency, while tenancy might encourage the tenant farmer to use inputs more efficiently. Additionally, the impact of the share of family labor has an ambiguous effect (positive or negative): a larger share of hired labor may imply a more specialized, and thus more productive labor, but it might also be a source of moral hazard [11]. Farmer age could be expected to have a positive impact on efficiency as older farmers are more experienced, but some authors discuss reasons for the opposite relationship [12], perhaps due to physical deficiencies as the farmer gets older (e.g., age impairment non-linearity). Similarly, higher efficiency scores are expected for farms where full-time experienced farmers are more educated, household size is smaller, farmer operates on the land parcels close to each other, and irrigates the land. In small-scale families, there may be an inter-individual division of labor in which each member specializes in his/her task, and this attitude can be seen as a factor that increases technical efficiency. On the other hand, technical efficiency in scattered parcels can, of course, be disadvantageous compared to peers in parcels that are close to each other.

The Modeling Approach
The DEA production frontier is constructed using linear programming techniques, which render a piece-wise linear frontier that envelops the observed input and output data. Technologies produced in this way possess the standard properties of convexity and strong disposability [13]. The DEA technique measures relative efficiencies of a collection of farms in transforming inputs into outputs. Its origins date back to Charnes et al. [14], who introduced the CCR model based on the works of Farrell [15] and others. Later, Banker et al. [16] introduced the BCC model and accounted for variable returns to scale by adding a convexity constraint. The original DEA specification has led to the multi-stage model development to cope with slacks and to meet criteria identifying the nearest efficient points [17] and making the model invariant to units of measurement.
An input-oriented DEA model is given below for n decision-making units (farms), each producing Y outputs by using m different inputs. In this formulation X i is the ith farm's (mx1) input vector. For the whole sample, Y represents the (1xn) cotton yield vector and X denotes the (mxn) input per decare matrix. Focusing on the unit area limits variability by minimizing the effects on input and output values. Under the assumption of constant returns to scale (CRS), the efficiency score is the set of solutions to the following linear programming problem: Agriculture 2020, 10, 108 6 of 17 whereθ is the technical efficiency score for the ith farm; λ is an Nx1 vector of constants, where N is the number of farms in the sample; y i shows the ith farm output per decare, while x i denotes a vector of inputs per decare used in the production of y i by the ith farm; Y λ and X λ denote a projected point due to radial contraction of the input vector x i . The objective of the above linear programming is to find the minimumθ so that the input vector x i reduces to X λ , while holding the output level y i constant. In this context, the value ofθ will range between zero and one, with a score of near-zero implying inefficiency, while a score of one implying a point on the frontier where the farm in the region is technically efficient. For a specification under variable returns to scale (VRS), the additional convexity constraint 1λ = 1 is added to the above linear programming, where 1 is a vector of ones [16]. The constraint guarantees that an inefficient farm in the region is only benchmarked against units of a similar size. The scale efficiency (SE) of an ith farm is obtained by dividing the technical efficiency scores under CRS to the technical efficiency scores under VRS and is at a (0, 1) interval. The approach allows the comparison of the technical efficiency under the CRS and VRS technologies by using a smoothed single bootstrap technique [18]. The bias-corrected technical efficiency values under both technologies were then derived. When the SE is 1, the farm has an efficient economy of scale, otherwise, inputs used in production are not efficient in scale [19].
To determine under what scale these technical efficiencies are derived, the test is applied whether the scale efficiency is 1 (CRS) or against the alternative hypothesis that the SE < 1 (VRS). The test statistic is: and the H 0 is rejected if S is significantly smaller than one. As such, a critical threshold value (C α ) for statistic S is searched and if this critical value C α is smaller than S, the H 0 hypothesis is rejected. Unfortunately, the true distribution of S under H 0 is unknown (the hypothesis of CRS) so C α cannot be directly calculated, but Simar and Wilson [8,18] showed that one can bootstrap the distribution of S under H 0 in their FEAR R package (FEAR R package, obtains bootstrapped CRS and VRS, respectively, as follows: Bc < −boot.sw98(x,y,NREP = nrep, RTS = 3) and Bv < −boot.sw98(x,y,NREP = nrep, RTS = 1, XREF = x, YREF = y, DREF = 1/e), where RTS = 3 and RTS = 1 for CRS and VRS, respectively; nrep is the number of bootstrap replications (e.g., here, 10,000), and DREF = 1/e, where 1/e is the efficiency score under the CRS technology. The estimated value of S in R package can be called for S < −colSums(1/Bc$boot)/colSums(1/Bv$boot), while in benchmark package in R, critValue(S,0.05) reports the C α value for the test). The next step quantifies the technical efficiency scores by non-discretionary factors using a double bootstrap technique after empirically revealing the derived technical efficiency values. The quantification involves taking the inverse of the technical efficiency values in Equation (1) and defining it asδ i = (1/θ i ). As a result, the variable dependent on the set of non-discretionary variables is transformed from the double boundary dimension to the single boundary dimension. In such a case, δ i is confined to the intervalδ i [1, ∞) and the left-limit truncation regression is used to determine factors associated with the reciprocal of the technical efficiency scores. The value ofδ i equals one indicates an efficient farm, while the largerδ i value indicates an inefficient operation.
The relation between the dependent variable and non-discretionary variables can be shown as: where z is an (NxK) matrix of non-discretionary variables, β is a vector of associated parameters to be estimated, and ε i is a continuous idd random error term. Given the necessity ofδ i ≥ 1 − z i β, ε i is distributed normally with left truncation at (1 − z i β) and standard deviation σ . Under this assumption, the parameters of the model in Equation (3) are obtained using the following left-truncated likelihood function: where ∅ and Φ are the univariate probability density and cumulative distribution functions for the standard normal, respectively. Fortunately, as long as the data generating process (DGP) is defined, the double bootstrap technique can be used to empirically approximate the sampling distributions of β and σ and construct the confidence interval for proper inferences [20]. Detailed information about how the bootstrap works is included in the appendices provided at the end of this manuscript.  The results indicate that an average cotton farmer that performs as efficiently as its benchmark can achieve the same level of output using 28.8% and 16.5% fewer inputs, on average, under CRS and VRS, respectively. The original DEA efficiency scores range from 26.10 to 100 for CRS and from 39.70 to 100 for VRS. The farms with the worst performance could save approximately 74% of their inputs by shifting to the efficiency frontier.

Results and Discussion
The production under VRS technology is more efficient than that of CRS technology (Figure 1). The efficiency scores obtained from the VRS technology for 109 out of 125 farms (87.3%) are greater than the efficiency scores obtained from the CRS technology, providing sufficient evidence that there is room to improve cotton production efficiency by modifying input use. Only 16 of 125 farms and 32 of 125 farms for CRS and VRS technologies, respectively, were found to be fully efficient. Interestingly, the VRS technology yields twice as many fully efficient farms than the CRS technology. Additionally, VRS values performed above 40% when the efficiency value was below 40% in CRS (Figure 1). (The higher efficiency of the VRS technology relative to that of the CRS technology is shown in Figure 1). The percentage of cotton farmers who use inputs at a 90% efficiency or above is 18.40% (approximately 23 farms) and 37.6% (approximately 47 farms) under CRS and VRS technologies, respectively, showing the potential for a 10% reduction in input use without a decrease in cotton yield.
18.40% (approximately 23 farms) and 37.6% (approximately 47 farms) under CRS and VRS technologies, respectively, showing the potential for a 10% reduction in input use without a decrease in cotton yield. Table 2 compares the original DEA efficiency scores with those of the bias-corrected bootstrap efficiency estimators under CRS and VRS technologies. The bootstrap technique specified here, which is the focus of the discussion hereafter, includes the analysis of the first appendix (Appendix A) in the appendices. The signs of bias are negative for all input-oriented cotton production. All original efficiency scores under both technologies are greater than the bias-corrected results (Figure 2). At a performance below 60% in CRS, VRS is more likely to exceed this value in DMU. A similar situation can be extended to other technology levels.  The average efficiency scores for CRS and VRS are 64.11% and 76.75%, respectively, indicating the opportunity for use from 35.89% to 23.25% for the same level of cotton yield (Table 2). For example, under the bias-corrected DEA, the efficiency of VRS technology outweighs the efficiency of  Table 2 compares the original DEA efficiency scores with those of the bias-corrected bootstrap efficiency estimators under CRS and VRS technologies. The bootstrap technique specified here, which is the focus of the discussion hereafter, includes the analysis of the first appendix (Appendix A) in the appendices. The signs of bias are negative for all input-oriented cotton production. All original efficiency scores under both technologies are greater than the bias-corrected results (Figure 2). At a performance below 60% in CRS, VRS is more likely to exceed this value in DMU. A similar situation can be extended to other technology levels.
The efficiency scores obtained from the VRS technology for 109 out of 125 farms (87.3%) are greater than the efficiency scores obtained from the CRS technology, providing sufficient evidence that there is room to improve cotton production efficiency by modifying input use. Only 16 of 125 farms and 32 of 125 farms for CRS and VRS technologies, respectively, were found to be fully efficient. Interestingly, the VRS technology yields twice as many fully efficient farms than the CRS technology. Additionally, VRS values performed above 40% when the efficiency value was below 40% in CRS (Figure 1). (The higher efficiency of the VRS technology relative to that of the CRS technology is shown in Figure 1). The percentage of cotton farmers who use inputs at a 90% efficiency or above is 18.40% (approximately 23 farms) and 37.6% (approximately 47 farms) under CRS and VRS technologies, respectively, showing the potential for a 10% reduction in input use without a decrease in cotton yield. Table 2 compares the original DEA efficiency scores with those of the bias-corrected bootstrap efficiency estimators under CRS and VRS technologies. The bootstrap technique specified here, which is the focus of the discussion hereafter, includes the analysis of the first appendix (Appendix A) in the appendices. The signs of bias are negative for all input-oriented cotton production. All original efficiency scores under both technologies are greater than the bias-corrected results (Figure 2). At a performance below 60% in CRS, VRS is more likely to exceed this value in DMU. A similar situation can be extended to other technology levels.  The average efficiency scores for CRS and VRS are 64.11% and 76.75%, respectively, indicating the opportunity for use from 35.89% to 23.25% for the same level of cotton yield ( Table 2). For example, under the bias-corrected DEA, the efficiency of VRS technology outweighs the efficiency of The average efficiency scores for CRS and VRS are 64.11% and 76.75%, respectively, indicating the opportunity for use from 35.89% to 23.25% for the same level of cotton yield ( Table 2). For example, under the bias-corrected DEA, the efficiency of VRS technology outweighs the efficiency of CRS technology in yield of cotton production by about 12 points. Under the assumption of CRS technology with the bias-corrected DEA estimators, the share of farmers who use inputs efficiently at 50% or above and 90% or above is 79.20 and 0.80 percent points, respectively. The corresponding shares under VRS technology are 99.2% and 7.2%, respectively. Under the VRS, the share of farmers who use 10% or fewer inputs inefficiently is 0.8% under the biased-corrected DEA, but it increases to 37.6% under the corresponding uncorrected DEA. Therefore, efficiency scores derived from the uncorrected (CRS or VRS) technologies tend to be upward biased. The uncorrected DEA bias was alleviated by the bootstrap technique (Figure 3). Under the CRS and VRS technologies, respectively, 20.8% and 0.8% of the farms in the region have an efficiency score of less than 50%.   The confidence interval for the mean value of the biased-corrected DEA efficiency scores can be calculated in two different ways. The confidence interval can be obtained by taking the averages of the confidence intervals calculated for the individual efficiency scores using the bootstrap estimators. The other, more robust method is the same as applied in calculating the confidence interval for the mean of the uncorrected DEA efficiency scores using the classical method. The choice of either method is unclear in the literature [9,[21][22][23]. As can be seen in Table 2, the confidence intervals obtained from the average of the individual confidence intervals of the bootstrap technique are much larger than those obtained by the classical method. While the confidence intervals obtained from the mean of the individual confidence intervals of the bootstrap estimators indicate that the efficiency scores vary widely, the confidence intervals obtained by the classical method show that this variability is limited for both technologies. The uncorrected and bias-corrected DEA efficiency scores for both CRS and VRS technologies have similar confidence interval widths, while the unadjusted DEA scores for both technologies have a higher confidence interval. For example, the confidence interval obtained by the classical method for VRS technology implies that the farmers could reduce their inputs in cotton production by 14.05%-18.95% while maintaining the same level of yield.
To find out which operating technology is prevalent on the Harran Plain, one can test returns to scale parameters by using the DEA efficiency scores and bootstrapping procedure [8,18,24]. The null hypothesis of the prevalence of the CRS technology can be rejected if the estimated statistic is less than the critical value obtained by the bootstrapping estimators. The computed test statistic is S = 0.8527 and the bootstrapping analysis computed the critical value of 0.9360 at 5% significance level, resulting in the rejection of CRS technology in favor of VRS technology. Such an outcome supported expectations based on the comparison between CRS and VRS technologies, making the VRS technology the dominant cotton production technology among the Harran Plain farmers.
The level of association between inputs, outputs, and biased-corrected efficiency levels was examined once the appropriate cotton production technology was identified. Table 3 classifies the inputs used in cotton production, using the logarithmic scale. The results indicate that farmers cluster in the third and fourth groups. In each input group, the bias-corrected technical efficiency variable showed a different structure and high variability in cotton production. Especially in the second, third, and fourth groups, the value of bias-corrected technical efficiency decreased sharply as the use of resources increased to achieve higher cotton output. However, in the fourth group, the average efficiency values recovered slightly more than the values in the other groups. Nevertheless, the value is still below the average value of the first group in the classification of all inputs used in cotton production. When farms expanded their inputs, up to 35% of input was wasted until the fourth group. Therefore, the diseconomies of scale in cotton production prevail, a cost disadvantage, leading to increase per unit-cost cotton production. The relationship between the usage levels of aggregated inputs (e.g., capital, labor, and land), the cotton yield and corresponding biased-corrected efficiency scores are depicted in Figures 4-6. For example, when the cotton yield and efficiency performance of farmers are evaluated, an outward-looking U-shaped curve emerges (Figure 3). In cotton yield, it is observed that medium-sized farm enterprises are inefficient compared to small and large-scale farm enterprises. Agricultural enterprises with average cotton yield per decare have higher input losses compared to low and high yield enterprises in terms of yield production. These businesses can reach the same efficiency level by using less input on their cotton production. Business farms that plan high yields per decare generally use resource allocation efficiently, rewarding themselves with higher efficiency scores. On the other hand, when we look at the relationship between biased-corrected efficiency scores and capital input (Figure 4), it seems that the efficiency scores of the farmers who are poor in capital are higher than the farms with medium and high levels of capital. Therefore, the input losses of capital-intensive farms will be higher than the farms in other groups. This is most likely due to idle capital accumulation in farms (e.g., more than one tractor), and excess capital needs to be redirected towards the production of other products or combinations of production process.      There is a similar U-shaped relationship between efficiency values and farmers' labor input use levels ( Table 5). Those who use medium-scale labor-intensive resources in cotton production will have higher input losses than the other two types of farm enterprises on the Harran Plain. Considering that such cotton farm enterprises are in sharecropper (tenant) types, it is possible for them to reach the cotton yield level of small and high-scale enterprises by making more savings in the amount of labor. The relationship between the land amount and efficiency scores is similar to the    There is a similar U-shaped relationship between efficiency values and farmers' labor input use levels ( Table 5). Those who use medium-scale labor-intensive resources in cotton production will have higher input losses than the other two types of farm enterprises on the Harran Plain. Considering that such cotton farm enterprises are in sharecropper (tenant) types, it is possible for them to reach the cotton yield level of small and high-scale enterprises by making more savings in the amount of labor. The relationship between the land amount and efficiency scores is similar to the There is a similar U-shaped relationship between efficiency values and farmers' labor input use levels (Table 3). Those who use medium-scale labor-intensive resources in cotton production will have higher input losses than the other two types of farm enterprises on the Harran Plain. Considering that such cotton farm enterprises are in sharecropper (tenant) types, it is possible for them to reach the cotton yield level of small and high-scale enterprises by making more savings in the amount of labor. The relationship between the land amount and efficiency scores is similar to the relationship between capital and activity. It seems the presence of distant fragmented land will decrease the efficiency in cotton production.
The level of relationship between the land amount and efficiency scores is similar to the level of the relationship between capital and activity. It seems that the amount of land will become more difficult as the cotton production plan becomes difficult, or perhaps the presence of distant fragmented land will decrease the efficiency in cotton production. Overall, the bias-corrected technical efficiency values in the region resemble a U-shaped curve, i.e., the most efficient cotton farms were the smallest and the largest. However, the results from the smallest cotton farms require attention because of the large farm number in the sample yet few of them in each input group (Table 3). Lerman [25] and Latruffe et al. [26] indicated low technical efficiency among the mid-scale farms. The region may develop remediation programs for farmers who waste resources in cotton production. Table 4 shows the parameters of the bias-corrected efficiency values along with their confidence interval values derived from the double bootstrap technique for the truncated regression model (see Appendix B). The subsequent discussion is based on the bias-corrected technical efficiency scores and covers only statistically significant variables. Since the dependent variable is the reciprocal of bias-corrected technical efficiency in the double bootstrap analysis, the technical efficiency decreases with positively signed parameters and increases with negatively signed parameters. The covariate that significantly and negatively impacts the bias-corrected efficiency scores is the household size. Thus, the bias-corrected technical efficiency scores under the assumed VRS technology increase as the household size increases. This finding is consistent with research reported elsewhere [27]. The increase in household size can pose a challenge, especially if children are present because the adults may not allocate time to cotton production when needed. This finding coincides with that reported by Coelli et al. [19].
Having an off-farm job negatively impacts farm technical efficiency. While this is a weak trend and the finding is not consistent with a previous study [12], it is in keeping with all findings reported earlier [19,22]. Farmers with non-agricultural income may have to concentrate resources on non-agricultural businesses which may lead to less efficiency on farms.
Cotton yield efficiency decreases gradually in the region as the size of agricultural farms increases compared to small-scale enterprises (less than 5 ha) and the results coincide with the findings of the international literature [7,26,[28][29][30][31][32], while they contradict findings reported by Wadud and White [33], Latrufe et al. [9], and Balcombe et al. [22]. While cotton farmers in Akçakale are less efficient than farmers in the central district, farmers in the Harran district are more efficient than those in the central district, but the differences are statistically insignificant. It might be due to easier access to information and the proximity of the relevant private and public institutions to farmers in the central district.
Technical efficiency declines for farms dependent entirely on hired labor. Such farms are deprived of control over the business, while the work division might not have been fully rationalized among employees. In contrast with this finding, Latruffe et al. [26] and Olson and Vu [23] found that hired labor increased the technical efficiency of farms. Technical efficiency increases with the ratio of family to total labor. This finding supports the previous result which showed the positive effect of family workforce on efficiency scores. Although the results were found to be statistically insignificant, the more tractors, machinery, and equipment farmers own, the more efficient cotton production in the region becomes. The results support the notion that the tractor can maintain the largest share in tool and machinery capital, while in general, it might be a proxy variable for high-income or large-scale farm enterprises. Since the tractor can be simultaneously used in more than one production pattern, input usage time or harvest time may sometimes overlap in different fields or different crops. Therefore, overlapping time can, of course, pose a reduction in technical efficiency. Additionally, surprisingly the average technical efficiency of a cotton farm increases as the number of operated parcels, owned or rented, increases, as long as these parcels are close together, giving farmers the chance to manage their products simultaneously.
Technical efficiency increases with an increase in the ratio of capital to labor, while it decreases with an increase in the ratio of land to labor. Such findings correspond to earlier reports [9,26] and indicate planned and rational distribution of resource use by some over-capital farms in cotton production process, whilst they are compatible with the effects of tractor and machine variables on efficiency on the Harran Plain. A confirmed negative relation between technical efficiency and land to labor ratio indicates that the amount of land per-hourly unit of labor decreases the technical efficiency of cotton farming, contradicting earlier results of Olson and Vu [23]. The efficiency values increase as the amount of support given to cotton by the government increases, although its impact was not statistically significant. Cash supports may be more effective when given during the production process.

Conclusions
The current inquiry's focus is twofold. First, the study investigated the relative performance measurements of farms using a random sample of 125 cotton farmers operating on the Harran Plain, Turkey, and applying the single bootstrap approach (Appendix A). Results revealed that the bias-corrected technical efficiency scores under the assumptions of CRS and VRS technologies averaged 71.20% and 83.50%, respectively, implying that sampled cotton farmers wasted 28.8%-16.5%, respectively, of inputs. Concerning the average efficiencies, poor input use has contributed to declining technical efficiency. Meanwhile, overuse of input ranging from around 17% to 29% poses a major threat to the environment on the Harran Plain, while it is also economically indispensable. The biased and bias-corrected technical efficiency values under the assumption of VRS were found higher than those computed under CRS technology. The finding has been further confirmed by a statistical test to generate its critical value using a bootstrap technique.
Another important result in the single bootstrap analysis is that the cotton farms are characterized by a U-shaped technical efficiency based on the input and yield scale. Thus, small-and large-scale farms use inputs more rationally than medium-size farms, wasting relatively less input to produce the same cotton output per decare. Additionally, medium-size farms need to balance capital, labor, and land against input applications. The timing and frequency of irrigation to reach the same cotton production per decare may be another input that requires attention and could prevent not only excessive water use but long-term consequences associated with salinity.
This paper also applied a double bootstrapped truncated regression (DBTRM) to quantify the relation between non-discretionary and bias-corrected technical efficiency (Appendix B). The analysis takes place in a unique loop shown in the model section. The analysis of the marginal effects indicates that farmers engaged in agricultural work are likely to be technically more efficient. Although non-agricultural work offers informal education opportunities broadening knowledge, teaching new skills, and enriching experience, which might improve the ability to manage input use, it has not been found, unfortunately, to transfer this accumulated knowledge to agriculture. The understanding that technical efficiency increases with household size could provide an opportunity for the family workforce to increase cotton production. The result that coincides with an increase in technical efficiency was the land area decrease, supporting the notion that the household labor endowment does match the operated land area. Further investigation of the issue is warranted because the age structure of all household members and gender composition of the farm was not considered. Since the expansion of land operated by a farm is possible only by renting or buying land from another farmer in the region, creating job opportunities outside agriculture is a broader challenge faced by provincial and national governments.
The increase in technical efficiency in households having a tractor, but the same effect of other machinery likely reflects the use of a tractor for multiple purposes, not limited to fieldwork. It appears that only if other equipment is owned by a farm will technical efficiency increase. Future studies may collect more detailed data on tractors and other equipment to establish a more accurate link to technical efficiency.
The decreasing technical efficiency on farms using only hired labor indicates that the landowner allocates a limited amount of time to manage the farm. Possibly, the offered remuneration provides little incentive or the quality of hired labor is low. If the reason for using only hired labor results from the landowner having a job outside agriculture, the phenomenon may open a chance for the development of professional farm managers, who could improve cotton production, especially given the prevailing waste of inputs. Consistent with these results, technical efficiency increases as the ratio of family to total workforce increases, indicating that through a planned division of labor, the family can achieve a desired production yield in the region.
Factors that have a consistently positive impact on efficiency measures include the ratio of capital to labor and increased land fragmentation. The latter effect on efficiency scores is consistent with the generally accepted idea that farmers operating a large number of parcels tend to be more efficient. Addressing land fragmentation through policy is difficult because the reasons behind land fragmentation are often rooted in culture; encouraging land consolidation through targeted grants may be a more viable approach.
One possible limitation of this study is the lack of knowledge of the seed types and the kind of technology that farmers use. For example, if a distinction between conventional and organic cotton production could have been made, meta frontier analysis would have provided more robust results than the bootstrapped DEA analysis. A future study can investigate the impacts of farm and farmer characteristics on technical efficiency using the stochastic frontier approach. In that case, technical efficiency measures take into account differences in the sources from which errors originate. That perspective can attempt to distinguish between errors arising from farmer discretion and errors arising from systematic factors.  (2) a new pseudo data is constructed for x * i and y * i , where x * i = (θ i /θ * i )x i and y * i = y i for i = 1, . . . , N, with y i and x i the original output and input vectors of the ith farm, respectively; (3) new DEA efficiency score is obtained for each farm in the region by taking the previous pseudo data as a reference for each CRS and VRS conditions; (4) steps (1) to (3) are repeated B times to yield a set of B new DEA either efficiency scoresθ * , CRS i or efficiency scoresθ * , VRS i . Here, B is set as 10,000 in each CRS and VRS application.

Appendix B. Simar and Wilson's Algorithm # 2
Following Simar and Wilson's [24] Algorithm 2, the procedure is as follows: Step 1 obtains the DEA technical efficiency scores (θ i ) under determined scale using Equation (1). Now, take the reciprocal of the technical efficiency score by lettingδ i = (1/θ i ) whereδ i bounds from one to infinity for i = 1, . . . , N and it becomes compatible with the left-truncation regression, mostly used in output-oriented approach; Step 2 uses the method of maximum likelihood for the left-limit truncation regression to estimatê β of β andσ of σ in the left-limit truncation regression ofδ i on non-discretionary variable Z, wherê δ i > 0; Step 3 implies a loop for each i = 1, . . . , N following four steps (i)-(iv) B 1 times to obtain N set of bootstrap estimates {δ * i,b } B 1 b=1 : (i) draw ε i from N(0,σ ε ) distribution with left-truncation at (1 − z iβ ); (ii) Compute δ * i = z iβ + ε i ; (iii) construct a pseudo data set x * i , y * i where x * i = (δ i /δ * i )x i and y * i , y i . Note that we can also construct the pseudo x * i from the original DEA efficiency scores as x * i = [(1/θ i )/(1/θ * i )]x i = (θ * i /θ i )x i , whereθ * i = z iβ + ε i ; (iv) using the above pseudo data set and the DEA in Equation (1) under the determined scale calculate pseudo efficient estimatesδ * i = 1/θ * i for i = 1, . . . , N; Step 4 computes bias-corrected estimatesˆi δ where bias =δ i −δ * i , whereδ * i = ((1/B 1 ) B 1 b=1δ * i,b ) −δ i ; Step 5 regressesˆi δ on z i to estimateβ andσ ε using the maximum likelihood for the left-truncation; Step 6 is a loop over the next three steps (i)-(iii) B 2 times to generate a set of bootstrap estimates (i) for each i = 1, . . . , N draw ε i from N(0,σ ε ) distribution with left-truncation at (1 − z iβ ); (ii) compute δ * * i = z iβ + ε i ; (iii) use the maximum likelihood method to estimate the truncated regression of δ * * i on z i to yield estimatesβ * andσ * ε ; Step 7 uses the bootstrap estimates (β * andσ * ε ) from Step 6 and the estimates (β andσ ε ) generated in Step 5 to construct (1−α) confidence intervals for original β and σ ε . The (1−α) confidence interval of the jth element of the vector β as the Pr(−b /2 ≤β * j −β j ≤ −a /2 ) ≈ 1 − α such that the estimated confidence interval is [β * j + a * /2 ,β j + b * /2 ].