Assessment of agricultural mechanization status of potato production by means of Artificial Neural Network model |
来源:一起赢论文网 日期:2013-06-04 浏览数:4317 【 字体: 大 中 小 大 中 小 大 中 小 】 |
Abstract An artificial neural network (ANN) model wa s developed to assess mechanization status of potato farms in Iran. Mechanization in dex (MI) and level of mechanization (LOM) were used to characterize farming system of potato production in the region. To develop A NN model, data were obtained from farmers, gover nment officials as well as from relevant databases. A wide range of explanatory pa rameters of farming activities were examined. Finally, 19 explanatory parameters were used as input variables to predict MI and LOM. Based on performance measures, single hidden layers w ith 8 and 3 neurons in the hidden layer were finally selected as the best configura tion for predicting MI and LOM, respectively. For the optimal ANN models, the values of the model’s outputs correlated well with actual outputs, with coefficient of determination (R2) of 0.98 and 0.99 for MI and LOM, respectively. Sensitivity analyses were also conducted to investigate the effects of various explanatory parameters on th e outputs. Since the ANN model can predict the two mechanization indicators for a target farming system with high accuracy, it could be a good alternative to regression for assessing agricultu ral mechanization of regional farms with similar conditions. Keywords: Potato, Farming system, Neural networks, Assessment, Mechanization index, Level of mechanization. Introduction Better knowledge of the past and present is a key component for the improvement of the planning process that will impact Iran’s agricultural sector in the years to come. Findings from this and similar studies can be used to set new directions for the analysis of the technological status of Iran’s agriculture. In this paper special attention is paid to mechanical inputs. In order to maximize the efficiency of introducing agricultural technology to farms in a target region, th e farming system of the region should be first characterized, es pecially to identify possible resource constraints and to capture the diversity of farming systems (Sims, 1987, Collado and Calderón, 2000, and Oida, 2000) Monitoring the mechanization status of target region, in combination with other agronomic indicators such as product- ivity potential, would lead to a better assessment of the sustainability of the farming sy stem (Garcia et al., 2005). Singh, 2002 stated that growth of crop production depends on the three sources: arable land expansion, increase in cropping intensity and yield growth. Okurut and Odogola, 1999 reported that besides land, farm power is the second most important input to agricultural production. Warkentin, 1991 stated that water use efficiency is highly concern of crop management. Barton, 1999 reported that farm power determines the scale and intensity of farm operation. Sarker, 1999 reported that adoption of power tillers for tilling has brought some significant changes on overall production and sustainability of small farm systems. Borlaug and Dowswell, 1993 stated that crop production environment including the genera tion and transfer of appropr- iate technology must be improved to increase the fertilizer use efficiency to meet the challenge of feeding increased population. Mechanization is a concept and cannot be measured directly. Appropriate indicators mu st be selected to determine levels of mechanization. An indicator of mechanization is a variable that allows describi ng and monitoring the processes, states and tendencies of systems at the farm, regional, national or worldwide levels. Evaluation of the performance of agri- cultural production system has been made with a set of evaluating indicators/parameters. The parameters include mach- inery energy ratio (MER), mechanization index (MI), level of mechanization (LOM), productivity level of consumed power (PLCP), energy input to produce per unit energy output (EIO), energy input to generate per unit GDP output, energetic efficiency of the system (EE), etc. A part of the present study was the formulation of an index to measure the mechanization status achieved at the farm level. To assess mechanization status of a farming system some of these indicators may be used to characterize the production of various agro-products in a region. Here, MI and LOM are chosen to investigate farming system of potato production in the Hamadan province of Iran. Table 1. Socio-economic structure of potato farms (per farm) The two indicators (MI and LOM) were defined because they would allow us to identify whic h farming systems in the region would benefit most from mechanization and to estimate the intensity of mechanization as part of an agricultural moderni- zation program. LOM is based on the premise that a mechanized farmer is the one that finds a way to utilize amounts of mechanical energy that are higher than the typical values using locally available technology. The second index (MI) elaborated here is an expression of the deviation of the actual amount of motorized farm work from the normal values at the regional level. This index is based on the premise that a mechanized farmer is the one that finds a way to utilize amounts of mechanical energy that are higher than the typical values using locally available technology. It is implied that these technologies are mechanized agricultural practices that have been successfully incorporated into the farming systems. The Artificial Neural Network (ANN) gives estimations of the mechanization indicators using limited data available from the target region, without the need to calculate them directly, which would require more data. The model is based on statistical analyses of actual data, and enables us to distinguish between necessary and unnecessary items of raw data. A fundamental hypothesis of this study is being feasible to train an ANN model to establish a non-explicit function, which corresponds to the ANN network itself, between a selected set of simple inputs, such as farm size, and number of tractors owned, and two mechanization indicators as the outputs. Recently, Zangeneh et al., 2010 have investigated energy use pattern in potato produ- ction in Hamadan province of Iran. The population investigated was divided into two strata base d on tractor and farm machinery ownership and level of farming technology. Group I farmers were owners of agricultural machinery and practiced high level of farming technology, whereas Group II were non-owners of machinery and exercised low level of farming technology. Total energy consumption of Group I and group II was 157.151 and 153.071 GJ ha-1, respectively which we studied Group I in present paper. The amount of nonrenewable energy (NRE) in both groups was rather high. They concluded a reduction in the total NRE ratio, specifically in chemical fertilizer usage would have positive effects on the sustainability of potato production as well as other positive environmental effects. In this sequel, we develop ANN models to predict mechanization indicators for potato production based on energy and power consumption. The potential practical application of this work consists on mapping the proposed mechanization indicators for a much wider area without direct calculations. Further analysis based on the interrelation between the produced data with complementary parameters already available in local databases, would contribute to assess the mechanization status in the region. Material and methods Data source and processing The study was carried out on potato farms in Hamadan province, Iran. Data were collected from the farmers by using a face to face questionnaire method. The additional materials used in this study were collected from the previous studies and publications by some institutions like FAO. The sample size was determined using Cochran technique as 68 farms (Snedecor and Cochran, 1989). The original data set was consisted of 300 explanatory parameters for each farm covering all character- istics of farming system of potato in the region. To assess the technological status and the agricultural production strategies, the farming system was analyzed according to its energy input-output flow and consumption of power for various farming operations. Socio-economic structures of farms Socio-economic structure of studied farms is shown in Table 1. According to the results of this table, maximum yield was 62.95 tonnes ha-1. Average farm size was 23.06 ha, and potato produ- ction occupied 22.67% of total farm lands. The other vegetables grown besides potato were wheat, alfalfa, corn and barley. Input and output parameters Based on data availability and how representative they were of all the data, a set of input items, including 94 explanatory parameters, were chosen as the first candidate set of the input items for all outputs. Underlying distributions were uniform, representativeness was checked. Using a regression method (forward method) different collec tion of input items selected for two mechanization indicators as outputs. Because the items have different scales, the data were normalized by converting them using natural logarithm to maintain the neural network sensitivity as per Drummond et al., 2003, Abdullakasim et al., 2005, Zhang et al., 1998. Table 2 shows the selected parameters fed into the ANN model during the training process. These items represent key factors of the farming system and were identified as factors in the mechanization status. They produced superior performance during the teaching process. Fig 1. The structure of multilayer feed forward neural network Item Value Population (person) 5.2 Age of farmer (year) 42.56 Total area (ha) 101.71 Potato area (ha) 23.06 Number of potato plots 1.86 Number of crop planted 4.12 Maximum yield (tonnes ha-1) 62.95 Tractor ownership (number) 2.10 374 Table 2. Selected input items for characterizing LOM and MI of farming system of potato Outputs Input items LOM MI 1 Area under potato cultivation (ha) Experience of potato production (year) 2 Number of potato plots Total land size (ha) 3 Total power of tractors (hp) Area under potato cultivation (ha) 4 Total number of tractors Number of land preparation operations 5 Number of land preparation operations Required time for hand collecting and bagging potatoes 6 Required time for hand collecting and bagging potatoes per hectare Cost of fixed labor per year ($) 7 Cost of fixed labor per year ($) Number of non-subsidized potassium fertilizer (50 kg bag) 8 Number of non-subsidized nitrogen fertilizer (50 kg bag) Amount of insecticides (L ha-1) 9 Amount of insecticides (L ha-1) Number of irrigation 10 Number of irrigation Total hours of farm machinery work (h ha-1) 11 Sum of fixed costs ($ ha-1) Cost of seed ($ ha-1) 12 - Equal energy of seed (MJ ha-1) Definitions of mechanization indices Based on the general concept of mechanization (Sims, 1987) and the structure of ANN models, two mechanization indi- cators, namely, Mechanizatio n Index (MI) and Level of Mechanization (LOM), were chosen to characterize farming system of potato in the target region. These indicators are defined mathematically as equations (1) and (2). The MI elaborated here is an expression of the deviation of the actual amount of motorized farm work from the normal values at the regional level. ∑==ni iiavi eTLLMMnMI1) (.1 (1) where: MI = Mechanization Index for the production unit `a`, Me(i) = Overall input energy due to machinery in the production unit `a`, Mav = Regional-average energy due to machinery, Li = Land area cultivated in the production unit `a`, TLi = Total farm land ownership of production unit `a`, n = Number of farms . The MI index, proposed by Andrade and Jenkins, 2003 is an indication of the amount of mach inery a given farmer uses for farm work compared with the average in the region. The second term in Eq. (1) includes a ratio between the land area cultivated with potato crop and the total land ownership. This term was introduced because it reflects the importance of land demand for cultivation. The LOM index is based on the premise that a mechanized farmer is the one that finds a way to utilize amounts of mechanical energy that are higher than the typical values using locally available technology. ∑=×=niiiLPLOM1η (2) where: LOM = level of mechanization, Pi= power of tractors , η = correction factor for utilized power (0.75). Field capacity was multiplied by rated power so the quantification of energy expenditure was made in work units (kWh). The regional normal was obtained after compiling a full dataset of all respondents and then it was defined the mode for the number of passes for each operation as well as the mode in tractor size and field capacity. It is implied that these technologies are mechan- 00.20.40.60.8112345678910Number of ANN modelR2Lo M MI Fig 2. Correlation between the ANN model’s outputs and calculated outputs -6-4-20241 2 3 4 5 6 7 8 9 10111213141516Number of farms (tes ting data s et)MAP E (% ) Fig 3. MAPE in the MI estimation over the test set 375 Table 3. Alternative configuration of ANN models for LOM and MI Outputs # ANN model # Hidden layer # Neurons MSE MAE MAPE R2 1 1 2 0.0792 0.1467 8.4137 0.839 2 1 3 0.0234 0.0692 1.8151 0.992 3 1 4 0.0202 0.0735 6.0146 0.965 4 1 5 0.0256 0.1300 6.6020 0.923 5 1 6 0.0240 0.0772 2.1966 0.990 6 1 7 0.0040 0.0536 2.7926 0.985 7 1 10 0.0068 0.0705 3.2190 0.969 8 1 12 0.0357 0.1558 9.8229 0.915 9 2 4-4 0.0250 0.0917 6.9694 0.954 LOM 10 2 4-8 0.0317 0.0994 6.9999 0.936 1 1 2 0.0655 0.1808 3.0602 0.921 2 1 3 0.0246 0.1291 2.5296 0.970 3 1 4 0.0953 0.2366 3.7092 0.978 4 1 5 0.0168 0.1122 2.2225 0.988 5 1 6 0.0532 0.1625 2.8564 0.974 6 1 8 0.0093 0.0733 1.3859 0.989 7 1 10 0.0166 0.0821 1.4996 0.988 8 1 12 0.0190 0.1167 2.3176 0.984 9 2 4-5 0.0450 0.1749 3.2799 0.951 MI 10 2 8-8 0.1095 0.1863 3.0602 0.921 ized agricultural practices that have been successfully incorpo- rated into the farming systems. During the interview, data was recorded on all the mechanized operations performed by farmers in the sample providing an estimation of the field capacity (hours of work per unit land). The ANN models were trained to output these indicators from the data of the 19 input parameters, included in Table 2. Multilayer Feed forward ANN There are multitudes of ANN structures and different classification frameworks. For examples, ANN could be classified according to the l earning method or to the organi- zation of the neurons (Chester, 1993). The one that have been used in this work is called Multi Layer Perception (MLP), in which neurons are organized in several layers: the first is the input layer (fed by a pattern of data), while the last is the output layer (which provides the answer to the presented pattern). Between input and output layers there could be several other hidden layers (see Fig. 1). The number of hidden layers has an important role in determining the generalization ability of the MLP. MLP represents a tool, which is able to identify the relationships between different data sets, although the form of these relationships is not defined exactly. For this reason they are called ‘‘universal approximation or regression tools’’ (Hornik et al., 1989). The ANN model was calibrated using the Neural Solutions 5.0 software package. During the calibration process, 80 architecture combinations were trained. Variations of the back propagation learning algorithm were applied. As presented by Zhang et al., 1998, the square error of the estimates between the observed and actual output is fed-back through the network causing changes of the weights, with the purpose of preventing that the same error will happen again. Batch back propagation provided smooth curves, with results generally better than those of the other training back propagation methods. At this stage, results from cross-validation analysis in relation to network size and number of training cycles were analyzed to select the best combination to keep the model simple. The data sets of the 68 farm patterns were divided randomly into three subsets, containing 41 patter- ns for training, 16 patterns for MI and 15 patterns for LOM testing, and 10 patterns for the validation phase. The number of patterns in the training subset was set to about 60% of the total data, as per Zhang et al., 1998. Extraction of the training subset was repeated several times at random to check the quality of the trained networks generated, as indicated by high R2 and low MSE, MAE and MAPE values and a wide range of outputs. The validity of the model was checked by comparing its output values with those calculated using equations 1 and 2, calculating and comparing mean squared error (MSE), mean absolute error (MAE), mean absolute percentage error (MAPE) and determination coefficient (R2). The values of the coeffi- cients of ANN models that have been assigned in order to minimize the MAPE are defined in Eq. (3): ∑=−=ni inMAPE1i i) 100yyˆ y(1 (3) where yˆ and y are predicted and actual value, and n is the total number of predictions. Results Number of hidden units To determine the optimal architecture of the ANN, the architectures of networks with hidden units ranging from one to 20 were trained, tested and validated. The validation subset contained 10 patterns, which were not used in the training and testing phases. This subset was used to test the correlation between the values of the outputs given by the ANN model and those calculated from Eqs. (1) and (2). Network architectures with hidden units ranging from one to 20 were simultaneously trained and tested with the respective subsets. The accuracy of the ANN model was validated using the testing data set. Figure 2 shows the correlation between the model’s output and actual (calculated) output. In general, networks containing two to eight hidden units showed better performance. Various numbers of hidden layers were tested, but single hidden layer networks showed better results. Furthe rmore, ANN architecture with 376-9-6-3036123456789101112131415Number of farmsMAP E (% ) Fig 4. MAPE in the LOM estimation over the test set Fig 5. Sensitivity analysis of ANN inputs on LOM. ( a ) area under potato cultivation, ( b) number of potato plots, ( c ) total power of tractors, (d) total number of tractors, ( e ) number of land preparation operations, (f) required time for hand collecting and bagging potatoes, ( g ) cost of fixed labor, ( h) number of non-subsidized nitrogen fertilizer, (i ) amount of insecticides, ( j ) number of irrigation, and ( k) sum of fixed costs. fewer hidden layers can avoid over fitting problems observed during the trial-and-error pro cedure. Therefore, ANN with single hidden layer was selected for further investigation. Based on performance measures, single hidden layers with 3 and 8 neurons in the hidden layer were finally selected as optimum configuration for predicting LOM and MI, respectively. The related character istics of ANN architectures’ are given in Table 3. The testing data set was also tested for the MAPE, MAE and MSE. Data patterns that generated great errors and recognized to be outlier for all of the indicators were rejected from the original data set in order to determine a boundary which represents the applic able range of the ANN model. Rejecting such data patterns enhances the network forecasting capability. The remaining, representative values of the items redefine our target farming system. The rejected data sets were regarded as those that could be analyzed under different farming systems. Figures 3 and 4 show MAPE of the two output indicators for each validation pattern, obtained by comparing the outputs of the best ANN model of every desired output and the outputs calculated using Eqs. (1) and (2) (actual model). |
[返回] |