Short-Term Load Forecasting for Electric Bus Charging Stations Based on Fuzzy Clustering and Least Squares Support Vector Machine Optimized by Wolf Pack Algorithm

: Accurate short-term load forecasting is of momentous signiﬁcance to ensure safe and economic operation of quick-change electric bus (e-bus) charging stations. In order to improve the accuracy and stability of load prediction, this paper proposes a hybrid model that combines fuzzy clustering (FC), least squares support vector machine (LSSVM), and wolf pack algorithm (WPA). On the basis of load characteristics analysis for e-bus charging stations, FC is adopted to extract samples on similar days, which can not only avoid the blindness of selecting similar days by experience, but can also overcome the adverse effects of unconventional load data caused by a sudden change of factors on training. Then, WPA with good global convergence and computational robustness is employed to optimize the parameters of LSSVM. Thus, a novel hybrid load forecasting model for quick-change e-bus charging stations is built, namely FC-WPA-LSSVM. To verify the developed model, two case studies are used for model construction and testing. The simulation test results prove that the proposed model can obtain high prediction accuracy and ideal stability.


Introduction
In recent years, low-carbon cities have become a common pursuit around the world, which is faced with increasing energy crises and environmental problems [1]. Electric buses (e-buses) have developed quickly with the burgeoning construction of low-carbon cities [2]. As important supporting facilities, charging stations bring new challenges to optimal dispatching and safe operation of the power grid due to great volatility, randomness and intermittence of the load [3]. Therefore, it is of great significance to conduct research on load characteristics analysis and short-term load forecasting. On one hand, this contributes to the optimal combination of generator units in terms of power system, economical dispatch, optimal power flow and electricity market transactions. On the other hand, it provides a decision basis for construction planning, energy management, orderly charging and economical operation for charging stations, which can guarantee and promote the development of low-carbon cities. Therefore, research on short-term load forecasting for quick-change e-bus charging stations has been conducted to provide data support and a theoretical basis for the large-scale construction of charging stations.
Nowadays, scholars have conducted a large amount of research on load forecasting for charging stations. The prediction methods are primarily divided into two categories: traditional forecasting approaches, such as time series [4], regression analysis [5], and fuzzy prediction [6], and artificially intelligent algorithms. Conventional prediction methods aiming at load forecasting poor stability and difficulty in coping with nonlinear constraints. The poor accuracy of local search of PSO cannot fully satisfy the need of parameter optimization in LSSVM. The shortcoming of CS and BA is that they easily fall into local optimums, leading to reduction in prediction accuracy. Wolf pack algorithm (WPA), as a new metaheuristic approach, is introduced in this paper to optimize the parameters in LSSVM. This technique possesses good global convergence and computational robustness due to insensitivity of the change of parameters in WPA [24].
As a result of the complexity and diversity of the influential factors for load forecasting in quick-change e-bus charging stations, it is of great necessity to select proper inputs for the prediction, so that redundant data can be reduced and computing efficiency can be improved [25]. Fuzzy clustering (FC) is a mathematical technique that classifies objects according to their characteristics [26]. In view of the fact that the daily load curves with similar influential factors of charging stations are basically consistent, good prediction results can be achieved by the use of samples on similar days. Consequently, a transitive closure algorithm grounded on a fuzzy equivalent matrix in FC is selected in this paper, which can extract samples similar to the predicted day. It can not only avoid the blindness of choosing similar days by experience, but also overcome the adverse effects of unconventional load data caused by sudden change of factors on LSSVM training. Therefore, the influential factors for the load in quick-change e-bus charging stations are analyzed in this paper, and a load forecasting model combining FC with LSSVM and optimized by WPA (FC-WPA-LSSVM) is established here. The rest of paper is organized as follows: Section 2 conducts an analysis of the daily load characteristics for quick-change e-bus charging stations based on related statistical data and studies various influential factors including day types, meteorological conditions and bus dispatch; Section 3 provides a brief description of FC, LSSVM and WPA, as well as the complete prediction framework; Section 4 introduces an experimental study to validate the proposed method; and Section 5 makes further validation. In Section 6, conclusions are obtained.

Analysis of Load Characteristics of E-Bus Charging Stations
The load of a large quick-change e-bus charging station in Baoding, China, is provided in this paper. When the bus comes into the station, the battery with electricity depletion is changed by the quick-change robot, which is further connected to the charging platform. Then, a battery filled with electricity is installed in the bus. After that, the e-bus goes into a specific area to wait for dispatch instructions. According to the dispatch, the e-bus appears at the charging station after 8:00 a.m. each day, which leads to a rise in load. The chargers will not stop working until the battery charging of the last e-bus is completed. At that time, the load decreases to the lowest point.
A typical daily load curve of the e-bus charging station is shown in Figure 1, which displays the active power per hour in a day. In common with the traditional load curve, there exist obvious crests and troughs. However, the curve of the e-bus charging station fluctuates greatly, and apparent distinctions appear among different curves, whereby the load in winter and summer is high, while the load in spring and autumn is low. All of these characteristics create difficulties for the daily load forecasting of the charging station.
the last e-bus is completed. At that time, the load decreases to the lowest point.
A typical daily load curve of the e-bus charging station is shown in Figure 1, which displays the active power per hour in a day. In common with the traditional load curve, there exist obvious crests and troughs. However, the curve of the e-bus charging station fluctuates greatly, and apparent distinctions appear among different curves, whereby the load in winter and summer is high, while the load in spring and autumn is low. All of these characteristics create difficulties for the daily load forecasting of the charging station.  The load is influenced by various factors. Here, three variables, including day types, meteorological conditions and e-bus dispatching, are selected. Unlike traditional motor vehicles, the source of power for electric buses is all electric power. When there is a traffic jam, there is no energy loss for electric buses. Therefore, traffic congestion factors do not affect the load of charging stations.

Day Types
E-bus charging stations serve the electricity supply of urban e-buses. In accordance with the habits and demands of citizens, the scheduling of e-buses between weekdays and weekends is different across the week, which also results in obvious differences in the load curve. Table 1 displays the annual mean of daily maximum load and daily average load for the e-bus charging station in Baoding in 2016 on the basis of day types. It can be seen that the loads on workdays are relatively higher than those on weekends. Thus, a week can be divided into two categories, namely workdays, including Monday to Friday, and weekends, which contain Saturday and Sunday. Special holidays, such as Dragon Boat Day, Labor Day or National Day, can be separated as a new type alone.

Meteorological Conditions
Data related to meteorological conditions and the power load of Baoding from August 16 to September 15, 2017 (31 days in total) are collected and shown in Figure 2. The meteorological conditions include the daily maximum temperature, daily weather, daily average wind speed and daily average humidity. In the daily weather condition, "1" is used to represent a sunny day, "2" is used to represent cloudy day, and "3" is used to represent a rainy or snowy day. As can be seen in Figure 2, there is a significant positive correlation between daily maximum temperature and power load, and weather and power load show a negative correlation. However, there is no obvious relationship between the average wind speed factor and load, and the average humidity factor is similar. Thus, it can be found that the load of e-bus charging stations is remarkably affected by temperature, as well as by rainy and snowy days, while the influence of other meteorological conditions such as humidity and wind speed is so weak that they can be omitted. Therefore, temperature and rainy and snowy days are selected as influential indicators in this paper. Similar to traditional power loads, the daily load of the charging station increases owing to the use of air conditioners on e-buses when the temperature change of coldness and warmth is aggravated. Since temperature has an important influence on battery capacity, as well as on the charging and discharging process, the charging time is diverse at different temperatures, which also leads to distinct trends of load. The daily load curves from September 12 to 14, 2017 are taken as an example, in which the total number of charged e-buses in these three days was about 60 and the maximum temperature dropped from 35 to 24. As seen in Figure 3, the violent fluctuation of air temperature in adjacent days causes great changes in daily load curves. Thus, it is necessary to take temperature as an influential factor in the selection of subsequent similar day samples. Taking the daily load curves on August 29 and August 30 in 2017 as an example, weather conditions can be divided into sunny days and rainy days. Figure 4 illustrates the relationship between weather conditions and the daily load of the charging station. It proves that daily maximum Similar to traditional power loads, the daily load of the charging station increases owing to the use of air conditioners on e-buses when the temperature change of coldness and warmth is aggravated. Since temperature has an important influence on battery capacity, as well as on the charging and discharging process, the charging time is diverse at different temperatures, which also leads to distinct trends of load. The daily load curves from September 12 to 14, 2017 are taken as an example, in which the total number of charged e-buses in these three days was about 60 and the maximum temperature dropped from 35 to 24. As seen in Figure 3, the violent fluctuation of air temperature in adjacent days causes great changes in daily load curves. Thus, it is necessary to take temperature as an influential factor in the selection of subsequent similar day samples. charging and discharging process, the charging time is diverse at different temperatures, which also leads to distinct trends of load. The daily load curves from September 12 to 14, 2017 are taken as an example, in which the total number of charged e-buses in these three days was about 60 and the maximum temperature dropped from 35 to 24. As seen in Figure 3, the violent fluctuation of air temperature in adjacent days causes great changes in daily load curves. Thus, it is necessary to take temperature as an influential factor in the selection of subsequent similar day samples. Taking the daily load curves on August 29 and August 30 in 2017 as an example, weather conditions can be divided into sunny days and rainy days. Figure 4 illustrates the relationship between weather conditions and the daily load of the charging station. It proves that daily maximum load decreases on rainy and snowy days on account of the deceleration of e-buses, which leads to a decrease in the daily driving mileage and charging times as well as the reduction of total load in the Taking the daily load curves on August 29 and August 30 in 2017 as an example, weather conditions can be divided into sunny days and rainy days. Figure 4 illustrates the relationship between weather conditions and the daily load of the charging station. It proves that daily maximum load decreases on rainy and snowy days on account of the deceleration of e-buses, which leads to a decrease in the daily driving mileage and charging times as well as the reduction of total load in the charging station. To this end, rainy and snowy days are another vital factor that affects the load characteristics of e-bus charging stations.

Bus Dispatching
The scheduling of departure time and off-running time is a momentous task for bus operation companies. In light of the daily plan of bus dispatching, different charging intensities of e-buses in the station cause changes in the daily load curve in the charging station at different periods. Moreover, diverse demands of the public, traffic jams, and sudden situations require the addition of temporary e-buses to enhance transport capacity, which brings about changes in bus scheduling on different days. Bus dispatching is one of the direct reasons for the fluctuation of daily load curve and the distinction of load curves among days. According to the dispatch plan made in advance, the total number of e-buses that need to be charged on a predicted day can be estimated; namely, the accumulated number of e-buses charged daily, which is used as an indicator to reflect the effect of bus dispatching on the load of the quick-change e-bus charging station.

Bus Dispatching
The scheduling of departure time and off-running time is a momentous task for bus operation companies. In light of the daily plan of bus dispatching, different charging intensities of e-buses in the station cause changes in the daily load curve in the charging station at different periods. Moreover, diverse demands of the public, traffic jams, and sudden situations require the addition of temporary e-buses to enhance transport capacity, which brings about changes in bus scheduling on different days. Bus dispatching is one of the direct reasons for the fluctuation of daily load curve and the distinction of load curves among days. According to the dispatch plan made in advance, the total number of e-buses that need to be charged on a predicted day can be estimated; namely, the accumulated number of e-buses charged daily, which is used as an indicator to reflect the effect of bus dispatching on the load of the quick-change e-bus charging station.

Fuzzy Clustering
FC analysis is a mathematical technique that achieves classification of objects through the establishment of fuzzy similarity relations based on their characteristics, familiarity and comparability. The fuzzy equivalent matrix dynamic clustering method is implemented in this paper.
Suppose n samples on the predicted day, that is The specific steps of FC can be explained as follows: (1) Data standardization. Considering different dimensions and orders of magnitude, the data must be standardized as Equation (1) [27].
where x jk is the raw data, x kmin and x kmax are the minimum and maximum of x 1k , x 2k , · · · , x nk , respectively, x jk is the standardized data.
(2) Establishment of fuzzy similarity relation matrix. In order to measure the comparability of the classified samples, a fuzzy similarity relation matrix R = r ij needs to be constructed by similarity of coefficient approach, distance or closeness. An absolute value index method is introduced here [28], as expressed in Equation (2).
Then the transitive closure R * of R can be obtained by square synthesis.
(3) Dynamic clustering. Select an appropriate threshold L to separate R * . The clustering results are up to the level of L. When L drops from 1 to 0, a dynamic clustering graph is obtained by changing the rough classification to a fine one. The best value of L can be acquired based on its change rate [29].
where i is the clustering order of L in a descending form; n i and n i−1 are the number of elements in i and i − 1 clusters, respectively; L i and L i−1 are the confidence levels in i and i − 1 clusters, respectively. If C i = max(C j ), L i is treated as the best threshold. Thus, n samples can be separated into several categories and each type contains a different number of samples.
(4) Classification recognition. The category consistent with the forecasted day needs to be identified after sample classification. The Euclidean distance is calculated between the predicted day and the above categories one by one [26]: where x ik is the characteristic vector on the predicted day, x jk represents the characteristic vector of each category. This paper takes the type with the shortest Euclidean distance as the classification of the forecasted day to make the prediction.

Least Squares Support Vector Machine
As an extension of SVM, LSSVM transforms the inequality constraints into equality ones and converts quadratic programming problems into linear equation ones, which is conducive to the improvement of convergence speed [30].
Set the training samples as where N is the total number of samples. The regression model can be expressed as follows [31]: where ϕ() is a function that maps the training samples into a highly dimensional space, w and b represent the weight and bias, respectively. For LSSVM, the optimization problem can be defined as Equation (6) [32]: where γ is the regularization parameter that balances the complexity and precision of the model. ξ i equals the error.
To obtain the solution, the Lagrange function can be established as Equation (8).
where α i is the Lagrange multipliers. Take the derivatives of each variable in the function and make them equal zero: Eliminate w as well as ξ i and transform it into the following problem: where The solution can be obtained based on the linear equations above: where K(x i , x) is the kernel function that meets Mercer's condition. The radial basis function (RBF) is employed as the kernel function here on the basis of its wide convergence region and extensive application scope, as shown in Equation (16).
where σ 2 represents the kernel parameter that reflects the characteristic of training samples and has influence on generalization ability of the technique. As we can see, the performance improvement of LSSVM model is greatly dependent on the appropriate setting of the following parameters: regularization parameter γ and kernel parameter σ 2 [33].

Wolf Pack Algorithm
In consideration of the blindness of manual selection in LSSVM model parameters, the optimal value of regularization parameter γ and kernel parameter σ 2 of LSSVM is obtained through the wolf pack algorithm. The WPA technique is inspired by research on the hunting behaviors of wolves [34]. According to their roles in hunting, wolves can be divided into three types: head wolves, safari wolves and feral wolves, who work together to complete the task. Random walk, call to action and siege are three main behaviors of wolves, which are simulated in the WPA model. The determination of the head wolf and the replacement of the wolf pack follow the common rules that the "winner is the king" and "the survival of the fittest", respectively [35]. WPA is illustrated in Figure 5. The principle and steps of WPA are summarized as follows [36]: (1) Initialize wolf pack. Suppose in D dimensional space, there are N wolves, wherein the location of the i -th wolf is set as: The initial position is generated as Equation (18) The principle and steps of WPA are summarized as follows [36]: (1) Initialize wolf pack. Suppose in D dimensional space, there are N wolves, wherein the location of the i-th wolf is set as: The initial position is generated as Equation (18): where rand represents random numbers within the range [0,1], and x max and x min are the upper limit and lower limit of the search space, respectively.
(2) Generate the head wolf. The wolf at Y lead with the best target function is selected as the head one. The head wolf does not update its position in the hunting process or participate in hunting; instead, it is directly iterated. If Y lead < Y i , Y lead = Y i , where Y i represents the location of the safari wolf i. Otherwise, the safari wolf i randomly walks in h directions until the maximum value H is achieved or the location cannot be further optimized; then the search is stopped. y ijd is the location at j-th point in d-th dimension of the i-th wolf.
(3) Keep close to the prey. The head wolf pushes the wolf pack to update their positions through call to action. The new position of the i-th wolf in d-dimension is described as Equation (20): where step a is the step length of wolves in search, step b represents the step length of wolves towards the target, x id and x lid are the location of the i-th wolf and the corresponding head wolf in d-dimension, respectively. (4) Encircle the prey. The head wolf sends signals to the surrounding wolf pack after finding the prey so that the encirclement and suppression of the target prey can be completed, as shown in Equations (21) and (22): where t equals the number of iterations, ra is the step length at the time of encirclement and suppression, X i is the location of the head wolf that sends the signal, and X t i is the location of the i-th wolf in the t-th iteration.
(5) The mechanism of competition and regeneration of the wolf pack. In encirclement and suppression, the wolves that fail to get food will be eliminated and the rest of wolves will be retained. Simultaneously, new wolves are randomly generated in the same number as the eliminated ones.
(6) Judge whether the maximum number of iterations has been reached. If the maximum number of iterations has been reached, the position of the wolf is output; that is, the optimal value of the LSSVM's parameters. If the maximum number of iterations has not been reached, then return to step 2.

Establishment of the Hybrid Forecasting Model
This paper firstly analyzes the influential load factors for quick-change e-bus charging stations, and FC is implemented to extract similar days to the predicted one as the training samples. Then, WPA is integrated with the LSSVM model to obtain the optimal values of γ and σ 2 . Finally, an analysis is performed on the forecasting results. The framework of the proposed hybrid approach is displayed in Figure 6.

Establishment of the Hybrid Forecasting Model
This paper firstly analyzes the influential load factors for quick-change e-bus charging stations, and FC is implemented to extract similar days to the predicted one as the training samples. Then, WPA is integrated with the LSSVM model to obtain the optimal values of  and 2  . Finally, an analysis is performed on the forecasting results. The framework of the proposed hybrid approach is displayed in Figure 6.

Initialize the location of wolf pack and parameters of LSSVM
Select the wolf at the location with the best target function as the head one The head wolf pushes wolf pack to update their positions through call to action and keep close to the prey The head wolf sends signals to wolf pack after finding the prey so as to complete the encirclement and suppression The wolves that fail to get food will be eliminated and randomly generate new wolves with the same number

Case Study
Base on the daily load, meteorological data and operation information of an e-bus charging station in Baoding, China, in 2017, a case study was carried out for the purpose of demonstrating the efficiency of the proposed model in load forecasting for e-bus charging station. The load data was provided by State Grid Hebei Electric Power Company in China, and the input data was provided by the local meteorological department. This paper adopts Matlab R2014b (Gamax Laboratory Solutions Kft., Budapest, Hungary) to program, and as for the test platform environment, an Intel Core i5-6300U (Intel Corporation, Santa Clara, CA, USA), 4G memory and Windows 10 Professional (Microsoft corporation, Redmond, WA, USA) Edition system was used. In order to eliminate the particularity of the target days and examine the generalization performance of the established technique, the data for one day from each of the four seasons was selected as test samples; that is, April 15, July 15, October 15 and January 15 were chosen as test samples for spring, summer, autumn and winter, respectively.

Input Selection and Pre-Processing
Based on the analysis of load characteristics in the e-bus charging station in Section 2, a set of eight variables was used as the input, including day type, maximum temperature, minimum temperature, weather condition, the accumulated daily number of charged e-buses and the loads at the same moment in the previous three days. Days can be divided into three categories: workdays (Monday to Friday), weekends (Saturday and Sunday) and legal holidays were valued at 1, 0.5 and 0, respectively. Weather conditions were separated into two types, where sunny and cloudy days were valued at 1, and rainy and snowy days were valued at 0.5. The loads at the same moment in the previous three days refer to those nearest the predicted day in similar samples after clustering according to the rule that "Everything looks small in the distance and is big on the contrary." The temperature, load data, and daily accumulated charged e-buses should be normalized as presented in Equation (1).

Model Performance Evaluation
It's important to effectively evaluate the load forecasting results for e-bus charging stations, and the performance of the prediction models is usually assessed by statistical criteria: the relative error (RE), root mean square error (RMSE), mean absolute percentage error (MAPE) and average absolute error (AAE). The smaller the values of these four indicators are, the better the forecasting performance is. In addition, the indicators named RMSE, MAPE and AAE can reflect the overall error of the prediction model and the degree of error dispersion. The smaller the values of these three indicators are, the more concentrated the distribution of errors is. The four generally adopted error criteria are displayed as follows: (1) Relative error (RE) (2) Root mean square error (RMSE) (3) Mean absolute percentage error (MAPE) (4) Average absolute error (AAE) where x andx are the actual load and the forecasted one of charging station, respectively; n equals the number of groups in the dataset. The smaller these evaluation indicators are, the higher the prediction accuracy is.

Results Analysis
The parameters of the proposed model are set as: the total wolf pack N = 50, iteration number t = 100, step a = 1.5, step b = 0.8, q = 6, h = 5. The forecasting results are shown in Figure 7. equals the number of groups in the dataset. The smaller these evaluation indicators are, the higher the prediction accuracy is.

Results Analysis
The parameters of the proposed model are set as: the total wolf pack 50  N , iteration number . The forecasting results are shown in Figure 7. As can be seen from Figure 7, the proposed model is very close to the actual load curve in each season and has a good degree of fit. Figure 8 shows the relative error of the prediction results. It can be seen that the relative error of the prediction results of the FC-WPA-LSSVM model is controlled within the range [−3%, 3%], and the degree of deviation is acceptable.

Discussion
In order to verify the performance of the forecasting approach, three basic techniques, including WPA-LSSVM [37], LSSVM [38], and BPNN [39], were introduced to make a comparison. The parameter settings in WPA-LSSVM were consistent with those in the established model. In LSSVM, As can be seen from Figure 7, the proposed model is very close to the actual load curve in each season and has a good degree of fit. Figure 8 shows the relative error of the prediction results. It can be seen that the relative error of the prediction results of the FC-WPA-LSSVM model is controlled within the range [−3%, 3%], and the degree of deviation is acceptable. where x and x are the actual load and the forecasted one of charging station, respectively; n equals the number of groups in the dataset. The smaller these evaluation indicators are, the higher the prediction accuracy is.

Results Analysis
The parameters of the proposed model are set as: the total wolf pack 50  N , iteration number . The forecasting results are shown in Figure 7. As can be seen from Figure 7, the proposed model is very close to the actual load curve in each season and has a good degree of fit. Figure 8 shows the relative error of the prediction results. It can be seen that the relative error of the prediction results of the FC-WPA-LSSVM model is controlled within the range [−3%, 3%], and the degree of deviation is acceptable.

Discussion
In order to verify the performance of the forecasting approach, three basic techniques, including WPA-LSSVM [37], LSSVM [38], and BPNN [39], were introduced to make a comparison. The parameter settings in WPA-LSSVM were consistent with those in the established model. In LSSVM,

Discussion
In order to verify the performance of the forecasting approach, three basic techniques, including WPA-LSSVM [37], LSSVM [38], and BPNN [39], were introduced to make a comparison. The parameter settings in WPA-LSSVM were consistent with those in the established model. In LSSVM, the regularization parameter γ and the kernel parameter σ 2 were valued at 12.6915 and 12.0136, respectively. In BPNN, tansig was utilized as the transfer function in the hidden layer, and purelin was employed as the transfer function in the output layer. The maximum number of convergence was 200, the error was equal to 0.0001, and the learning rate was set as 0.1. The determination of the initial weights and thresholds depend on their own training. Figure 9 illustrates the load forecasting results of FC-WPA-LSSVM, WPA-LSSVM, LSSVM and BPNN. Figure 10 presents the values of RE for each prediction method.
respectively. In BPNN, tansig was utilized as the transfer function in the hidden layer, and purelin was employed as the transfer function in the output layer. The maximum number of convergence was 200, the error was equal to 0.0001, and the learning rate was set as 0.1. The determination of the initial weights and thresholds depend on their own training. Figure 9 illustrates the load forecasting results of FC-WPA-LSSVM, WPA-LSSVM, LSSVM and BPNN. Figure 10 presents the values of RE for each prediction method. From Figure 9 and 10, it can be seen that the prediction error range of FC-WPA-LSSVM was controlled to within [−3% + 3%], where the minimum error (7:00 in the spring test) and the maximum error (18:00 in the autumn test) were 0.08% and −2.98%, respectively. Among them, 10 error points of the results were within [−1%, 1%], namely 7:00, 11:00 and 16:00 in the spring test, 1:00, 2:00, 9:00, 16:00, 23:00 in the summer test, 6:00 in the autumn test, 19:00 in the winter test; the corresponding values of RE were 0.08%, −0.49%, −0.52%, −0.71%, −0.98%, −0.74%, −0.85%, 0.71%, −0.81% and 0.31%, respectively. In addition, 19 error points of WPA-LSSVM were controlled to within [−3%, 3%], while the corresponding number for LSSVM was 17, of which 2 points of WPA-LSSVM were within the range [−1%, 1%], namely at 10:00 in the spring test (RE = −0.86%) and 9:00 in the winter test (RE = − 0.79%), but all error points of LSSVM were outside the range [-1%, 1%]. The minimum errors of WPA-LSSVM and LSSVM were −0.79% and −1.07% respectively, while their maximum errors were 6.6% and −7.59%, respectively. The errors of the BPNN model were mostly within the ranges [−6%, −4%] or [4%, 6%], where the maximum and minimum of RE were individually equal to 1.36% and 8.73%, respectively. In this regard, the forecasting accuracy ranked from the highest to the lowest was: FC-WPA-LSSVM, WPA-LSSVM, LSSVM, and BPNN. Hence, FC can effectively avoid the blindness in the selection of similar days through experience. In contrast with LSSVM, administering WPA improves the prediction precision by virtue of the parameter optimization of LSSVM. It is without doubt that the forecasting accuracy of some points in FC-WPA-LSSVM is worse than the other three  The performance comparison results of the forecasting models were measured by RMSE, MAPE and AAE, as presented in Figure 11. This demonstrates that the proposed approach outperforms the other models in terms of all the evaluation criteria, of which RMSE, MAPE and AAE of FC-WPA-LSSVM were equal to 2.20%, 2.09% and 2.09%, respectively. This is mainly due to the fact that FC can overcome the adverse effects of unconventional load data caused by factor mutation on LSSVM training, and WPA improves the generalization ability and prediction accuracy by parameter From Figures 9 and 10, it can be seen that the prediction error range of FC-WPA-LSSVM was controlled to within [−3% + 3%], where the minimum error (7:00 in the spring test) and the maximum error (18:00 in the autumn test) were 0.08% and −2.98%, respectively. Among them, 10 error points of the results were within [−1%, 1%], namely 7:00, 11:00 and 16:00 in the spring test, 1:00, 2:00, 9:00, 16:00, 23:00 in the summer test, 6:00 in the autumn test, 19:00 in the winter test; the corresponding values of RE were 0.08%, −0.49%, −0.52%, −0.71%, −0.98%, −0.74%, −0.85%, 0.71%, −0.81% and 0.31%, respectively. In addition, 19 error points of WPA-LSSVM were controlled to within [−3%, 3%], while the corresponding number for LSSVM was 17, of which 2 points of WPA-LSSVM were within the range [−1%, 1%], namely at 10:00 in the spring test (RE = −0.86%) and 9:00 in the winter test (RE = − 0.79%), but all error points of LSSVM were outside the range [−1%, 1%]. The minimum errors of WPA-LSSVM and LSSVM were −0.79% and −1.07% respectively, while their maximum errors were 6.6% and −7.59%, respectively. The errors of the BPNN model were mostly within the ranges [−6%, −4%] or [4%, 6%], where the maximum and minimum of RE were individually equal to 1.36% and 8.73%, respectively. In this regard, the forecasting accuracy ranked from the highest to the lowest was: FC-WPA-LSSVM, WPA-LSSVM, LSSVM, and BPNN. Hence, FC can effectively avoid the blindness in the selection of similar days through experience. In contrast with LSSVM, administering WPA improves the prediction precision by virtue of the parameter optimization of LSSVM. It is without doubt that the forecasting accuracy of some points in FC-WPA-LSSVM is worse than the other three approaches; for instance, the error of FC-WPA-LSSVM was 1.76% at 22:00 in the spring test, which was greater than WPA-LSSVM and BPNN.
The performance comparison results of the forecasting models were measured by RMSE, MAPE and AAE, as presented in Figure 11. This demonstrates that the proposed approach outperforms the other models in terms of all the evaluation criteria, of which RMSE, MAPE and AAE of FC-WPA-LSSVM were equal to 2.20%, 2.09% and 2.09%, respectively. This is mainly due to the fact that FC can overcome the adverse effects of unconventional load data caused by factor mutation on LSSVM training, and WPA improves the generalization ability and prediction accuracy by parameter optimization in LSSVM model. In comparison with BPNN, LSSVM can avoid the drawbacks of premature convergence and easily falling into local optimum.
approaches; for instance, the error of FC-WPA-LSSVM was 1.76% at 22:00 in the spring test, which was greater than WPA-LSSVM and BPNN. The performance comparison results of the forecasting models were measured by RMSE, MAPE and AAE, as presented in Figure 11. This demonstrates that the proposed approach outperforms the other models in terms of all the evaluation criteria, of which RMSE, MAPE and AAE of FC-WPA-LSSVM were equal to 2.20%, 2.09% and 2.09%, respectively. This is mainly due to the fact that FC can overcome the adverse effects of unconventional load data caused by factor mutation on LSSVM training, and WPA improves the generalization ability and prediction accuracy by parameter optimization in LSSVM model. In comparison with BPNN, LSSVM can avoid the drawbacks of premature convergence and easily falling into local optimum.

Further Study
In order to further verify the validity of the proposed method, another e-bus charging station in Baoding, China, was selected for an experimental study. The load data of the station from January, Figure 11. RMSE, MAPE and AAE of the forecasting results (I).

Further Study
In order to further verify the validity of the proposed method, another e-bus charging station in Baoding, China, was selected for an experimental study. The load data of the station from January, 2016 to December, 2016 are provided in this paper, where seven successive days in each season were taken as test samples and the remaining data were used as training samples. The setting of parameters in WPA-LSSVM was consistent with the proposed method. In LSSVM, γ and σ 2 were equal to 10.2801 and 11.2675, respectively. The values of the parameters in the BPNN model were same as those in the previous case study. Figure 12 displays the values of RMSE, MAPE and AAE. 2016 to December, 2016 are provided in this paper, where seven successive days in each season were taken as test samples and the remaining data were used as training samples. The setting of parameters in WPA-LSSVM was consistent with the proposed method. In LSSVM, γ and σ 2 were equal to 10.2801 and 11.2675, respectively. The values of the parameters in the BPNN model were same as those in the previous case study. Figure 12 displays the values of RMSE, MAPE and AAE. From Figure 12, it can be seen that FC-WPA-LSSVM presents the lowest RMSE, MAPE and AAE, with corresponding values of 2.07%, 1.92% and 1.97 in the spring test, 2.29%, 2.20% and 2.11% in the summer test, 2.39%, 2.35% and 2.25% in the autumn test, and 2.08%, 1.90% and 1.84% in the winter test. It can be seen that the overall prediction performance of the forecasting approach was optimal due to the advantages of FC, WPA and LSSVM. In conclusion, the load forecasting model for e-bus charging stations based on FC-WPA-LSSVM can provide accurate data support for the economical operation of the station. In addition, the proposed model can also be applied to the load forecasting of other charging stations, and its prediction accuracy will not be affected by changes in the number of electric vehicles and other factors.
Since this forecasting model is based on MATLAB development, if the transportation company wants to use this model to predict the load in the future, they can also use it easily and obtain the forecast results without additional costs.

Conclusions
In view of the load characteristics for e-bus charging stations, this paper selected eight variables, including day type, maximum temperature, minimum temperature, weather condition, the number of accumulated daily number of charged e-buses and the loads at the same moment in the previous three days, as the input. A novel short-term load forecasting technique for e-bus charging stations based on FC-WPA-LSSVM was proposed, in which FC was used to extract similar dates as training samples, and WPA was introduced to optimize the parameters in LSSVM to improve the prediction accuracy. Two case studies were carried out to verify the developed approach in comparison with WPA-LSSVM, LSSVM and BPNN. The experimental results showed that the forecasting precision of the proposed model was better than the contrasting models. Hence, FC-WPA-LSSVM provides a new idea and reference for short-term load forecasting of e-bus charging stations. From Figure 12, it can be seen that FC-WPA-LSSVM presents the lowest RMSE, MAPE and AAE, with corresponding values of 2.07%, 1.92% and 1.97 in the spring test, 2.29%, 2.20% and 2.11% in the summer test, 2.39%, 2.35% and 2.25% in the autumn test, and 2.08%, 1.90% and 1.84% in the winter test. It can be seen that the overall prediction performance of the forecasting approach was optimal due to the advantages of FC, WPA and LSSVM. In conclusion, the load forecasting model for e-bus charging stations based on FC-WPA-LSSVM can provide accurate data support for the economical operation of the station. In addition, the proposed model can also be applied to the load forecasting of other charging stations, and its prediction accuracy will not be affected by changes in the number of electric vehicles and other factors.
Since this forecasting model is based on MATLAB development, if the transportation company wants to use this model to predict the load in the future, they can also use it easily and obtain the forecast results without additional costs.

Conclusions
In view of the load characteristics for e-bus charging stations, this paper selected eight variables, including day type, maximum temperature, minimum temperature, weather condition, the number of accumulated daily number of charged e-buses and the loads at the same moment in the previous three days, as the input. A novel short-term load forecasting technique for e-bus charging stations based on FC-WPA-LSSVM was proposed, in which FC was used to extract similar dates as training samples, and WPA was introduced to optimize the parameters in LSSVM to improve the prediction accuracy. Two case studies were carried out to verify the developed approach in comparison with WPA-LSSVM, LSSVM and BPNN. The experimental results showed that the forecasting precision of the proposed model was better than the contrasting models. Hence, FC-WPA-LSSVM provides a new idea and reference for short-term load forecasting of e-bus charging stations.
The load of e-bus charging stations is a kind of power load with complex change rules and diverse influential factors. With the large-scale application of electric vehicles, more and more e-bus charging stations will start to be put into use. At that time, research on actual operation of charging stations will be more abundant. It is necessary to make further efforts to seek more suitable load forecasting approaches for e-bus charging stations based on the study of load variation rules and the internal relationships between the load and influential factors.