Power Demand Forecasting Using Long Short-Term Memory (LSTM) Deep-Learning Model for Monitoring Energy Sustainability

: The purpose of this study is to design a novel custom power demand forecasting algorithm based on the LSTM Deep-Learning method regarding the recent power demand patterns. We performed tests to verify the error rates of the forecasting module, and to conﬁrm the sudden change of power patterns in the actual power demand monitoring system. We collected the power usage data in every ﬁve-minute resolution in a day from some groups of the residential, public o ﬃ ces, hospitals, and industrial factories buildings in one year. In order to grasp the external factors and to predict the power demand of each facility, a comparative experiment was conducted in three ways; short-term, long-term, seasonal forecasting experiments. The seasonal patterns of power demand usages were analyzed regarding the residential building. The overall error rates of power demand forecasting using the proposed LSTM module were reduced in terms of each facility. The predicted power demand data shows a certain pattern according to each facility. Especially, the forecasting di ﬀ erence of the residential seasonal forecasting pattern in summer and winter was very di ﬀ erent from other seasons. It is possible to reduce unnecessary demand management costs by the designed accurate forecasting method.


Introduction
Power demand forecasting is an important part of the Smart Grid. The prospect of a smart grid business is once again being reexamined through the transformation of low-carbon energy due to rising oil prices and environmental problems and the renewable energy business [1,2]. Research is being processed to diversify new industries by trying to combine them in various fields, such as IT, in preparation for energy issues [2,3].
Generally, energy is efficiently managed by applying the Smart Grid system using hardware and software that reflect the latest technology, in order to promote economic benefit using energy as a facility unit. Some key features of the Smart Grid, including energy scheduling management, require facilities to predict power demand in the facility. The essential purpose of electric power demand volatility assessment is not only blackout but also economic profit [4][5][6], because it interacted with intelligent demand response, which monitors energy in real time and manages demand of energy [4]. Utilizing demand responses can bring economic benefits to the facility, which will allow the country to achieve additional benefits such as cost savings and environmental conservation. Also, energy efficiency is the most profitable way for society to ensure energy supply, so researches on ways to consume energy efficiently are being actively carried out. The forecasting energy usage to ensure adequate energy supply is closely related to the energy efficiency increasing methods [7,8]. Also, energy efficiency (EE) can help the countries achieve multiple objectives such as lowering the energy bill, reducing energy dependence, decreasing greenhouse gas (GHG) and non-GHG emission, while maintaining or increasing the level of economic activity as well as improving overall sustainability by raising the share of renewable energy [9]. For example, countries such as China and Austria have set energy intensity targets as a percentage reduction compared to a certain base year [9,10]. Accurate forecasting of energy demand can reduce energy waste and improve energy sustainability. Indeed, many attempts are currently being made to forecast such power demand.
In the case of using Support Vector Machine, which is the most similar and generalized study, it is difficult to analyze an energy usage dataset in a facility-customized manner. It is difficult to deduce only the past power usage data, because it cannot recognize the change of the specific time zone by using only the existing machine learning algorithms [11]. It is a way to increase the accuracy by predicting the volatility of power demand in every time zone [12]. Since it is needed at each process, including data collection, preprocessing, feature extraction, and so on, power demand forecasting systems consume a lot of time and efforts [13,14].
In this study, we conducted power demand forecasting for each facility based on deep learning. In addition, we proposed a power demand forecasting model, "LSTM+MIDAS", that can confirm the volatility of the unconformities in comparison with the actual usage. This deep-learning system based on power demand volatility is necessary, because each facility has its own usage patterns, and the power usage, detailed power usage, contract capacity, and self-generated capacity vary from each facility [15][16][17]. However, there was one pattern at each facility type. After analyzing the data by season, day, month, and hour, we found a pattern of volatility in power demand. To analyze complex time series data such as power usage, many algorithms in the field of machine learning are applied in combination with regression analysis using dependent variables [18,19].
The purpose of this study is to propose a power demand forecasting method and provide high accuracy of the predicted data; the goal is to reduce the error rate by more than 30% from the other methodologies. In addition, we confirmed the feature of power demand patterns and proposed efficient power demand forecasting methods for each facility. The most important function used in the volatility assessment is the usage forecasting study. Most studies have been conducted to predict and compare power demand using the autoregressive distributed lag (ARDL) and mixed-data sampling (MIDAS) methods. Those methods are used to estimate the value by numerical calculation using the different data formats that affect power demand. Therefore, in the power demand forecasting area, those are very actively applied. This section shows how the two approaches are used in power demand forecasting.

ARDL Approach Method
One way of predicting power demand is the ARDL, one of the most widely used dynamic regression analyses to analyze time series data. The ARDL, widely used as a methodology of error correction and cointegration, is a method mainly used for inferring numerical values in the social and economic fields [5,19].
In power demand forecasting, the ARDL method assumes that the monthly power demand affects the demand for power several years ago and includes autoregressive orders. The heating degree day and the cooling degree day are also included in the past month's independent variables. The statistical significance level can be confirmed by using different models according to the usage.
Equation (1) is commonly used as an ARDL(A y , A x ) model with autoregression degree of the dependent and independent variables A y and A x to predict monthly data (y t ) from weekly data (x t ) for the j-th week of the t-month [7]. It is assumed that the month is fixed at 4 weeks: j = 1, . . . , 4 [7]. However, power demand forecasting is very sensitive to temperature and seasonal factors due to the characteristics of power usage, so the older the past data, the less influence on forecast data [20]. Therefore, different weights should be assigned to each historical data to predict more accurately [20]. If each week's data in Equation (1) is given a different weight, Equation (2) can be derived where the w is a week.
In the case of ARDL(1,2) using Equation (2), the number of estimated coefficients of x is eight (2 × 4) [7]. The data used in this study are daily data, so the number of estimated coefficients is 60 (2 × 30), assuming 30 days per month. In this case, the estimation of the model itself becomes difficult and the reliability of the results is very low due to the degrees of freedom loss [7,21].

MIDAS Approach Method
Like the ARDL method, the existing power demand forecasting system using the MIDAS method predicted power consumption with regression model [7], which is used to calculate GDP in economics. The biggest advantage of MIDAS is that the weight function automatically assigns weights [22].
The MIDAS Equation (3) is similar to the ARDL Equation (1) but includes a function (ϕ( j; θ)), which imposes different weights on high frequency (lags) [23]. θ is the parameter vector of the weight function, and N w is the number of weeks [7]. So, when predicting power demand by the MIDAS method, it is possible to predict by considering various external factors besides power demand data without adjustment to the parameters of the weight function. MIDAS regression is essentially tightly parameterized, reduced form regressions that involve processes sampled at different frequencies [24][25][26]. In order to assign different weights according to the frequency, the weight function used in the MIDAS method was used in this study. In other MIDAS regression forecasting studies, by setting the temperature, the number of working days, the income variable, and the price variable as independent variables, the accuracy of short-term power demand forecasting could have been improved [26,27]. Saturdays were set to be half days, excluding holidays and Sundays, and add up the number of workdays [7]. As a result, monthly data were useful in the power demand forecasting area. Relatively high accuracy was obtained when monthly and weekly heating and cooling differences and temperature, etc., were reflected. Also, as it is possible to analyze the pattern of volatility of power demand by separating weekday and weekend power demand data, forecasting can be made in facilities where there is a large difference in power demand between weekdays and weekends, such as a company and city hall. We focused on this feature in power demand and conducted a forecasting analysis. In the MIDAS method, various types of weight functions for calculating weights can be used, so the results can be different for each weight function. In short, the key idea of the model is to simplify the estimation of the weights imposed on the high frequency variables using only a few parameters.
The rest of the article is organized as five sections. Section 2 explains the data that is used in this study and method, Section 3 shows the results data, Section 4 shows the discussion points and, lastly Section 5 shows the conclusion.

Materials and Methods
This section describes our dataset and LSTM power demand forecasting model as Figure 1.

Materials and Methods
This section describes our dataset and LSTM power demand forecasting model as Figure 1. From the results of previous studies of a volatility assessment model, we found that predicting power demand is the most important to understand the pattern of power demand data [28]. In order to identify the pattern, we classified the data into short-term and long-term data. Experiments were conducted through the collection of data for each facility according to power demand. We measured the three experiment methods' forecasting error rates to evaluate our forecasting model using data from different periods (short-term and long-term): (1) the existing power demand forecasting MIDAS algorithm; (2) the existing LSTM model; and (3) our proposed LSTM+MIDAS model. Using these three methodologies, we experimented with short-term data and long-term data to compare the differences in forecasting accuracy. However, in the case of residential facilities with large variable power demand patterns, it is important to understand the meaning from the seasonal, weather, and holiday aspects, rather than merely a period aspect. Then, our LSTM+MIDAS model performance was compared to other existing studies.
In Figure 2, in the case of residential and city hall buildings, the average maximum power demand tends to increase in summer (June to August), but the overall average maximum power demand of city hall was higher, and there was no difference between winter (November to January) and summer. Although there is a big difference in the power demand for each facility, the factory building showed the most similar average maximum power demand; between 600 kWh and 700 kWh over a year. Since hospital facilities are university hospital buildings, the maximum power demand was the highest among the four facilities. It also showed the highest maximum power demand pattern between May and September when the temperature rose. However, residential facility buildings showed the largest difference in the maximum power demand in summer and winter, while it showed similar patterns in other seasons, except summer. So, we conducted further experiments of seasonal power demand forecasting for residential facilities. From the results of previous studies of a volatility assessment model, we found that predicting power demand is the most important to understand the pattern of power demand data [28]. In order to identify the pattern, we classified the data into short-term and long-term data. Experiments were conducted through the collection of data for each facility according to power demand. We measured the three experiment methods' forecasting error rates to evaluate our forecasting model using data from different periods (short-term and long-term): (1) the existing power demand forecasting MIDAS algorithm; (2) the existing LSTM model; and (3) our proposed LSTM+MIDAS model. Using these three methodologies, we experimented with short-term data and long-term data to compare the differences in forecasting accuracy. However, in the case of residential facilities with large variable power demand patterns, it is important to understand the meaning from the seasonal, weather, and holiday aspects, rather than merely a period aspect. Then, our LSTM+MIDAS model performance was compared to other existing studies.
In Figure 2, in the case of residential and city hall buildings, the average maximum power demand tends to increase in summer (June to August), but the overall average maximum power demand of city hall was higher, and there was no difference between winter (November to January) and summer. Although there is a big difference in the power demand for each facility, the factory building showed the most similar average maximum power demand; between 600 kWh and 700 kWh over a year. Since hospital facilities are university hospital buildings, the maximum power demand was the highest among the four facilities. It also showed the highest maximum power demand pattern between May and September when the temperature rose. However, residential facility buildings showed the largest difference in the maximum power demand in summer and winter, while it showed similar patterns in other seasons, except summer. So, we conducted further experiments of seasonal power demand forecasting for residential facilities.

Datasets
From November 2016 to October 2017, power demand was collected by sensors installed in various facility buildings (residential, hospital, farm, city hall, factory, company, etc.). Of these data, power demands from four institutions-residential, factory, hospital, and city hall-were used to calculate forecast accuracy for each power demand usage patterns. The collected data consist of 288 data per day every 5 min, enabling detailed power pattern analysis, unlike other future forecasting power usage studies, which used quarterly power usage data [27]. For each of the four facilities, we used different data components as shown in Table 1 for short-term, long-term, and seasonal forecasts. In seasonal data, winter is classified from November 2016 to January 2017, and summer is from June to August 2017, based on Korean seasonal characteristics. In the LSTM module, the train and test data of the input data are in a 2:1 ratio. Table 1 shows the structure of the input and output datasets. Since the power demand patterns are similar for each day of the week, input data (train data and test data) and output data consist of the same day (7-day lags) data. In the short-term forecast, we used three 7-day lags data in the previous three weeks to predict

Datasets
From November 2016 to October 2017, power demand was collected by sensors installed in various facility buildings (residential, hospital, farm, city hall, factory, company, etc.). Of these data, power demands from four institutions-residential, factory, hospital, and city hall-were used to calculate forecast accuracy for each power demand usage patterns. The collected data consist of 288 data per day every 5 min, enabling detailed power pattern analysis, unlike other future forecasting power usage studies, which used quarterly power usage data [27]. For each of the four facilities, we used different data components as shown in Table 1 for short-term, long-term, and seasonal forecasts. In seasonal data, winter is classified from November 2016 to January 2017, and summer is from June to August 2017, based on Korean seasonal characteristics. In the LSTM module, the train and test data of the input data are in a 2:1 ratio. Table 1 shows the structure of the input and output datasets. Since the power demand patterns are similar for each day of the week, input data (train data and test data) and output data consist of the same day (7-day lags) data. In the short-term forecast, we used three 7-day lags data in the previous three weeks to predict the next week's data for the same day of the week. In the long-term forecast, twelve 7-day lags data during the previous 12 weeks were used to predict data for the same day during the next four weeks. For example, every Monday, data during the previous 12 weeks was used as train and test data when the power demand data for every Monday was predicted during the next four weeks. Likewise, in seasonal data forecast, the previous nine weeks' 7-day lags data were used to predict the next three weeks. Figure 3 shows the flowchart of the proposed LSTM structure according to input and output data for each of the three models.
Sustainability 2020, 12, 1109 6 of 14 the next week's data for the same day of the week. In the long-term forecast, twelve 7-day lags data during the previous 12 weeks were used to predict data for the same day during the next four weeks. For example, every Monday, data during the previous 12 weeks was used as train and test data when the power demand data for every Monday was predicted during the next four weeks. Likewise, in seasonal data forecast, the previous nine weeks' 7-day lags data were used to predict the next three weeks. Figure 3 shows the flowchart of the proposed LSTM structure according to input and output data for each of the three models.

Data Preprocessing
Before the previous power demand was entered as input data in the LSTM+MIDAS power demand forecasting model, we performed the preprocessing by assigning different weights to the input data affecting the volatility of the predicted data. Therefore, we conducted preprocessing using MIDAS method's weight function to weight each day's worth of input data, which is similar to the volatility of forecasting. = , , , … , is the input previous power demand, which means the power demand at t-th time of nth week's one day. In the MIDAS method, the weight value is calculated by using the MIDAS regression without directly estimating the weight given to the past value of the information variables. The function to determine the weight for high frequency values using the MIDAS regression method is: The results of the data preprocessing with the above Equation (4) are as follows: Equation (4) is an ALMOD exponential function, which is widely used as a weighting function of the MIDAS regression method [29][30][31]. is the parameter vector of the weight function. The shape and speed of the weight function vary depending on the value [7]. We set the value in the range from −0.002 to 0.01 for exponential increase of the weight.
is the vector matrix of ( ; ) and is a bias coefficient [32]. If the value after the calculation with the weight is too different from the raw data, it is adjusted through the adjustment of the bias value; the bias value is from −0.03 to 4.25.

Data Preprocessing
Before the previous power demand was entered as input data in the LSTM+MIDAS power demand forecasting model, we performed the preprocessing by assigning different weights to the input data affecting the volatility of the predicted data. Therefore, we conducted preprocessing using MIDAS method's weight function to weight each day's worth of input data, which is similar to the volatility of forecasting.
x n t = x n 1 , x n 2 , x n 3 , . . . , x n t is the input previous power demand, which means the power demand at t-th time of nth week's one day. In the MIDAS method, the weight value is calculated by using the MIDAS regression without directly estimating the weight given to the past value of the information variables. The function to determine the weight for high frequency values using the MIDAS regression method is: The results of the data preprocessing with the above Equation (4) are as follows: Equation (4) is an ALMOD exponential function, which is widely used as a weighting function of the MIDAS regression method [29][30][31]. θ is the parameter vector of the weight function. The shape and speed of the weight function vary depending on the θ value [7]. We set the θ value in the range from −0.002 to 0.01 for exponential increase of the weight. W is the vector matrix of W(n; θ) and b is a bias coefficient [32]. If the value after the calculation with the weight is too different from the raw data, it is adjusted through the adjustment of the bias value; the bias value is from −0.03 to 4.25.

LSTM-Based Power Demand Forecasting Model
The LSTM's forecasting mechanism has been widely used for many time-series forecasting in recent years [33]. So, we used the LSTM model which is suitable for time-series forecasting, but we made it possible to reflect the weight function value used in the existing MIDAS to make the model, considering the power demand's volatility. Figure 4 shows the structure of the LSTM+MIDAS model for power demand used in this study.

LSTM-Based Power Demand Forecasting Model
The LSTM's forecasting mechanism has been widely used for many time-series forecasting in recent years [33]. So, we used the LSTM model which is suitable for time-series forecasting, but we made it possible to reflect the weight function value used in the existing MIDAS to make the model, considering the power demand's volatility. Figure 4 shows the structure of the LSTM+MIDAS model for power demand used in this study. In the LSTM+MIDAS model, the input data from the input layer are entered into the input gates, the forget gates, and the output gates as the parameters. The input parameters' values are entered by calculation of weight and bias value. At each gate, different values are calculated according to time t. The LSTM structure of the Figure 4 is operated as below: In the above equations, ′ and represent the input and output at time t; , , and are the state values of the input gate, forget gate, and output gate; is the state value of the cell at time t; , , and -called peephole connections-are weight vectors connecting the internal memory value of the memory space and the gate; and ⨀ indicates point-wise multiplication. , and are bias values; the W is the input weight matrix; and R is the weight matrix for the input through the circulation. In the LSTM+MIDAS model, the input data x from the input layer are entered into the input gates, the forget gates, and the output gates as the parameters. The input parameters' values are entered by calculation of weight and bias value. At each gate, different values are calculated according to time t. The LSTM structure of the Figure 4 is operated as below:

Comparison of Evaluations for Each Model
To evaluate the forecasting performance of LSTM model, we measured the statistical analysis using the mean relative error, mean absolute percentage error (MAPE), root mean square error (RMSE) and R-squared (R 2 ). The formula for evaluating each model is as follows: where H * i is the predicted values of the data i, H i is the actual values of the data i and H is the mean of H i . The normal range of the R-square is [0, 1], and the closer to 1, the stronger the explanatory power of the model [6]. Since the power demand data used in this study varies in scale depending on the facilities, the R 2 was calculated to compare the predicted results according to the facilities.

Development Environment
In this study, we used Deeplearning4J (DL4J) to construct a power demand forecasting model using LSTM, which is the one of the most appropriate deep learning-based time series data forecasting methods [34]. DL4J has a characteristic that it is easy to construct an environment that can use graphics processing unit (GPU). So, we proposed a LSTM model based on the DL4J method combined with MIDAS to optimize for power demand forecasting.

Hyperparameter Setting
The Good performance of deep learning can be achieved by setting the appropriate hyperparameters. The optimal number of layers, nodes, iterations, and activation function, etc., must be set. Generally, it is necessary to find the most optimal hyperparameter setting according to the number or purpose of the data. We found the hyperparameter setting, which achieved optimal results through a total of 40 settings in the hyperparameter setting. We set the number of hidden layers to 3, the number of nodes to 10, the learning rate to 0.01, and set the number of iterations to 180. We used hyperbolic tangent (tanh) and stochastic gradient descent as the activation function and optimization algorithm of the LSTM layer, respectively. In the classification LSTM model, cross-entropy (CE) and sum of square errors (SSE) are used as cost functions for prediction using multiclass classification, but mean square error (MSE) is predominantly used for prediction using regression [35][36][37]. Therefore, we used MSE as a cost function to reduce the prediction error. Table 2 shows a total of three results showing the highest accuracy obtained in the experiments to find the optimized hyperparameters. According to each setting, different prediction results were shown, we used Setting 3, which had the highest accuracy.

Comparison of Experimental Results
Tables 3-5 show the statistical analysis for the experiments conducted in this study by each facility building. Table 3 is the result of the short-term forecasting and Table 4 is the result of the long-term forecasting. In the case of the short-term forecasting, the MAPE of residential, city hall, factory, hospital facilities decreased from 21.04% to 10.44%, from 15.6% to 2.73%, from 7.21% to 1.63%, and from 7.1% to 1.96%, respectively. Table 4 shows the long-term forecasting results. There was no significant difference in each error rate by methodology.   Table 5 shows the residential facility's seasonal forecasting results. Unlike the long-term and short-term forecasting results in Tables 4 and 5, it can be confirmed that the error rates of the large difference are shown according to the methodology in the case of the experiment of the seasonal forecasting. In the winter experiments, the predicted data included the holiday, which drastically reduced the power demand. So, in the MIDAS method, when weighting value was assigned, we experimented by giving bigger weight to other holiday data. However, in the LSTM method, weight is not assigned, and thus shows the lowest performance in LSTM as shown in Table 5.

Statistical Results
LSTM+MIDAS were compared with results from short-term, long-term, and seasonal experiments with three models (MIDAS, LSTM, LSTM+MIDAS) using a nonparametric statistical test called the Friedman test [38]. The usual form of the Friedman test uses rank instead of the original values, where the ranks are obtained by ranking the rows separately and independently from each other [39]. In Table 6, N is the number of our total MAPE results (short-term, long-term, and seasonal), Chi-squared is 8.6, Degree of freedom (DF) is 2, and the p-value is 0.018. Friedman's critical value α is set at a 0.05 significance level, which has been commonly used in significance level [40].

Discussion
In this study, we proposed LSTM+MIDAS, a short-term data optimized power demand forecasting model using only the previous power demand data. The accuracy of the power demand forecasting depends on the data preprocessing, weight function approach. In addition, it is important to set up a model that can closely follow the pattern of power demand volatility over time. In Table 4, we confirmed that the LSTM+MIDAS model is better reflected in the power demand volatility than the other two methods. As general industrial facilities show similar patterns of power demand every day, short-term power demand forecasting using our model was very efficient. On the other hand, facility buildings that tend to respond sensitively to weekend and weather influences, such as residential, should be seasonally classified and then applied to our LSTM+MIDAS model.
To sum up the experiment's results, we confirmed the lowest forecasting performance for residential facilities, whereas the factory and hospital facilities showed a high accuracy with a relatively low level of error rates. The LSTM+MIDAS model of short-term data showed a higher level of forecasting than the other two methodologies. Also, the results of the short-term data are relatively better than those of the long-term data. We confirmed that, in comparison with the largest MAPE in the case of short-term forecasting, our LSTM+MIDAS model shows that the residential facilities decreased by 10.44%p, city hall by 12.87%p, factories by 5.58%p, and hospitals by 5.14%p; in the case of long-term forecasting, residential facilities only decreased by 2.11%p and city hall by 2.8%p. Long-term forecasting did not improve the error rates in power demand forecasting at factories and hospital facilities. Generally, the power demand forecasting using the short-term data more accurately reflects the volatility, thereby improving the accuracy of short-term forecasting. However, the prediction of long-term data is highly influenced by weather and external factors, so the accuracy of the methodology does not show much difference, even though the overall accuracy is higher due to the larger number of datasets. We confirmed that there was no improvement in accuracy according to the methodology in the long-term power demand forecasting. Because the power demand data is related to the volatility, the long-term power demand data is not well predicted at the time of the volatility trends.
We also confirmed that predictions of power demand in residential facilities that are sensitive to weather, seasonal, and holiday influences can achieve higher accuracy by seasonally categorizing and predicting the data considering the weights for special situations, as shown in Figure 5. We also confirmed that predictions of power demand in residential facilities that are sensitive to weather, seasonal, and holiday influences can achieve higher accuracy by seasonally categorizing and predicting the data considering the weights for special situations, as shown in Figure 5. Other existing Cheng et al.'s study on power demand forecasting for residential facilities using PowerLSTM was used for July power demand input data [41]. We also used July data for summer forecasts, so we compared residential power demand forecasts for summer and Cheng et al.'s study [41]. The results show a comparison of the MAPE values of our model and Cheng et al.'s PowerLSTM model using residential data [41]. The MAPE of our LSTM+MIDAS is 5.400%, and the PowerLSTM is 8.935%. The difference is 3.535%p and the error rate (MAPE) of LSTM+MIDAS is reduced by 39.564% compared to PowerLSTM [41]. In addition, we conducted the Friedman test to show whether the LSTM+MIDAS model is more meaningful than the MIDAS or LSTM model for the values of each experiment. We identified that the LSTM+MIDAS model is less than the α (p < 0.05). So, we can say that the LSTM+MIDAS model results are statistically significant through the Friedman test.

Conclusions
The existing power demand forecasting method using the regression analysis method required many external factors, in addition to the previous power demand data. However, in this study, it is possible to predict the power demand with reduced error rates only by the previous power demand data using the prediction of the patterns' trends. We could analyze that predicted power demand patterns were different depending on the facilities. Especially, residential facilities are very influenced by the seasonal factors. Since we considered only power demand data as input data, the Other existing Cheng et al.'s study on power demand forecasting for residential facilities using PowerLSTM was used for July power demand input data [41]. We also used July data for summer forecasts, so we compared residential power demand forecasts for summer and Cheng et al.'s study [41]. The results show a comparison of the MAPE values of our model and Cheng et al.'s PowerLSTM model using residential data [41]. The MAPE of our LSTM+MIDAS is 5.400%, and the PowerLSTM is 8.935%. The difference is 3.535%p and the error rate (MAPE) of LSTM+MIDAS is reduced by 39.564% compared to PowerLSTM [41]. In addition, we conducted the Friedman test to show whether the LSTM+MIDAS model is more meaningful than the MIDAS or LSTM model for the values of each experiment. We identified that the LSTM+MIDAS model is less than the α (p < 0.05). So, we can say that the LSTM+MIDAS model results are statistically significant through the Friedman test.

Conclusions
The existing power demand forecasting method using the regression analysis method required many external factors, in addition to the previous power demand data. However, in this study, it is possible to predict the power demand with reduced error rates only by the previous power demand data using the prediction of the patterns' trends. We could analyze that predicted power demand patterns were different depending on the facilities. Especially, residential facilities are very influenced by the seasonal factors. Since we considered only power demand data as input data, the forecasting error rate of residential facilities, which is affected by the weather, increased more than other facilities. The analysis of other factors affecting power demand can be added to predict power demand performances.
So, this study can be expected to further future studies, provide efficient and accurate forecasting of power demand by adding data on external factors affecting power demand predicting, as well as previous power demand data. In addition, predicting accurate power demand with high performance would be contributed to the sustainable development of the natural environment and environment management area, which are nowadays great issues all over the world.
Author Contributions: Writing-original draft preparation, E.C.; writing-review and editing, E.C. and D.K.K.; data curation, S.C. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.