DDTree: A Hybrid Deep Learning Model for Real-Time Waterway Depth Prediction and Smart Navigation

Timely and accurate depth estimation of a shallow waterway can improve shipping efficiency and reduce the danger of waterway transport accidents. However, waterway depth data measured during actual maritime navigation is limited, and the depth values can have large variability. Big data collected in real time by automatic identification systems (AIS) might provide a way to estimate accurate waterway depths, although these data include no direct channel depth information. We suggest a deep neural network (DNN) based model, called DDTree, for using the real-time AIS data and the data from Global Mapper to predict waterway depth for ships in an accurate and timely way. The model combines a decision tree and DNN, which is trained and tested on the AIS and Global Mapper data from the Nantong and Fangcheng ports on the southeastern and southwestern coast of China. The actual waterway depth data were used together with the AIS data as the input to DDTree. The latest data on waterway depths from the Chinese maritime agency were used to verify the results. The experiments show that the DDTree model has a prediction accuracy of 91.15%. Therefore, the DDTree model can provide an accurate prediction of waterway depth and compensate for the shortage of waterway depth monitoring means. The proposed hybrid DDTree model could improve marine situational awareness, navigation safety, and shipping efficiency, and contribute to smart navigation.


Introduction
Water transport is the main transportation mode for the ever-increasing volume of international trade. The high capacity, cost savings, low energy consumption, and advances in the development of waterborne transportation make it the preferred mode for long-distance trade, when compared to other means of transport. Based on the latest report of the International Maritime Organization (IMO) [1], 90% of goods are currently transported by shipping. Maritime vessel traffic differs from ground-based traffic because of the complexity of rapidly changing the direction of the ships on the water [2]. In order to ensure that the ship passes safely through the waters, the water depth should be greater than the sum of the draft of the ship and surplus under keel clearance (UKC) [3]. Due to the lack of real-time monitoring means, there are often economic losses due to excessive ship loading, resulting in ship stranding and port obstruction. In this context, the development of new methods for increasing situational awareness and achieving smart maritime navigation is highly relevant [4].
Obtaining actual water depths, however, is difficult because the water depth changes constantly. Water depth data that are found in the electronic chart display and information system (ECDIS) and water-way area maps, therefore, are not accurate, which may cause ships to be stranded, collide, and hit rocks. Such accidents can result in serious casualties and property loss, and may even cause serious damage to regional transportation. For example, in the period from 2013 to 2014, the reef accidents in the Yangtze River navigation channel accounted for 10% of all accidents in the Yangtze River, and the proportion of all kinds of vessel accidents on the Yangtze River was 17% [5]. The total number of accidents caused by insufficient navigation depth was 27%. Therefore, having accurate and timely knowledge of water depth data is of great significance to safe navigation. The problem is especially relevant in the context of Internet-of-Things (IoT) solutions for smart ships [6], connected ships [7], and next generation marine data networks [8] in which the problems of quality of service and environmental awareness arise [9].
In water transport, the water depth of a channel refers to the vertical distance from the water level to the bed of a waterway channel. For a local section of a waterway, it usually refers to the vertical distance from the water level to the bed at the shallowest part of a channel. Safe water depth and the depth of water needed for a ship to pass safely acts as a guide for route planning and safe navigation of ships. The bottom of the shallow channel can be heavily silted under the hydrological conditions of wind, waves, and currents. A review of the literature reveals two broad approaches for obtaining the water depth of a waterway: (1) monitoring of the water depth with sonar and other detection devices, and (2) time-delay acquisition and processing of underwater acoustic signals to calculate the waterway depth.
At present, the prediction methods in ship transportation focus on short-term traffic flow prediction, and traffic congestion forecasting. The predictions of depth in waterways are obtained through satellite remote sensing images and real-time sensors; for instance, when performing monitoring of oil spills [10]. Since almost all in-flight vessels are equipped with automatic identification systems (AIS), the analysis of full AIS data and the development of deep learning models could support the improved prediction of waterway depth.
In this paper, a hybrid decision tree-deep neural network (DDTree) model is constructed to predict safe waterway depths for ships to increase the precision and timeliness of the waterway depth data and provide references for safe marine navigation and route selection. The rest of this paper is organized as follows. In Section 2, the conceptual model for the analysis of the actual waterway depth data (from Global Mapper) and the AIS data are introduced, and the proposed DDTree model is presented. In Section 3, the results for prediction of waterway depth are presented and the model's performance is analyzed. Finally, Section 4 presents the conclusions of the paper.

Related Work
In the field of waterway transportation, current research related to the prediction of water traffic focuses on progressive water prediction of depth. Big data collected in real time by automatic identification systems (AIS) might provide a way to estimate accurate real-time waterway depths, but these data include no direct channel depth information [11]. The large volume ship AIS data are widely used in navigation. In addition, the wide application of GNSSs (global navigation satellite systems) also provides technical support and a safety guarantee for navigation. Therefore, by employing the data-mining techniques on big data from AIS and GNSS, the waterway depth can be predicted. For example, Su et al. [12] proposed using multi-spectral satellite images to predict water depth change along a waterway. In order to enhance the accuracy of water prediction of depth, Kang et al. proposed a differential dynamic positioning algorithm based on GPS/Beidou which realizes middle precision based on low cost equipment [13]. Kamil Maciuk analyzed the accuracy of static GNSS measurements according to the number of visible satellites, based on different minimal elevation cutoff angles [14]. Hsin-Hung Chen et al. proposed two synthetic models for sound-speed profiles, one linear and the other bilinear [15]. Similarly, EL-Hattab [16] analyzed critical issues in GPS precise positioning. Mamoun F. Abdel-Hafez et al. addressed the problem of estimating the measurements' noise statistics of the Global Positioning System (GPS) using the autocovariance least squares technique [17]. Younis et al. [18] proposed a method for predicting the depth of water in a rough composite channel. Shiri et al. [19] proposed a wavelet and a fuzzy neural network model to estimate the depth of groundwater. Prediction of waterway depth can be a part of larger models focused on avoiding marine traffic accidents and ship collision. For example, Wang et al. [20] applied short-time space-time analysis of the AIS data to analyze in-land water ship collisions. Vodas et al. [21] proposed a ship accident prediction and judgment model based on the AIS data. Li et al. [22] proposed a method to improve the AIS data collection equipment. Through the detailed analysis of the data received by the AIS receiver in Skynet 3's debugging stage, receiving capability was improved. Mulyadi [23] proposed a Bayesian network-based model for assessing the frequency of ship sinking accidents. Similarly, Li et al. [24] used generalized Bayesian inference and the Dempster's rule to recognize the patterns of ship traffic from historical AIS data. Salmalian et al. [25] predicted ship squat behavior in shallow water depending upon waterway depth. They used the group method of data handling (GMDH) neural networks and an adaptive network-based fuzzy inference system (ANFIS) with good results. Niu et al. [26] used the feed-forward neural network (FFNN) and support vector machine (SVM) classifiers to infer ship location from acoustical data in a deep waterway channel. Bannari et al. [27] suggested an approach for bathymetric mapping of shallow water of Arabian Gulf near Bahrain using the kriging procedure. Kim and Lee [28] proposed the ship traffic extraction neural network (STENet) to forecast the maritime traffic of the caution area, which represented an increased risk for ships, due to things such as berth limitations. Jeong et al. [29] performed multi-criteria route planning using the AIS data while considering waterway depth (in the navigable area) as one of risk factors and criteria for optimization. Rong et al. [30] adopted the kernel density estimation-based method for mining ships' behaviors from the AIS data. Xin et al. [31] proposed a ship traffic simulation model for in the Xiazhimen waterway for analyzing the statistical characteristics of ship traffic. Zhang et al. [32] analyzed the spatial-temporal dynamic of traffic in port waterways while using the AIS data. Finally, Yang et al. [33] overviewed and summarized main contributions of the AIS data for marine navigation safety. Machine learning approaches has been used in addressing challenges in many fields, such as medical and engineering. For example, Gianni D'Angelo et al. presented a robust approach to the optimal policy of autonomous space systems modeled via Markov decision process (MDP) from the values assigned to its transition probability matrix [34]. In the medical field, Gianni D'Angelo et al. presented a proposal for distinguishing between bacterial and viral meningitis using genetic programming and decision trees [35]. Machine learning methods are also widely used in prediction problems, and have achieved good results in problems such as weather forecasting and ocean wave prediction.

Data
Waterway depth is the vertical distance from the water surface to the bottom (seafloor) of a waterway. At present, the waterway depth data can be extracted from the electronic charts visualized cartographically, as shown in Figure 1a. In addition to the ECDIS charts, the berth map also presents the waterway depth data. Unlike an electronic chart, the berth map divides the watercourse according to the geographical position, displaying waterway depth information from berths, as shown in Figure 1b. Figure 1a,b illustrates two waterway depth data acquisition methods. The waterway depth data can be obtained from ECDIS or from berth map. On the other hand, the prediction of unknown depth data from the available depth-related data is the main research goal of this paper. The AIS data represent the static data and the dynamic data of the ships. The dynamic data have information such as Maritime Mobile Communication Service Identification (MMSI) number, latitude and longitude, speed/course over ground, steering rate, and headings [36]. The static data include inherent information about a ship, such as the length, the breadth, the maximum draft, and the type. Table 1 shows an example of the AIS data. Since there are missing points in AIS data, the longitude and latitude data must be imputed by the data interpolation method, making the water depth information more complete. The cubic spline interpolation method obtains waterway depth data corresponding to different latitude and longitude points in the waterway, and the obtained waterway depth data are integrated according to the longitude and latitude values and the AIS data of the waterway, so that the depth data and the AIS dynamic data are combined with static data.

Conceptual Model
In order to achieve the prediction of waterway depth, we have designed, evaluated, and verified the proposed DDTree model. First, the AIS data must go through preprocessing and abnormal data detection and other methods to remove redundant and abnormal data (see Figure 2). There are many errors, omissions, and noises in AIS data. The use of erroneous data for calculation or analysis may lead to a wrong conclusion. Therefore, it is critical to preprocess the data and delete the obvious noise information, such as latitude 89°, negative longitude, and the data whose MMSI is not nine digits. At the same time, some redundant information should also be deleted; for example, AIS dynamic information has much duplicate data. Then, the AIS data and waterway depth data are combined to build a deep neural network dataset. After training datasets and verification datasets are divided, the network structure is adjusted and the depth data are predicted by the combination of a neural network and a decision tree. Finally, the prediction performance, accuracy, is evaluated. The AIS data represent the static data and the dynamic data of the ships. The dynamic data have information such as Maritime Mobile Communication Service Identification (MMSI) number, latitude and longitude, speed/course over ground, steering rate, and headings [36]. The static data include inherent information about a ship, such as the length, the breadth, the maximum draft, and the type. Table 1 shows an example of the AIS data. Since there are missing points in AIS data, the longitude and latitude data must be imputed by the data interpolation method, making the water depth information more complete. The cubic spline interpolation method obtains waterway depth data corresponding to different latitude and longitude points in the waterway, and the obtained waterway depth data are integrated according to the longitude and latitude values and the AIS data of the waterway, so that the depth data and the AIS dynamic data are combined with static data.

Conceptual Model
In order to achieve the prediction of waterway depth, we have designed, evaluated, and verified the proposed DDTree model. First, the AIS data must go through preprocessing and abnormal data detection and other methods to remove redundant and abnormal data (see Figure 2). There are many errors, omissions, and noises in AIS data. The use of erroneous data for calculation or analysis may lead to a wrong conclusion. Therefore, it is critical to preprocess the data and delete the obvious noise information, such as latitude 89 • , negative longitude, and the data whose MMSI is not nine digits. At the same time, some redundant information should also be deleted; for example, AIS dynamic information has much duplicate data. Then, the AIS data and waterway depth data are combined to build a deep neural network dataset. After training datasets and verification datasets are divided, the network structure is adjusted and the depth data are predicted by the combination of a neural network and a decision tree. Finally, the prediction performance, accuracy, is evaluated. Appl. Sci. 2020, 10,

Decision Tree
The waterway depth data are predicted through the hybrid model of a decision tree and a deep neural network (DNN). Similar hybrid models combining the models from different subdomains of machine learning have been proven successful for solving various classification and regression tasks [37,38]. The DDTree model is composed of two parts. The decision tree is responsible for selecting different attributes of the static data and dynamic data in the AIS data and selecting the optimal attributes. The DNN is responsible for processing the optimal attributes and generating prediction of waterway depth data.
The waterway depth data are used as supervised data to establish a model for prediction of waterway depth. Figure 3 shows a structure of the hybrid DDTree model. Each non-leaf node in the decision tree represents an attribute of the AIS data with which to judge whether the leaf node belongs to a certain range [39], as is shown in Figure 3. Each branch of a tree represents the output of an attribute over a range of values, which is used to indicate the output value under the corresponding judgment. After evaluating the static and dynamic AIS data in the decision tree, the information gain of different features can be determined, and the correlations between different features and predictive variables can be judged.

Decision Tree
The waterway depth data are predicted through the hybrid model of a decision tree and a deep neural network (DNN). Similar hybrid models combining the models from different subdomains of machine learning have been proven successful for solving various classification and regression tasks [37,38]. The DDTree model is composed of two parts. The decision tree is responsible for selecting different attributes of the static data and dynamic data in the AIS data and selecting the optimal attributes. The DNN is responsible for processing the optimal attributes and generating prediction of waterway depth data.
The waterway depth data are used as supervised data to establish a model for prediction of waterway depth. Figure 3 shows a structure of the hybrid DDTree model. Each non-leaf node in the decision tree represents an attribute of the AIS data with which to judge whether the leaf node belongs to a certain range [39], as is shown in Figure 3. Each branch of a tree represents the output of an attribute over a range of values, which is used to indicate the output value under the corresponding judgment. After evaluating the static and dynamic AIS data in the decision tree, the information gain of different features can be determined, and the correlations between different features and predictive variables can be judged.

Decision Tree
The waterway depth data are predicted through the hybrid model of a decision tree and a deep neural network (DNN). Similar hybrid models combining the models from different subdomains of machine learning have been proven successful for solving various classification and regression tasks [37,38]. The DDTree model is composed of two parts. The decision tree is responsible for selecting different attributes of the static data and dynamic data in the AIS data and selecting the optimal attributes. The DNN is responsible for processing the optimal attributes and generating prediction of waterway depth data.
The waterway depth data are used as supervised data to establish a model for prediction of waterway depth. Figure 3 shows a structure of the hybrid DDTree model. Each non-leaf node in the decision tree represents an attribute of the AIS data with which to judge whether the leaf node belongs to a certain range [39], as is shown in Figure 3. Each branch of a tree represents the output of an attribute over a range of values, which is used to indicate the output value under the corresponding judgment. After evaluating the static and dynamic AIS data in the decision tree, the information gain of different features can be determined, and the correlations between different features and predictive variables can be judged.  In the root node, we select the attribute in the AIS dataset that accurately decides the waterway depth in the electronic ECDIS chart. The remaining attributes are sorted according to their discriminating value and included in the output branches until reaching the final leaf node of the decision tree. The final left generates the output of the entire decision tree in the DDTree model.
After preprocessing AIS data, the optimal attributes in the AIS data are selected as the input for a DNN using the attribute selection method in the decision tree, by comparing the information gain from each attribute. This avoids selection of data with little influence on prediction, thus making the entire model responsive when estimating waterway depth.
The information entropy info(D) must be calculated in order to build a decision tree. Here info(D) is the entropy in the training dataset extracted from the AIS data calculated as in Equation (1): Here p i represents the probability that the i-th data category appears in the training tuple.
Assuming that training AIS dataset D is split by attribute A, the information expectation of A to D partitioning is expressed by Equation (2): We calculate the information gain for the difference between the two datasets by Equation (3): After calculating the information gain of each attribute, the set of the first k attributes with the largest information gain is selected by the DDTree model as the optimal attribute set (X 1 , X 2 , . . . , X k ). The algorithm in the decision tree is used to classify the waterway depth in the entire dataset. Then the decision tree is used to select optimal attributes as inputs for the neural network.
The characteristics of AIS data include the static and dynamic characteristics of ships. Among them, the static characteristics of a ship include the draft of the ship, the length of the ship, and the breadth of the ship. The dynamic characteristics of the ship include data varying momentarily during the movement of the ship, such as the data of the ship's speed over ground (SOG) and the steering rate. After calculating the information gain of the static and dynamic characteristics of the ship, the optimal number of attribute parameters is set. The static characteristics (a 1 , a 2 , . . . , a n ) and the dynamic information gain (b 1 , b 2 , . . . , b n ) of the ship are sorted in descending order; the attributes of the first k-th ships (c 1 , c 2 , . . . , c k ) make the optimal attribute combination. After selecting the optimal attributes, the depth of waterway is predicted by the DNN model.

Deep Neural Network
Prediction of waterway depth requires learning through a DNN as a nonlinear input-output model. Figure 4 shows the structure diagram of a DNN, in which the number of hidden layers is seven.
For the prediction of waterway depth, the DNN model uses as the inputs of different AIS attribute data x = (x 1 , x 2 , . . . , x n ) and produces the output y as prediction of depth; f 1 , f 2 , . . . , f n are the activation functions corresponding to each layer in the deep neural network;ŷ(x) is the output corresponding to each input data. Therefore, for layer l of the network, it can be expressed as Equation (4): where N l is the number of nodes on the first layer of the network, w l ∈ R N l ×N l−1 and b ∈ R, are the weight and bias respectively in the network, and f w,b is the network activation function of each layer.
Appl. Sci. 2020, 10, x 7 of 16 For the prediction of waterway depth, the DNN model uses as the inputs of different AIS attribute data = ( 1 , 2 , . . . , ) and produces the output y as prediction of depth; 1 , 2 , . . . , are the activation functions corresponding to each layer in the deep neural network; ̂( ) is the output corresponding to each input data. Therefore, for layer l of the network, it can be expressed as Equation (4): where is the number of nodes on the first layer of the network, ∈ × −1 and ∈ , are the weight and bias respectively in the network, and , is the network activation function of each layer. The optimal attribute is selected by the preceding decision tree model to provide the appropriate input attribute parameters for the DNN. Then, we need to set the initial parameters of the DNN. By iteratively adjusting the weights and offsets of the DNN, the prediction accuracy is maximized.
For the prediction of the waterway depth of the unknown waterway, DNN aims to predict the waterway depth information of the unknown waterway depth based on the waterway depth data of the known area. The prediction results ( ) can be expressed as Equation (5): where the waterway depth is , , and the latitude and longitude of the geographical position of the point is (a, b).
For the entire region of the prediction of depth data, the prediction is given by Equation (6) where n represents points corresponding to different geographic locations, and (a, b) represents the corresponding longitude and latitude. After preprocessing and analyzing the AIS data, the most important factor for the prediction capability of DNN is to discover the optimal hyperparameter values to achieve the best prediction The optimal attribute is selected by the preceding decision tree model to provide the appropriate input attribute parameters for the DNN. Then, we need to set the initial parameters of the DNN. By iteratively adjusting the weights w i and offsets θ i of the DNN, the prediction accuracy is maximized.
For the prediction of the waterway depth of the unknown waterway, DNN aims to predict the waterway depth information of the unknown waterway depth based on the waterway depth data of the known area. The prediction results y(x) can be expressed as Equation (5): where the waterway depth is x m a,b , and the latitude and longitude of the geographical position of the point is (a, b).
For the entire region of the prediction of depth data, the prediction is given by Equation (6): where n represents points corresponding to different geographic locations, and (a, b) represents the corresponding longitude and latitude. After preprocessing and analyzing the AIS data, the most important factor for the prediction capability of DNN is to discover the optimal hyperparameter values to achieve the best prediction performance of the entire network. We use the stochastic gradient descent [40] for the network optimization. The range of the network parameters can be specified as Equation (7): Appl. Sci. 2020, 10, 2770 where λ is the rate of learning, f is the activation function, N l is the number of iterations, n is the number of network layers, x m a+h,b+h is the prediction of depth model structure, and f l is the expression of each neural network.

Experimental Data
In order to test the proposed hybrid prediction of depth model, the data of Nantong Port in Jiangsu Province of China and Fangcheng Port in Guangxi of China were selected for analysis. Nantong (31 • 59 32" N, 120 • 49 46" E) is located near the east Yellow Sea, southwest of the Yangtze River, and it is an important coastal port of China. Nantong has both a natural inland river port and a deep-water coastal port. Fangcheng Port (21 • 44 58 ' N, 108 • 21 41 ' E) is a natural river port known as the largest port in western China. Figure 5 shows the satellite images of Nantong Portand Fangcheng Port.
performance of the entire network. We use the stochastic gradient descent [40] for the network optimization. The range of the network parameters can be specified as Equation (7) where λ is the rate of learning, f is the activation function, is the number of iterations, n is the number of network layers, + , + is the prediction of depth model structure, and is the expression of each neural network.

Experimental Data
In order to test the proposed hybrid prediction of depth model, the data of Nantong Port in Jiangsu Province of China and Fangcheng Port in Guangxi of China were selected for analysis. Nantong (31° 59′ 32" N, 120° 49′ 46" E) is located near the east Yellow Sea, southwest of the Yangtze River, and it is an important coastal port of China. Nantong has both a natural inland river port and a deep-water coastal port. Fangcheng Port (21° 44′ 58′' N, 108° 21′ 41′' E) is a natural river port known as the largest port in western China. Figure 5 shows the satellite images of Nantong Portand Fangcheng Port.  The AIS data of Nantong Portand Fangcheng Port were selected for these experiments. The entire AIS dataset contains two parts of the ships' dynamic data and static data. The waterway depth data used in the prediction of waterway depth model use the depth data provided by the Global Mapper (Blue Marble Geographics, Hallowell, ME, USA) to obtain the dataset corresponding to the latitude, longitude, and depth data through the latitude and longitude matching. Global mapper is the GIS software wherein depths are taken from ECDIS or other bathymetric sources. The waterway depth data at the missing latitude and longitude points are interpolated by three-dimensional interpolation using AIS data latitude and longitude data.
The density map for the Nantong Port dataset showing the most probable positions of ship along the sea fairway is shown in Figure 6.
Appl. Sci. 2020, 10, x 9 of 16 The AIS data of Nantong Portand Fangcheng Port were selected for these experiments. The entire AIS dataset contains two parts of the ships' dynamic data and static data. The waterway depth data used in the prediction of waterway depth model use the depth data provided by the Global Mapper (Blue Marble Geographics, Hallowell, ME, USA) to obtain the dataset corresponding to the latitude, longitude, and depth data through the latitude and longitude matching. Global mapper is the GIS software wherein depths are taken from ECDIS or other bathymetric sources. The waterway depth data at the missing latitude and longitude points are interpolated by three-dimensional interpolation using AIS data latitude and longitude data.
The density map for the Nantong Port dataset showing the most probable positions of ship along the sea fairway is shown in Figure 6.

Experimental Setting
We selected the AIS data of two waterways in order to predict the waterway depths for Nantong and Fangcheng ports. Through the integration of depth data acquisition and the AIS data, the data of waterway depth and AIS data of the two waterways were obtained as the data sources for this experiment. The total number of data points was 195,742, of which there were 137,006 training data points and 58,718 testing data points. The dataset was divided into training and testing datasets (70% of the data as a training data set, and 30% of the data as a testing data set). These data are used to train the DDTree model to predict the depth of waterway.
Before training, the data were labeled. The data in the waterway area were divided into 25 categories at two-meter intervals from 0 to 50 m depth, and the 26th label was used for depths >50 m. We chose this interval because most of the waterway depth data are around 30 to 40 m, and waterway depth data of less than 30 m and more than 50 m meters are limited.
The hyperparameters of the model (depth of decision tree, activation function, iterations, epochs) were adjusted, and the prediction of waterway depths under different hyperparameter values was recorded. After setting the parameters, the optimal parameters were selected, and the depth data were predicted by the DDTree model and compared with the actual waterway depth (ground truth). Using the model on the test data, the prediction accuracy and the error of the model were calculated.
The input data were classified using a decision-tree classification algorithm. The input parameters were six ship attributes of the dynamic data portion of the AIS data, including speed over ground, ship width, course over ground, draft, and ship steering rate. The processing times of the actual prediction of depth in the Nantong Port and Fangcheng Port using the trained model were 279 s, and 468 s respectively. Table 2 presents the prediction performance to evaluate the decision-tree parameters of the DDTree combination model in relation to the ground truth waterway-depth data.

Experimental Setting
We selected the AIS data of two waterways in order to predict the waterway depths for Nantong and Fangcheng ports. Through the integration of depth data acquisition and the AIS data, the data of waterway depth and AIS data of the two waterways were obtained as the data sources for this experiment. The total number of data points was 195,742, of which there were 137,006 training data points and 58,718 testing data points. The dataset was divided into training and testing datasets (70% of the data as a training data set, and 30% of the data as a testing data set). These data are used to train the DDTree model to predict the depth of waterway.
Before training, the data were labeled. The data in the waterway area were divided into 25 categories at two-meter intervals from 0 to 50 m depth, and the 26th label was used for depths >50 m. We chose this interval because most of the waterway depth data are around 30 to 40 m, and waterway depth data of less than 30 m and more than 50 m meters are limited.
The hyperparameters of the model (depth of decision tree, activation function, iterations, epochs) were adjusted, and the prediction of waterway depths under different hyperparameter values was recorded. After setting the parameters, the optimal parameters were selected, and the depth data were predicted by the DDTree model and compared with the actual waterway depth (ground truth). Using the model on the test data, the prediction accuracy and the error of the model were calculated.
The input data were classified using a decision-tree classification algorithm. The input parameters were six ship attributes of the dynamic data portion of the AIS data, including speed over ground, ship width, course over ground, draft, and ship steering rate. The processing times of the actual prediction of depth in the Nantong Port and Fangcheng Port using the trained model were 279 s, and 468 s respectively. Table 2 presents the prediction performance to evaluate the decision-tree parameters of the DDTree combination model in relation to the ground truth waterway-depth data. The results we contained with the baseline model, which used a traditional method of data interpolation. Table 2 reveals that the DDTree model prediction accuracy was the highest when the value of k was six, the depth of the decision tree was five, the iteration step was 200, and the epoch was 600. After selecting the appropriate neural network parameters, the waterway depth information of the Nantong Port and Fangcheng Port was predicted and analyzed by the model.

Prediction of Water Depth in Nantong Port
The results of comparative prediction and actual waterway depth results are shown in Figure 7. Figure 7a is an actual distribution map of waterway depth. Figure 7b shows the predicted waterway depth distribution. The distribution histogram of actual and predicted waterway depth of Nantong Port is presented in Figure 8. Table 2 reveals that the DDTree model prediction accuracy was the highest when the value of k was six, the depth of the decision tree was five, the iteration step was 200, and the epoch was 600. After selecting the appropriate neural network parameters, the waterway depth information of the Nantong Port and Fangcheng Port was predicted and analyzed by the model.

Prediction of Water Depth in Nantong Port
The results of comparative prediction and actual waterway depth results are shown in Figure 7. Figure 7a is an actual distribution map of waterway depth. Figure 7b shows the predicted waterway depth distribution. The distribution histogram of actual and predicted waterway depth of Nantong Port is presented in Figure 8.
We can see that the proposed model has accurate prediction, and there is an error in the depth data of individual shallows and other locations. The overall prediction accuracy is relatively high and can accurately reflect the distribution of waterway depth in Nantong Port.

Predicting of Water Depth in Fangcheng Port
First, the model used the Nantong Port data for the prediction of waterway depth in order to test whether the model had an overfitting situation, verify the ability of the model to predict waterway depth, and assess practicality of the model, but the depth data of other waterways can also be used with this model. Figure 9a,b shows the actual waterway depth and predicted waterway depth distribution for Fangcheng Port using the model trained on the Nantong Port data. We can see that the proposed model has accurate prediction, and there is an error in the depth data of individual shallows and other locations. The overall prediction accuracy is relatively high and can accurately reflect the distribution of waterway depth in Nantong Port.

Predicting of Water Depth in Fangcheng Port
First, the model used the Nantong Port data for the prediction of waterway depth in order to test whether the model had an overfitting situation, verify the ability of the model to predict waterway depth, and assess practicality of the model, but the depth data of other waterways can also be used with this model. Figure 9a,b shows the actual waterway depth and predicted waterway depth distribution for Fangcheng Port using the model trained on the Nantong Port data. Water depth (m) Figure 8. Actual and predicted water depth data from Nantong Port.

Predicting of Water Depth in Fangcheng Port
First, the model used the Nantong Port data for the prediction of waterway depth in order to test whether the model had an overfitting situation, verify the ability of the model to predict waterway depth, and assess practicality of the model, but the depth data of other waterways can also be used with this model. Figure 9a,b shows the actual waterway depth and predicted waterway depth distribution for Fangcheng Port using the model trained on the Nantong Port data.
The distribution histogram of actual and predicted waterway depth of Fangcheng Port is shown in Figure 10. This area accounts for the vicinity of the shoal. This part of the waterway depth is complex, leading to unsatisfactory predicting results. The prediction of waterway depth shows that the prediction of depth results of the DDTree model trained using the data from Fangcheng Port have good accuracy, and the predicted waterway depth is consistent with the actual waterway depth, indicating that the model was trained well. The accuracy of the prediction of waterway depth is high, and the model does not overfit; therefore, it can be used for the prediction of waterway depth.  The distribution histogram of actual and predicted waterway depth of Fangcheng Port is shown in Figure 10. This area accounts for the vicinity of the shoal. This part of the waterway depth is complex, leading to unsatisfactory predicting results. The prediction of waterway depth shows that the prediction of depth results of the DDTree model trained using the data from Fangcheng Port have good accuracy, and the predicted waterway depth is consistent with the actual waterway depth, indicating that the model was trained well. The accuracy of the prediction of waterway depth is high, and the model does not overfit; therefore, it can be used for the prediction of waterway depth.

Analysis of Results Using Residual Analys
In order to verify the prediction accuracy of the hybrid DDTree model and the prediction ability of the model, the residual values of different models were obtained by calculating the prediction time and the actual value. The smaller the residual value, the stronger the prediction ability of the model, and the model fully learns the characteristics of the data from the training dataset. The residual

Analysis of Results Using Residual Analys
In order to verify the prediction accuracy of the hybrid DDTree model and the prediction ability of the model, the residual values of different models were obtained by calculating the prediction time and the actual value. The smaller the residual value, the stronger the prediction ability of the model, and the model fully learns the characteristics of the data from the training dataset. The residual analysis plots were created from the two models. Figure 11a,b shows the distribution plots of residuals and the residual histograms.
is consistent with the actual prediction results and shows no bias. It can be seen from the results of two experiments that there are significant differences in the prediction of waterway depth results in the areas with small depths, such as near islands and shoal edges. The residual depth difference of this part is also significantly higher than other areas. In other deep-water navigation areas, the waterway depth predicted by the DDTree model is closer to the actual waterway depth, so the residual value of this part of the prediction of waterway depth result is smaller. In addition, in Figures  7 and 9, there are also obvious differences at the edges of the navigation areas at different depths where the residual value is large. The waterway depth information of this part of the channel tends to fluctuate greatly, and it is affected by the hydrological and meteorological conditions. Therefore, the prediction results using the DDTree model can show a certain degree of error. Figure 11b shows the sample distribution of the residual values. Most of the residual values are around 0. The residuals of the overall prediction of waterway depth are in accordance with the normal distribution, so the DDTree model can predict the waterway depth information of the waterway channel without bias. In Figure 11a, the waterway depth residual error value fluctuates around 0, which means that it is consistent with the actual prediction results and shows no bias. It can be seen from the results of two experiments that there are significant differences in the prediction of waterway depth results in the areas with small depths, such as near islands and shoal edges. The residual depth difference of this part is also significantly higher than other areas. In other deep-water navigation areas, the waterway depth predicted by the DDTree model is closer to the actual waterway depth, so the residual value of this part of the prediction of waterway depth result is smaller. In addition, in Figures 7 and 9, there are also obvious differences at the edges of the navigation areas at different depths where the residual value is large. The waterway depth information of this part of the channel tends to fluctuate greatly, and it is affected by the hydrological and meteorological conditions. Therefore, the prediction results using the DDTree model can show a certain degree of error. Figure 11b shows the sample distribution of the residual values. Most of the residual values are around 0. The residuals of the overall prediction of waterway depth are in accordance with the normal distribution, so the DDTree model can predict the waterway depth information of the waterway channel without bias. Figure 12 shows a difference plot between the ground truth depth map and predicted depth map for Nantong Port (Figure 12a) and Fangcheng Port (Figure 12b), respectively. One can note that the errors mostly concentrate on the boundaries of different seafloor depth zones. However, there are two areas with notable depth errors, which may be explained by the lack of AIS data in these areas.
We also compared and analyzed the traditional DNN model and decision tree model. Table 3 shows the prediction results of the model using different parameters. After the evaluation of the model, we chose the mean square error (MSE) as an indicator to calculate the deviation of the predicted value from the actual value, and reflect the prediction ability of the model. Table 3 shows a comparative analysis of the performance of the two models on the actual bathymetry data. residuals, (b) the residual histogram Figure 12 shows a difference plot between the ground truth depth map and predicted depth map for Nantong Port (Figure 12a) and Fangcheng Port (Figure 12b), respectively. One can note that the errors mostly concentrate on the boundaries of different seafloor depth zones. However, there are two areas with notable depth errors, which may be explained by the lack of AIS data in these areas. We also compared and analyzed the traditional DNN model and decision tree model. Table 3 shows the prediction results of the model using different parameters. After the evaluation of the model, we chose the mean square error (MSE) as an indicator to calculate the deviation of the predicted value from the actual value, and reflect the prediction ability of the model. Table 3 shows a comparative analysis of the performance of the two models on the actual bathymetry data.
In predictive analysis, the confusion matrix is a table composed of false positives, false negatives, true positives, and true negatives. The confusion matrix function evaluates accuracy by computing the confusion matrix with each row corresponding to the true class. In Equation (8), indicates the precision; R is the recall; and F is used to measure the value of precision and recall-it is the harmonic mean of precision and recall values. When = 1, score is calculated for performance evaluation.
In Table 3, MSE and the coefficient of determination ( ) are used to demonstrate the performance of the model; MSE is used to measure the prediction error of the model and is used to measure the accuracy of the prediction. We can see that the MSE value of the DDTree model is small and the value is close to 1. Besides, the score of the Hybrid DDTree model is 0.85 and the value of the deep neural network is 0.61, which means the hybrid DDTree model has better prediction performance. Thus, the waterway depth was predicted accurately using the hybrid DDTree Model. The hybrid DDTree model has a higher performance than the stand-alone DNN model. Table 3. Results of depth classification: mean square error (MSE) and coefficient of determination ( 2 ).

Deep Neural Network
Hybrid DDTree Model MSE 6.71 5.23  In predictive analysis, the confusion matrix is a table composed of false positives, false negatives, true positives, and true negatives. The confusion matrix function evaluates accuracy by computing the confusion matrix with each row corresponding to the true class. In Equation (8), P indicates the precision; R is the recall; and F is used to measure the value of precision and recall-it is the harmonic mean of precision and recall values. When α = 1, F 1 score is calculated for performance evaluation.
In Table 3, MSE and the coefficient of determination (R 2 ) are used to demonstrate the performance of the model; MSE is used to measure the prediction error of the model and R 2 is used to measure the accuracy of the prediction. We can see that the MSE value of the DDTree model is small and the R 2 value is close to 1. Besides, the F 1 score of the Hybrid DDTree model is 0.85 and the value of the deep neural network is 0.61, which means the hybrid DDTree model has better prediction performance. Thus, the waterway depth was predicted accurately using the hybrid DDTree Model. The hybrid DDTree model has a higher performance than the stand-alone DNN model.

Conclusions and Future Work
Depth of the waterway is a critical factor in the safety of marine traffic. However, due to the uncertainty of the waterway depth, it is difficult to obtain accurate waterway depth data in a timely manner. In this paper, we used the waterway depth data available from Global Mapper and the AIS data, together with the proposed DDTree model to predict the waterway depth data. The decision tree model is used to select the optimal attributes, and the AIS data are processed using the DNN. Then the neural network learns to perform inference reasoning on the sufficient waterway depth based on the ship's behavior (the ship did not go aground). The experimental results show that the DDTree model has higher accuracy for prediction of waterway depth and can complete prediction of waterway depth more efficiently. Thus, it can compensate for the lack of waterway depth data in marine transportation and provide for faster update of waterway depth data. The results for prediction of depth using DDTree had an accuracy rate up to 91.15%.
However, there are still some disadvantages of prediction of waterway depth through DNN, as the network parameters can influence the prediction ability of DNNs. In future research, firstly, the sources of waterway depth data will be expanded, and the prediction methods will be optimized. Secondly, we hope to get inspiration from [41] to improve the resource utilization of our proposed model and enhance the security and data processing capacity of the prediction of waterway depth. Besides, DNNs and large-scale flow data processing will be considered to optimize network parameters, and a visual module will be introduced to display the map of predicted depth data in real time, which will improve the safety of marine transportation and will provide a reference for the smart navigation of ships.