Skip to main content

A Hybrid Particle Swarm Optimization with Dragonfly for Adaptive ANFIS to Model the Corrosion Rate in Concrete Structures

Abstract

The use of reinforced concrete is common in marine structures. Failure of reinforcement due to corrosion has detrimental impacts on nearly all of these structures. Hence, proposing an accurate and reliable model was imperative. The goal of this paper is to develop a new hybrid model by combining Particle Swarm Optimization (PSO) with Dragonfly Algorithm (DA) for Adaptive Neuro-Fuzzy Inference System (ANFIS) to predict the corrosion current density (C11) of marine reinforced concrete. The neuro-fuzzy-based methods have emerged as suitable techniques for encountering uncertainties associated with the corrosion phenomenon in marine structures. To the best of our knowledge, this is the first research that predicts the C11 through a model integrating fuzzy learning, neural learning rules, and meta-heuristics. 2460 data are collected from 37 regions in Persian Gulf. The input parameters are age, concrete repairing history, height above the sea level, distance from sea, concrete compressive strength, rebar diameter, concrete cover depth, concrete electrical resistivity, chloride ion concentration and pH. The proposed rules for the estimation of C11 based on collected dataset are assessed based on the several metrics such as R2, efficiency, mean absolute percentage error (MAPE), and median of absolute error (MEDAE). According to the results, ANFIS-PSO–DA enables to predict C11 by R2 (0.92), MAPE (1.67), MEDAE (0.14), and EF (0.97). The results of sensitivity analysis revealed that concrete compressive strength and pH are the most effective parameters on the corrosion current density of reinforced concrete.

1 Introduction

The most accepted reason for the destruction of reinforced concrete (RC) structures is the corrosion of the steel rebar inserted in them. Corrosion can be activated by the entrance of any carbon dioxide or chloride ions into the concrete structure (Qiao et al., 2015; Taffese & Sistonen, 2017). The CO2 dissolves the protective layer on the rebar surface by a decrease in pH of surrounding concrete of reinforcement to below 9. The existence of chlorine ions above the critical concentration will also destabilize the protective film by attainment the surface of the steel rebars and occurrence of pitting corrosion (Doi et al., 2020; Paul et al., 2018; Rodrigues et al., 2020). These phenomena lead to the corrosion of the steel bar and consequently a reduction in its cross-sectional area, manifestation of cracks, and spalling in the concrete cover (Li et al., 2020). Thus, the mechanical properties of the rebar were reduced (Sun et al., 2018; Zhou et al., 2020) and this had a negative impact on the performance, safety, and durability of the reinforced concrete structure, which can cause the failure of the entire structure (Angst, 2018).

The total cost of corrosion damage in the world as reported by 2013 NACE impact study is about 4% of gross domestic product (GDP) (Salami et al., 2020). For example, in the case of a RC structure, the maintenance and repair cost of US highway bridges is about 8 billion US$ annually, while indirect costs are ten times higher due to the traffic delays and loss of efficiency (Hamidane et al., 2020). Therefore, predicting the service life and designing of the sustainable RC is a major challenge to avoid the cost of premature damage and extend the service life of such structures. In addition, this prediction can be utilized for maintenance and timely repair planning (Cai et al., 2020; Salami et al., 2020). Estimation and prediction rely gradually on analytical models based on in-service experience and observations. In general, the communication between the inferences attained from the experimental laboratory and the factual in-field behavior is weak (Melchers, 2020). In this study, we employed field data gathered from the corrosive area of the marine Persian Gulf to estimate the corrosion rate of steel reinforcement. Indeed, one of the significant factors that controls the destruction rate of RC structures is the corrosion rate.

In the recent decade, soft computing has become one of the most well-known intelligence techniques due to the lower cost and time consumption in different engineering sciences (Cai et al., 2019; Chung et al., 2018; Heo et al., 2018; Ryu et al., 2020; Shafaei and Khayati, 2020). Combining fuzzy logic with neural networks presents adaptive network-based fuzzy inference systems (ANFIS) to determine the superlative relationship among the input and output parameters in various processes. Fuzzy logic applies human reasoning in a mathematical framework and so can play a fundamental role in the relationship modeling of parameters. In ANFIS the training to find the adaptive parameters is very critical, which must be investigated. Therefore, artificial intelligence techniques and evolutionary methods are employed to determine the optimized parameters of the model.

Many researchers have utilized artificial intelligence in modeling of corrosion rate of steel rebar and service life prediction reinforced concrete structures (Anoop et al., 2002; Dey et al., 2019; Lv et al., 2020; Parthiban et al., 2005; Roxas and Lejano, 2019a; Sadowski and Nikoo, 2014; Salami et al., 2020; Taffese & Sistonen, 2017; Topçu et al., 2009). For example, Anoop et al. (2002) demonstrated the success of the fuzzy sets to predict the service life of reinforced concrete based on corrosion of reinforcement. Sadowski and Nikoo (2014) suggested a combination model based on ANN and imperialist competitive algorithm (ICA) to forecast the corrosion current density of steel in concrete. They employed ICA to optimize of ANN weights and compared ICA-ANN model with genetic algorithm. Their results revealed the high ability and accuracy of ICA-ANN model in forecasting of steel corrosion in comparison to the genetic algorithm. Roxas and Lejano (2019b) evaluated the performance of artificial neural network (ANN) in predicting the corrosion current density of steel rebar. Their results indicated a good prediction capability of neural network procedure. Lv et al. (2020) proposed a method based on particle swarm optimization support vector machine (PSO-SVM) and the grid search support vector machine (GS-SVM) for the prediction of steel corrosion rate of reinforcement. The authors found that the accuracy of PSO-SVM model is more than GS-SVM in the estimation of corrosion rate in steel.

Ansarinezhad and Shahbazian (2016) predicted corrosion rate of 3C steel by comparing hybrid ANFIS-PSO model with ANFIS-GA and SVR. Their results revealed that all the models have the capacity of predicting corrosion rate of steel. However, ANFIS-PSO model has a higher accuracy compared to ANFIS-GA and SVR. This is due to a combination advantage of both fuzzy logic with ANN and additionally benefit of inserting PSO algorithm.

To the best of our knowledge, different artificial intelligence approaches were used to predict the corrosion rate of steel. The main challenges of most literatures are the lack of appropriate operational data for modeling and the lack of use of appropriate strategies to distinguish effective interaction between selected features. The main contributions of current study can be considered as the usage of a sufficient field dataset and propose a new hybrid model based on the combination of PSO, dragonfly (DA) and ANFIS (PSO–DA–ANFIS) to model the corrosion rate of steel in concrete. In this regard, two main purposes are followed: (i) improvement of ANFIS performance for determining the corrosion current density as a criterion for corrosion rate using five optimization algorithms, namely, genetic algorithm (GA), particle swarm optimization (PSO), biogeography-based optimization (BBO), bat algorithm, hybrid particle swarm optimization (HPSO), improved genetic algorithm (IGA), hybrid dragonfly algorithm and particle swarm optimization (DA–PSO). (ii) Comparing the operation of available models, for predicting the corrosion current density in RC structures. The experimental results proved the ability of mentioned techniques in enhancing ANFIS accuracy for predicting the corrosion current density.

2 Background

Adaptive Neuro-Fuzzy Inference System (ANFIS) model typically consists of input–output variables, Fuzzy Inference System (FIS), which is used the Takagi–Sugeno rule types (Benzaouia & El Hajjaji, 2014; Lendek et al., 2010) and an Artificial Neural Network (ANN). In an ANFIS approach, adaptive and feed-forward network extracts fuzzy rules from inputs. Then, a hybrid learning procedure applies the parameters of fuzzy membership functions and tries to find correlations among the inputs and output according to the expert knowledge. Fig. 1 indicates the learning process of ANFIS with several nodes.

Fig. 1
figure 1

ANFIS structure (Karaboga & Kaya, 2019a).

For example, in Fig. 1, two inputs (x, y) and one output (z) are assumed. Then, a set with two different if–then fuzzy rules for a first-order Sugeno fuzzy model is expressed by Eq. (1):

$$\begin{gathered} {\text{Rule}}1:{\text{if}}\;x\;\text{is}\;A1\;\text{and}\;y\;\text{is}\;B1\;\text{then}\;Z1 = P1x + q1y + r1, \hfill \\ {\text{Rule}}2:{\text{if}}\;x\;\text{is}\;A2\;\text{and} \, y \, is\;B2\;\text{then}\;Z2 = \, P2x + q2y + r2. \hfill \\ \end{gathered}$$
(1)

The linguistic A1 and B1 variables perform the entries evaluation and a linear combination of the inputs with a constant value (r) defines the results of each rule (Karaboga & Kaya, 2019b).

2.1 Particle Swarm Optimization (PSO)

Generally, PSO is a meta-heuristic technique that is inspired by the movement of flocks and used for solving different optimization problems and with low calculations and acceptable results. Compared to existing optimization algorithms, PSO has a high learning speed, takes up less memory and is easy to execute (Bahiraei et al., 2021; Patwal et al., 2018). This algorithm is a population-based search technique in which every capability solution is signified as a particle in a populace (Pousinho et al., 2012). Each particle is considered as a solution and randomly moves in the search space. In PSO, particles move toward their best neighbors. In other words, each particle updates its position based on its best position and the best position adjacent to its neighbor. PSO randomly generates a particle set. If \(x_{i}^{t} = \left( {x_{i1}^{t} ,x_{i2}^{t} ,...x_{i\,n}^{t} } \right)\) and \(v_{i}^{t} = \left( {v_{i1}^{t} ,v_{i2}^{t} ,...v_{i\,n}^{t} } \right)\) show the position and velocity of the ith particle for tth iteration, respectively, then the particle updates its position for the t + 1 iteration based on Eqs. (2) and (3) (Alarifi et al., 2019):

$$v_{i}^{t + 1} = \omega \cdot v_{i}^{t} + c_{1} \cdot r_{1} (p_{i}^{t} - x_{i}^{t} ) + c_{2} \cdot r_{2} \cdot (g_{i}^{t} - x_{i}^{t} )\,{\text{with}} - v_{\max } \le v_{i}^{t + 1} \le v_{\max } ,$$
(2)
$$x_{i}^{t + 1} = (x_{i}^{t} + v_{i}^{t + 1} ),$$
(3)

where \(x_{i}^{t}\) shows the previous position of ith particle. \(p_{i}^{t}\) and \(g_{i}^{t}\) indicate the best position of ith particle and the best position found by particles, respectively. The two parameters r1 and r2 are random numbers in [0–1]. In addition, the cognitive coefficient, social coefficient, and inertia weight are defined by c1, c2, and ω, respectively. In standard PSO, the inertia weight is obtained as Eq. 4 (Basser et al., 2015):

$${\text{Interia}}\,\,{\text{weight}}\,\,\omega = \;\omega_{\max } - \left( {\frac{{\omega_{\max } - \omega_{\min } }}{{{\text{Itr}}_{\max } }}} \right) \times {\text{Itr,}}$$
(4)

where \({\omega }_{\mathrm{max}}\) and \({\omega }_{\mathrm{min}}\) signify the initial and final inertia weights, and also Itrmax and Itrmin are the maximum number of iteration and the current number of iteration.

2.2 Genetic Algorithm (GA)

GA is a heuristic optimization method based on Darwinian survival of the fittest and natural selection. It is inspired by nature and biological evolution. The algorithm begins by random initializing a population of individual solutions (chromosomes). Then GA operates on this population to produce better and better generation of it. The simple genetic algorithm follows stages: initialize population; calculate fitness, selection, crossover and mutation (Mirjalili, 2019).

It uses mutation and crossover operators for recombination and generating a population that has more properties related to the previous iteration. The fitness function is used for chromosome evaluation. Equation 5 indicates crossover formula (Katoch et al., 2021):

$$R = (G + 2\sqrt g )/3G,$$
(5)

where G represents the total number of evolutionary generation set by populace and g indicates the number of generations.

3 Problem Explanation, Material, and Methods

The data are collected from electric transmission tower foundations of 37 regions in the Persian Gulf. These foundations have a great impact on the stability and performance of the towers. Various examples of corrosion affected zones of these structures are shown in Fig. 2a–c. As illustrated the corrosion of steel rebar causes the concrete cover cracks.

Fig. 2
figure 2

Corrosion-affected zones of electric transmission tower foundations.

For investigating the corrosion behavior, different parameters were measured as construction date or age of concrete (C1, year), concrete repairing history (C2; 0: NO, 1: yes), height above the sea level (C3, m), distance from the sea (C4, km), concrete compressive strength (C5, N/mm2), rebar diameter (C6, mm), concrete cover depth (C7, mm), concrete electrical resistivity (C8, Ω m), chloride ion concentration (C9, ppm) in concrete, alkalinity of concrete (C10) and corrosion current density (C11, μA/cm2) of the rebar. The C6 and C7 were evaluated with an ultrasonic test. EN 13791 and ASTM C114 standards were used to determine the C5 and C9, respectively. The corrosion rate or C11 and C8 were measured by galvanostatic pulse technique (Fig. 3).

Fig. 3
figure 3

The measurement of corrosion rate by galvanostatic pulse technique (Beirami & Ehteshamzadeh, 2016).

ANFIS model adjusts two structural parameters (i.e., antecedent and consequent) based on the gradient-based methods. The main weakness of gradient-based methods is the slow convergence rate and the solution may trap local optimality (Salleh & Hussain, 2016). Metaheuristic optimization techniques (e.g., PSO and GA) can be used to overcome the issue of gradient-based methods. The optimal neuro-fuzzy system steps are defined as follows:

  1. 1.

    Define the train and test data;

  2. 2.

    Design a fuzzy system;

  3. 3.

    Tune the parameters of the fuzzy system based on meta-heuristic technique and error function;

  4. 4.

    Determine the best values of the ANFIS model.

3.1 PSO and GA Improvement Methods

PSO and GA are popular due to their simplicity and strong global search capability and the optimal answer is obtained in a relatively small number of iterations. Similar to most of the optimization techniques, GA and PSO algorithms can fall in the local optima. Especially for complex problem, the immature convergence of these algorithms have been observed. Thus, several modifications of PSO and GA are developed to enhance the ability to find global optimal value and jump out of local optimal solutions (Garg, 2015, 2018; Patwal et al., 2018; Serani et al., 2015).

An effective way is a hybridization with other meta-heuristics. For example, Garg (2016) presented a PSO–GA method where at first particles are updated based on standard PSO, and then some of them are selected for performing the crossover and mutation to increase the diversity of the population. Nevertheless, as the dimensionality of the problem increases, the likelihood of being trapped in the local optimization of the PSO–GA method increases. To solve this issue, in this paper a chaotic function is used in the searching step of particles. The details of the proposed HPSO are explained in Subsect. 3.1.1. Another hybridization version of PSO is introduced by Rafie et al. (Pellegrini et al., 2020) that uses the ability of the dragonfly algorithm to improve the local search capability.

Many works have been done to improve the searchability and diversity for genetic algorithms by modifying the genetic operators (selection, crossover, and mutation) (Metawa et al., 2017). For example, Song & Kusiak (2010) introduced a new ranking group selection operator that helps GA to generate a more feasible solution. In addition, the standard GA algorithm randomly initializes chromosome swarm and may fall into the local optimum during searching. To address this weakness, a chaotic initialization method is used.

3.1.1 Hybrid Particle Swarm Optimization (HPSO)

It can be seen that the inertia weight ω in Eq. (4) affects the convergence speed. As iterations number increases, ω decreases and leads to a decrease in the accuracy and speed of convergence in PSO. For setting the search step of particles so that they can better explore search space, a chaotic version of inertia weight is used. We consider the sin map (Griffin, 2013) function. The sine interval is defined between zero and one. The inertia weight ω is obtained based on Eq. (6) (Wang et al., 2020):

$$\omega = \overline{\omega }^{t} = A \cdot \sin (\pi \overline{\omega }^{t - 1} ),\;\overline{\omega }^{t} \in (0,1)\;,0 < A \le 1,\;k = 1,2, \ldots ,\max {\text{iter,}}$$
(6)

where k is the current iteration number.

After updating particles using Eqs. (2), (3), and (6), some particles are randomly selected to perform crossover (Prügel-Bennetf, 2001) and mutation (Hassanat et al., 2019) by the roulette wheel selection (Yu & Author, 2016). New populations are called Cpopulation and Mpopulation. The particles in Cpopulation and Mpopulation are evaluated, and then the appropriate particles are selected for the next iteration.

3.1.2 Hybrid Particle Swarm Optimization with Dragonfly (PSO–DA)

The hybrid algorithm, named PSO–DA, uses the dragonfly algorithm (DA) and PSO to solve the optimization problem. In general, PSO has good exploitation due to the best experience of the particles and so shows a quick converge rate. Nevertheless, PSO may fall in the local optima and show a quickly converge exploration ability. While DA improves the exploration ability and stochastic behavior using the Levy flight. Therefore, a hybrid algorithm (PSO–DA) combines the exploration of DA and the exploitation of PSO. In the first phase, DA defines the population of dragonflies to determine the region of the global solution. Then, the best position of DA is replaced with the global best position of PSO. Now, PSO performs the exploitation phase to determine the expected optimal solution. Therefore, the velocity of PSO is defined by Eq. (7) (Khunkitti et al., 2018):

$$v_{i}^{t + 1}\, = \,\omega \cdot v_{i}^{t} + c_{1} \cdot r_{1} (x_{{p{\text{best}}_{i} }}^{t} - x_{i}^{t} ) + c_{2} \cdot r_{2} \cdot (x_{{{\text{DA}}}}^{t + 1} - x_{i}^{t} )\,{\text{with}} - v_{\max } \le v_{i}^{t + 1} \le v_{\max } ,$$
(7)

where \(x_{{{\text{DA}}}}^{t + 1}\) shows the best position of DA for iteration t + 1.

3.1.3 Improved Genetic Algorithm (IGA)

In this modification of GA, a chaotic system [e.g., logistic map (Liu et al., 2013)] has been employed to generate the initial population. In other words, IGA replaces the standard initialization (i.e., uniformly distributed random numbers) with the chaotic system that shows more randomness and less regular behavior. In the chaotic initial population, each individual is defined based on Eq. (8) (Wang et al., 2020):

$$c_{i + 1} = \mu c_{i} (1 - c_{i} ),\,\,c_{i} \in [0,1],\,\,i = 0,1, \ldots ,D - 1$$
(8)

where ci indicates a set of chaotic numbers, µ shows the control factor, D denotes the dimension of the problem, and c0 \(\notin\){0, 0.25, 0.5, 0.75, 1}. Firstly, it defines a D-dimensional chaotic vector c = (c0, c1, …, cD−1), where c0 is obtained by random manner, and c1, …, cD−1 are created according to Eq. (8).

Secondly, it maps the chaotic vector (c) to search space [lb, ub]D based on Eq. (9) (Tang et al., 2010):

$$x_{i} = {\text{lb}}_{i} + c_{i} ({\text{ub}}_{i} - {\text{lb}}_{i} ),\;i = 0,1, \ldots ,\,\,D - 1,$$
(9)

where lbi and ubi are the upper and lower bounds for the ith dimension, respectively. Therefore, an individual x = (x0, x1, …, xD−1) is generated.

3.1.4 Modified Genetic Algorithm (MGA)

In the GA, the selection process has a great impact on the overall performance of the genetic algorithm. Since the mutation and crossover operators should choose one or more agents and the selection operator determines which agent can be selected as input to these two operators. Instead of using common selection methods where they require repeated comparison of the individual’s fitness, an alternative approach named ranking group selection (RGS) is employed (Rahimzadeh et al., 2021). RGS is as follows.

The size of the population is considered a multiple of 4 and the individuals are arranged based on fitness function F(x). For maximum and minimum value problems, descending and ascending order of F(x) is necessary, respectively. Assume that X = {X1, X2,…, Xn} is the initial population and the arranged population is \(X^{R} = \left\{ {X_{1}^{R} ,X_{2}^{R} , \ldots ,X_{n}^{R} } \right\}\), hence it satisfies \(F\left( {X_{1}^{R} } \right) \le F\left( {X_{2}^{R} } \right) \le , \ldots , \le F\left( {X_{n}^{R} } \right)\). XR is divided into four sets (i.e., \(X^{1} = \left\{ {X_{1}^{R} , \ldots ,X_{n/4}^{R} } \right\}\), \(X^{2} = \left\{ {X_{(n/4) + 1}^{R} , \ldots ,X_{n/2}^{R} } \right\}\), \(X^{3} = \left\{ {X_{(n/2) + 1}^{R} , \ldots ,X_{3n/4}^{R} } \right\}\), and \(X^{4} = \left\{ {X_{(3n/4) + 1}^{R} , \ldots ,X_{n}^{R} } \right\}\)) to enhance the diversity of the paired individuals. When two elements must be selected from four elements, there are six cases [i.e., (X1, X2), (X1, X3), (X1, X4), (X2, X3), (X2, X4), and (X3, X4)] is feasible. Now, RGS presents the paired groups of individuals as XA = {X1, X1, X1, X2, X2, X3}and XB = {X2, X3, X4, X3, X4, X4}. Fig. 4 indicates the RGS process when the population size is 8.

Fig. 4
figure 4

An example of the RGS process.

In Fig. 4, the population is first sorted and divided into four sets (i.e., X1, X2, X3, and X4). Then, the crossover is performed by selecting \(X_{1}^{R}\) in XA and \(X_{3}^{R}\) in XB, \(X_{2}^{R}\) in XA and \(X_{4}^{R}\) in XB, and so on. The individuals of XA are better than the individuals of XB in terms of fitness value.

3.2 Optimize ANFIS with PSO and GA Variants

In Fig. 1, ANFIS with two layers consists of adaptive parameters and other layers have fixed parameters. The adaptive parameters can be grouped as consequent and premise parameters. In the first layer, there are premise parameters that correlate to the membership functions like the Gaussian function (Eq. 10) (Sarkheyli et al., 2015):

$$\mu_{i} (x) = e^{{\frac{{ - \left( {x - c_{i} } \right)^{2} }}{{2a_{i}^{2} }}}} .$$
(10)

In Eq. (10), ai and ci show the premise parameters.

In this layer, the membership functions number is a function of linguistic terms number of inputs. For Gaussian function, the total number of premise parameters that must be optimized is obtained by Eq. (11) (Sarkheyli et al., 2015):

$${\text{TNP}} = 2 \times \sum\limits_{i = 1}^{n} {L_{i} } ,$$
(11)

where n indicates the number of inputs and Li shows the linguistic terms for input i. In the fourth layer, the consequent parameters of the fuzzy rules are determined. In Eq. (12), the total number of consequent parameters is determined (Sarkheyli et al., 2015):

$${\text{TNC}} = r \times (n + 1).$$
(12)

As can be seen, the effective and optimized finding of the premise parameters of layers 1 and 4 plays a key role in finding the best ANFIS model.

3.2.1 Initialization

Firstly, the proposed algorithm creates a basic fuzzy system according to the received training data. Secondly, the optimization algorithms determine the best values based on predefined objective functions, which is more explained in details in Subsect. 3.2.2. The general frameworks of the optimized ANFIS by PSO-based and GA-based variants are illustrated in Figs. 5 and 6.

Fig. 5
figure 5

PSO-based ANFIS flowchart.

Fig. 6
figure 6

GA-based ANFIS.

Fig. 5 shows the steps of PSO–DA and HPSO that have similar steps like initialization particles and loading dataset. But PSO–DA is a hybrid method where two methods particle swarm optimization and dragonfly are combined. In addition, HPSO adds two operators (i.e., crossover and mutation).

Fig. 6 indicates the steps of MGA and IGA. The difference between them can be seen in the initialization step, where IGA initializes its individuals with a chaotic function and MGA considers uniformly distributed random numbers. In the following steps, two algorithms perform crossover and mutation like standard GA. But MGA selects individuals by RGS technique (see Sect. 3.1.4) and IGA selects them by roulette wheel selection.

3.2.2 Individual Evaluation

It evaluates the position of each individual based on the objective function F. In this paper, two factors are used for evaluating each position. Firstly, the square of coefficient of determination \(\left( {R^{2} } \right)\) determines the percentage change in response due to the input variable X. Secondly, the root mean squared error (RMSE) shows the size of the errors in the regression model. Here, the objective function maximize R2 and minimize the RMSE value, respectively. R2 and RMSE are defined by Eqs. 13 and 14, respectively (Somu et al., 2020):

$$R^{2} = 1 - \frac{{{\text{SSE}}}}{{{\text{SST}}}} = 1 - \frac{{\sum\nolimits_{i = 1}^{n} {(y_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\frown}$}}{y}_{i} )} }}{{\sum\nolimits_{i = 1}^{n} {(y_{i} - \overline{y}_{i} )} }},$$
(13)
$${\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {y_{i} - \widehat{{y_{i} }}} \right)} } ,$$
(14)

where \(y_{i}\) indicates the observed amount and \(\widehat{{y_{i} }}\) shows the predicted amount for the ith case. In addition, \(\overline{y}_{i}\) is the average of observed values. The adopted fitness function combines the two criteria into one value by setting a weight factor in Eq. (15) (Somu et al., 2020):

$$F = {\text{minimize}}({\text{RMSE}} + w \times (1 - R^{2} )),$$
(15)

where w shows the weight factor, which has value in [0, 1]. The factor w is used to control the importance of the coefficient of determination and the size of errors. We divide the original dataset into two parts (i.e., training and testing) based on a tenfold cross-validation approach. The training part is applied to train a model based on the optimization process and the testing part evaluates the selected features.

4 Discussion of Results

This section is divided into two parts. Firstly, the performance of GA and PSO variants, which are explained in Sect. 3, is tested based on several test functions. In addition, the optimization results obtained by GA, PSO, and their variants including (i.e., HPSO, PSODA, IGA, and MGA) and other strategies (i.e., BBO and Bat) are discussed. Secondly, the ANFIS created by each algorithm is evaluated. Moreover, the sensitivity analysis is performed for the appropriate algorithm, which indicates in Sects. 4.1 and 4.2.

The setting of parameters of different methods in this paper is shown in Table 1 (Guo et al., 2020; Hayyolalam et al., 2020; Hong et al., 2019).

Table 1 Parameter setting.

4.1 General Experimental Setting

  1. 1.

    Test problems and dimension setting: To verify the performance of compared algorithms, ten test functions from CEC2005 (Liang et al., 2013) benchmark problems are used. According to their diverse characteristics, the problems can be divided into two kinds of optimization problems (Liang et al., 2013):

    • Unimodal problems (i.e., F1–F5).

    • Multimodal problems (benchmark).

    Table 2 shows the details of the test functions used.

    Table 2 Test functions description.
  2. 2.

    Experimental platform: All the experiments are run on a PC with intel core i5-10210U 2.11 GHz CPU and 8 GB memory.

  3. 3.

    Performance metrics: In this paper, convergence curve, trajectory, and distance to the best solution (\(\Delta_{x}\)) are used to investigate the performance of each algorithm. The convergence curve shows the best solution obtained by each algorithm throughout the iteration. Trajectory illustrates the location of the first agent at the first dimension in each algorithm. It is used to show whether the agent changes its location to explore the search space or exploit a specific location. The distance-based metric is used to show the algorithm-found solutions and analytical optima. It is calculated by Eq. (16) (Serani et al., 2016):

    $$\Delta_{x} = \sqrt {\frac{1}{{N_{{{\text{d}}v}} }}\sum\limits_{j = 1}^{{N_{{{\text{d}}v}} }} {\left( {\frac{{x_{j,\min } - x_{j,\min }^{*} }}{{R_{j} }}} \right)^{2} } } ,$$
    (16)

    where \(\Delta_{x}\) is a normalized Euclidean distance between the analytical optima (\(x_{\min }^{*}\)) and the optimal position that is obtained by the method (\(x_{\min }\)). \(R_{j} = \left| {u_{i} - l_{j} } \right|\) shows the range of variable j.

4.1.1 Analysis of Compared Methods on Benchmark Functions

In this subsection, the behavior of different methods for several test functions has been examined. Figs. 7 and 8 show the trajectory and convergence curve of different methods, respectively.

Fig. 7
figure 7figure 7

The trajectory of different compared methods.

Fig. 8
figure 8figure 8figure 8

Convergence curve of different compared methods.

From Fig. 7, it can be seen that most methods face abrupt fluctuations in less than 50% of the time (under 250 iterations) for Unimodal functions (i.e., F1–F5). This behavior is logical because Unimodal functions have one global optimal. In multimodal functions, this fluctuation is increased up to half of the time since the number of local optimal increases in these types of functions and hence subsequently the exploration activity is increased too.

We can observe that, unlike most algorithms where their fluctuations are gradually decreased throughout iterations, HPSO and Bat do not converge appropriately. The sine-map function that is used in HPSO and the frequency parameter that is used in the Bat algorithm for setting the value of weight can change the size of the search step irregularly. Therefore, agents take a long step in some iterations and their locations are very differently changed from their previous location. Then, they take a small step search to move near from current location with small change.

However, using a chaotic function increases the exploration ability of HPSO but causes the HPSO algorithm to face unbalancing between exploration and exploitation activity. To remove this drawback, evolutionary operators (e.g., crossover and mutation) are added but it cannot help HPSO to converge appropriately.

In Fig. 7, it seems that PSODA and MGA are two algorithms that have successfully provided a balance between exploration and exploitation. We can see that range of PSODA fluctuations are smaller than MGA. It is because the best solution is determined by DA in each iteration of PSODA. The DA has a good exploration ability and when it is combined with PSO, it can significantly improve the searchability. In general, it seems that PSODA and MGA are the best ones among other variants and can converge to the global optimal and escape from the local optimal.

Fig. 8 shows the best value obtained by each algorithm for different test functions. It can be seen that PSODA reaches the best value in Unimodal functions except in F1, which achieves the second-best result.

Fig. 9 indicates the average fitness of different methods. It shows the performance of agents in a method as a team unlike the convergence curve (Fig. 8), where it considers only the best agent in each iteration and plots its behavior. In general, it is expected that all agents eventually behave like the best agent and have a downward movement from the first iteration to the last one. It can be seen from Fig. 9 that HPSO agents don’t follow this term in functions F4 and F7, while the behavior of the best agent in Fig. 8 shows the downward movement. On the other hand, in the both of Figs. 8 and 9, the formation of BBO has a downward movement and all agents try to help each other to reach the best position. From the structure of HPSO and BBO and their results in Figs. 8 and 9, it can be concluded that if all agents in HPSO get an equal chance to perform crossover and mutation, most likely this algorithm can find a better solution. This result shows that PSODA has more exploitation ability compared to other methods and the RGS technique embedded in PSODA significantly improves the exploitation ability. In multimodal functions, all algorithms except HPSO converge to a near-optimal solution for F7 and F9. The HPSO suffers from an unbalancing between exploration and exploitation. The results of IGA and HPSO prove that using strategies that radically increase the exploration (e.g., chaotic functions) regardless of exploitation it’s not appropriate and can take the algorithm away from converging to the optimal solution.

Fig. 9
figure 9figure 9figure 9

The average fitness of different compared methods.

Table 3 shows the average and standard deviation of objective function achieved by the compared methods for different test functions. It can be interpreted that the Bat algorithm achieved the worst result compared to PSO, especially in multimodal functions. The standard Bat algorithm bears many benefits, along with the significant advantage is the fact that it can produce extremely fast convergence at an extremely major stage by transferring from adventure to exploitation. Nevertheless, when we permit the algorithm to switch to the exploitation stage much too immediately, it may result in stagnation after a certain first stage. Therefore, the nature of the Bat algorithm is such that it is expected to work better in Unimodal functions, which have one global optimal, compared to Multimodal functions, which have several local optimal.

Table 3 Average and standard deviation objective function values obtained by different methods.

The distance between the best solution was found by different methods and the analytical best solution of each test function can be seen in Fig. 10. In addition, the average and standard deviation of this distance value obtained by the compared methods after 30 runs can be seen in Table 4. It can be seen that the best solution that the PSODA reaches to them is near to the analytical best solution more than BBO’s best solution. This result indicates that the quality of solutions obtained by PSODA is higher than BBO.

Fig. 10
figure 10figure 10figure 10

Distance metric.

Table 4 Average and standard deviation of \({\Delta }_{x}\) obtained by the different methods.

One of the main weaknesses of BBO is that the migration operator, an immigration habitat, and an emigration habitat are selected, respectively, based on the immigration rate and emigration rate first, and then features are migrated from the emigration habitat to the immigration habitat. On one hand, any habitat has a chance to be an emigrating habitat. In other words, BBO uses global topology in migration operations. It needs to check (ps − 1) habitats for a given immigration habitat, which is computationally expensive. On the other hand, high-quality habitat has more chance to be an emigration habitat whereas poor habitat is easy to be an immigrating habitat. That is to say, the direction of migration operation is asymmetrical, which decreases population diversity. Unlike the BBO, in PSODA by using the ability of the DA algorithm in searching the different locations of solution space a good diversity in population can be created.

The compared approaches [i.e., ANFIS-MGA (Song et al., 2019), ANFIS-IGA (Hong et al., 2013), ANFIS-GA and ANFIS-PSO (Moayedi et al., 2020), ANFIS-HPSO (Chen et al., 2018), ANFIS-PSO–DA (Khunkitti et al., 2018)] are applied and evaluated to model constructed based on C1, C2, C3, C4, C5, C6, C7, C8, C9 and C10 as input parameters and C11 as output ones. For estimating the corrosion current density with the trained ANFIS models, the k-fold cross-validation technique has been used.

In K-fold cross-validation, the original dataset is divided into K sub-sets. Then, one sub-set is considered as validation data to test the model and the remaining sub-sets are applied for the training phase. This process is repeated K times so that each section is selected once as a validation set. The final result will be the average of K folds.

Table 5 summarizes the factors employed for comparisons of training algorithms of the ANFIS network. Considering the 2460 dataset, it has 10 features and so the number of input variables for ANFIS is equal to 10. On the other hand, each input variable has 14 membership functions and the membership functions are Gaussian. This type of membership function has two parameters for optimization and so, in total 2 × (10 × 14) = 280 premise parameters exist. There are 14 rules in the ANFIS model, each rule has 11 parameters for optimization (based on Eq. (15)), and so there are 11 × 14 = 154 consequent parameters. In general, the optimization problem has 280 + 154 = 434 parameters, which must optimize during the finding of best values as well as the best ANFIS model (or training process). In the following subsections, the result of meta-heuristic methods in terms of RMSE, MSE, R2, VAF, MEDAE, and the efficiency for solving optimization ANFIS parameters’ is presented.

Table 5 The properties of ANFIS structure.

4.1.2 Actual Train–Predict Train

Figs. 11 and 12 compare the predictive results that are regularly obtained by different ANFIS models and the actual data for training and testing datasets. The performance on testing and training datasets by different methods is divided into two parts: predict vs. actual value and absolute error part. We can see that using the GA, PSO and their improvements in the training process of ANFIS leads to robust training steps. Although both GA-based and PSO-based models are found satisfactorily capable of predicting corrosion rate, PSO-based models indicate superiority over GA-based in terms of accuracy and on average reduce absolute error by 17%. Among PSO-based, the absolute error of ANFIS-PSO–DA is the smallest, followed by ANFIS-HPSO, then ANFIS-PSO. In GA-based methods, the error value of ANFIS-IGA is the smallest, followed by ANFIS-MGA and ANFIS-GA.

Fig. 11
figure 11

Performance and absolute error of various approaches through training dataset.

Fig. 12
figure 12

Performance and absolute error of various approaches through testing dataset.

The main reason for this superiority is that the PSO-based method can better control individual search steps with inertia weight compared to GA. While GA focuses only on performing exploitation by crossover operator and exploration by mutation operator and so PSO-based model has a better balance between exploration and exploitation activity by moving individuals towards the best locations (i.e., global best and personal best).

4.1.3 Errors

Theoretically, a predictive model could be evaluated based on error mean and error standard deviation that are obtained based on Eqs. (17) and (18), respectively (Bemani et al., 2020):

$${\text{Error}}\,{\text{mean}} = \frac{{\sum\nolimits_{i = 1}^{n} {(y_{i} - \widehat{{y_{i} }})} }}{n},$$
(17)
$${\text{Error}}\,{\text{StD}} = \sqrt {\frac{{\sum\limits_{i = 1}^{n} {(E_{i} - \overline{E})} }}{n - 1}} ,$$
(18)

where E represents the error of predicting a value and \(\overline{E}\) indicates the mean error. The RMSE parameter indicates how much output of each meta-heuristic model is close to the real value. As it can be seen in Fig. 13, RMSE values of ANFIS-IGA, ANFIS-PSO–DA, ANFIS-MGA, ANFIS-GA, ANFIS-HPSO, and ANFIS-PSO have been calculated for the training steps 6.6586, 5.5479, 6.8101, 6.8564, 6.2857, and 6.3529, respectively. This shows that compared to other models, ANFIS-PSO–DA has the best performance while ANFIS-GA has the worst result for training step.

Fig. 13
figure 13

MSE and RMSE values of different methods for training dataset.

The ANFIS-PSO improves RMSE by 8% compared to ANFIS-MGA, on the other hand, ANFIS-MGA with changing the selection strategy from the standard selection techniques [i.e., roulette wheel selection (RWS) and tournament selection (TS)] to RGS improves the RMSE by 9.2% compared to standard GA. Therefore, this method gets better result compared to the standard GA but it cannot create suitable diversity in population over PSO method due to the lack of exploitation performance.

We know that determining the best model based on RMSE for training step is not enough, and it is necessary to check the RMSE level for testing step. According to the results shown in Fig. 14, the values of RMSE for ANFIS-GA, ANFIS-PSO–DA, ANFIS-MGA, ANFIS-A, ANFIS-HPSO and ANFIS-PSO are 5.1419, 2.6828, 5.5024, 5.603, 3.9554, and 4.141, respectively. Therefore, the best performance is achieved by ANFIS-PSO–DA and the worst model is ANFIS-GA.

Fig. 14
figure 14

MSE and RMSE values of different methods for testing dataset.

The error distribution plots in Figs. 13 and 14 show that most of the errors are concentrated near zero, specifically for PSO–DA.

Table 6 illustrates the standard deviation error and mean error achieved by compared methods for both training and testing datasets. It can be seen that HPSO, PSO–DA, and IGA predict output value higher than the actual value. In addition, PSO–DA achieves the low value for standard deviation error and so shows good stability and performance.

Table 6 Error mean and error std. for different methods.

4.2 Comparison of Other Methods

There are some common metrics, i.e., mean absolute percentage error (MAPE), square coefficient of determination (R2), the variance accounted for (VAF), and median of absolute error (MEDAE) for performance evaluation of different models. The square coefficient of determination (R2) is obtained based on Eq. (13). The MAPE, Efficiency, VAF, and MEDAE are defined as Eqs. (19)–(21), respectively (Umrao et al., 2018):

$${\text{MAPE}} = \frac{1}{n}\sum\limits_{j = 1}^{n} {\left| {\frac{{y_{j} - \widehat{y}_{j} }}{{y_{j} }}} \right| \times 100} ,$$
(19)
$${\text{Efficiency}} = \frac{{\sum\nolimits_{j = 1}^{n} {\left( {y_{j} - \overline{y} } \right)^{2} - \sum\nolimits_{j = 1}^{n} {\left( {\widehat{y}_{j} - y_{j} } \right)^{2} } } }}{{\sum\nolimits_{j = 1}^{n} {\left( {y_{j} - \overline{y} } \right)^{2} } }},$$
(20)
$${\text{VAF}} = \left[ {1 - \frac{{{\text{var}} (y - \widehat{y})}}{{{\text{var}} (y)}}} \right],$$
(21)
$${\text{MEDAE}} = {\text{median}}(y - \widehat{y}),$$
(22)

where y is the actual output for \(\widehat{y}\) that is predicted by the networks. \(\overline{y}\) is the mean of the whole (actual) outputs and n is the number of outputs.

PSO–DA achieves the highest value for R2 and efficiency. Also, it on average improves these indices compared to GA, MGA, IGA, HPSO, and PSO by 35%, 29%, 25%, 20%, and 22.5%, respectively. In addition, it obtains the lowest error among the other methods and on average reduces the error by 5% and 3% compared to PSO and HPSO, respectively. The main reason for this superiority is improved local search in PSO–DA with dragonfly levy flight technique compared to PSO and HPSO. This technique helps PSO–DA to explore wide search problem areas than the two other methods. It can be seen from Fig. 15b that PSO and HPSO achieve the same error in the last iteration while they have a great different error at iteration 100 (i.e., 15 for HPSO and 2.5 for PSO). But as the number of iterations increases, the influence of using the chaotic function in HPSO is more appeared. The chaotic function creates different search steps and so each individual in HPSO can take a small and big step, while an individual in PSO takes a steady step and different steps help HPSO to escape from local optimal.

Fig. 15
figure 15figure 15

Results for training dataset over 400 iterations.

The performance results of compared methods on the test dataset can be seen in Table 7. It is seen that IGA achieves the best result among all of the GA-based methods (i.e., MGA and GA). This superiority demonstrates the importance of definition for the initial population. In IGA, the population is initialized with a chaotic function, while in GA and MGA it randomly creates population. The chaotic technique helps IGA to a better spread individuals.

Table 7 Results for the test dataset.

To gain a better insight into the prediction success of models, the actual values were plotted against predicted ones. Figs. 16a–f and 17a–f show the results for the train dataset and test dataset, respectively. As can be observed from Fig. 16, for most methods the points are scattered around the fit line (the fit line represents the experimental data). In other words, there is a good adjustment between the outputs of models and the actual values in the train dataset. Fig. 16 shows all methods approximately learn well and can predict unseen data very well. Fig. 17 indicates the performance of compared methods on the test dataset.

Fig. 16
figure 16

Regression line plot for the train dataset.

Fig. 17
figure 17

Regression line plot for test dataset.

It can be seen that GA and MGA can predict output value as well as HPSO and PSO–DA in train dataset. Therefore, it shows that these methods (i.e., GA and MGA) get overfit and cannot be predicted these unseen data very well. In predicting of C11, the ANFIS-PSO–DA model presents a low error in comparison with the other models with having a correlation coefficient close to one.

4.3 PSO–DA ANFIS Information

In this section, the properties of PSO–DA as the best model among compared methods are explained and then the influence of a different number of control parameters (i.e., population size and iteration) and the number of membership functions are investigated. Fig. 18 shows the optimal value of the membership function.

Fig. 18
figure 18

Gaussian membership functions of input parameters after the training process by PSO–DA.

By finding optimal values for consequent parameters, the ANFIS fuzzy rules could be defined as fixed rules for testing the network. Table 8 illustrates 14 rules that are prepared using various combinations of linguistic terms after the training process. It is clear that fuzzy rules can be effective in determining the effect of parameters on model performance. Fig. 19 represents the data analyzing result for the sensitivity of this experimental case. We can see that C5 and C10 (each of them with around 30%) have more influence on C11, while C4 (with around 1%) has the least influence on that performance.

Table 8 The optimal consequent parameters after training.
Fig. 19
figure 19

Relationships between the input parameters and output parameter.

4.3.1 Investigation of PSO–DA ANFIS Performance

Three parameters for training ANFIS are available that should be adjusted. They are two common control parameters for PSO–DA (i.e., population size and the number of iteration) and the number of membership functions for ANFIS. The values of these important parameters have a major influence on the solution quality and execution time. Nevertheless, these values are usually determined empirically through pilot runs.

Table 9 shows the influence of setting the different number of membership functions for input variables. Accordingly, the best performance is achieved by setting the number of membership functions to 14. The number of variables to be optimized decreases as the number of membership functions decreases and so the execution time is reduced. Nevertheless, the efficiency and the accuracy of the model are also reduced. On the other hand, with more than 14 membership functions, it does not show significant improvement results while the execution time is increased.

Table 9 Influence of the various number of membership functions on PSO–DA ANFIS.

The influence of setting different sizes for population and the different number of iterations is shown in Table 10. It can be interpreted that the PSO–DA shows the best performance by setting maximum iteration to 400 and population size to 50 the PSO–DA. By setting population size lower than 50, the method face to lack of diversity and so convergence to local optimal. Also by setting the number of iterations lower than 400, the model doesn’t enough time to explore search space very well and hence convergences to the local optimal.

Table 10 Influence of various population sizes and iterations on PSO–DA ANFIS.

4.3.2 Sensitivity Analysis

The sensitivity analysis is used to rank the effect of each parameter on C11 of steel in concrete. The results of this analysis can extend the useful service life of the RC structures in terms of sustainable new structure’s design and optimization of renovation and maintenance of such structures (Zhang & Lounis, 2006). As seen from Fig. 19, the importance of each input parameter in determining the corrosion rate of steel in descending order (from top values to low values) is as C5, C10, C8, C7, C6, C2, C9, C3, C1, and C4. According to results, C11 is inversely related to C5, C10, C8, C6, C2, C3, and C4, and decreases with increasing their proportion. But C11 is directly related to the C7, C9 and C1. In the following, the sensitivity of major input parameters to the C11 as well as the service life of RC structures will be discussed.

4.3.2.1 Sensitivity of C 11 to the Compressive Strength of the Concrete

C11 depends on C5 (Stewart & Mullard, 2007) and compressive strength relates to the permeability of concrete. By decreasing the ratio of water to binder in the concrete, the concrete permeability decreases, and consequently the compressive strength of concrete will increase. Concrete with a high compressive strength extends the time for chlorine ions to penetrate. As a result, the incubation time to start the corrosion and service life of structures will be longer (Topçu et al., 2009).

4.3.2.2 Sensitivity of C 11 to C 10

The rate of steel corrosion decreases with increasing pH. Corrosion of concrete reinforcement occurs very slowly in normal circumstances due to the high pH within the concrete. In fact, high pH of the pore solution and passivation of the reinforcement can change by occurring some unfavorable chemical processes, which leads to an increase in the corrosion rate of the steel (Garcés et al., 2011). The main reason for reducing the pH of the pore solution is the carbonation of the concrete, leading to the cracking and dissolution of the passive layer of the rebar. In fact, carbonation reduces hydroxyl ions in the pore solution of alkaline concrete and thus leads to a decrease in concrete pH (Otieno et al., 2019).

4.3.2.3 Sensitivity of C 11 to C 8

The electrical resistance of a substance is its ability to resist the flow of free electrical charges when exposed to an electric field (Alhajj et al., 2019). Concrete resistivity (ρ, Ω m), also known as electrical resistivity, is mainly affected by the water to cement ratio, the aggregates size, binder type, and conditions of the concrete preparation (Rodrigues et al., 2020). Concrete electrical resistivity is one of the important parameters for evaluating steel reinforcement corrosion. It is now generally confirmed that as the concrete resistivity increases under the normal environmental circumstances, the corrosion current density and consequently the corrosion rate decreases (Hornbostel et al., 2013; Wang et al., 2019).

4.3.2.4 Sensitivity of C 11 to C 7 and C 6

Concrete cover acts as a fence against the penetration of aggressive materials, which are necessary to initiate and stabilize the corrosion process (Muthulingam & Rao, 2014). Corrosion current density depends on the thickness of the concrete cover (Wang et al., 2019), and as the thickness of the cover increases, more corrosion is necessary to start the cracking in the surface of the concrete (Otieno et al., 2019). Because of the presence of thicker concrete cover, more tensile energy is required to the manifestation of cracks in the concrete (Zhao et al., 2011). It should be noted that the influence of cover depth on the corrosion rate of steel can be different after the depassivation of steel. In this case, due to a better electrical potential circulation in concrete with a thicker cover, the corrosion rate of the reinforcement will increase (Pour-Ghaz et al., 2009).

In general, a larger diameter of steel bars generates more tensile energy in concrete. As a result, a crack can initiate more easily in a larger diameter steel bar on the concrete cover than the smaller one (Zhao et al., 2011). But after the corrosion propagation, the diameter of the rebar decreases with increasing the corrosion current density according to Eq. 23 (Molina et al., 1993):

$$\varphi (t) = \varphi (i) - 0.023I_{{{\text{corr}}}} t,$$
(23)

where φ(t) is the diameter of rebar (mm) at time t, φ(i) is the rebar diameter (mm) at the start of service life, \({{I}}_{\text{corr}}\) is corrosion current density (μA/cm2), t is the time passed (years) from the start of the propagation period and 0.023 is the conversion factor for uniform corrosion.

4.3.2.5 Sensitivity of C 11 to C 9

Concrete can be contaminated with chlorine ions through exposure to a marine environment, the use of deicing salts, or even during the manufacturing of concrete. Chloride penetration generally takes place by suction and diffusion via the capillary pores. Therefore, the time to the corrosion initiation is highly related to the diffusion coefficient of chloride in concrete. Diffusion is influenced by the size distribution of the pores in the concrete, which is related to the ratio of water to cement. The presence of defects such as cracks in the steel–concrete interface provides easier and faster paths for chlorine penetration (Rodrigues et al., 2020). The existence of chloride ions above the critical level (more than 0.4% compared to the mass of cement) in RC structures, around the steel reinforcement, makes the passive state unstable and thus causes the corrosion process to begin (Medeiros et al., 2013). In fact, the existence of chloride ions destabilizes the passive layer of reinforcement by the occurrence of pitting corrosion (Li et al., 2020). Thus, as the chloride concentration increases, the corrosion rate will increase.

4.4 Comparison of the Effect of Selected Variables on C 11

Observing how the operating parameters change and their effect on the corrosion rate can provide a good view of the interaction between the parameters. As an example, Fig. 20a–g shows these binary changes for some of selecting parameters. It should be noted that the values of other parameters are considered to draw each graph equivalent to the sample with the lowest corrosion rate. As shown in the guide bar, the blue region indicates the conditions with less corrosion current density, and yellow indicates the conditions with higher corrosion rates. These diagrams are a good guide for selecting the best parameter values in specific operating conditions.

Fig. 20
figure 20

Binary changes for some of selecting parameters: a C10C5, b C9C3, c C9C1, d C9C4, e C3C1, f C3C4, and g C1C4.

5 Summary and Conclusions

Effective construction, maintenance of concrete reinforced structures located at coastal areas has an effective role in sustainable development. Therefore, accurate prediction of the corrosion rate of these structures is very important. In the first step, Particle Swarm Optimization with Dragonfly Algorithm (PSO–DA) is introduced to enhance stochastic behavior using the Levy flight. In the second step, the PSO–DA algorithm is utilized to optimize Adaptive-Network-Based Fuzzy Inference System (ANFIS). Finally, a novel hybrid model is proposed for forecasting corrosion current density, based on the combination of PSO–DA and ANFIS.

A real test case dataset including 2460 cases collected from 37 regions in the Persian Gulf is considered to investigate the effectiveness of the proposed hybrid modeling technique. The input parameters of the model included the concrete characteristics and the operational condition of the structure. Different meta-heuristic optimization techniques (i.e., GA, PSO, HPSO, IGA, PSO–DA, BBO, and Bat) are used to predict C11 as a criterion of corrosion rate. From the experiment results, the following conclusions are drawn.

  1. 1.

    The convergence curve and trajectory for 10 Unimodal and Multimodal problems proved the PSO–DA can converge to the global optimal and escape from local optimal;

  2. 2.

    The proposed method (PSO–DA) can improve the performance of ANFIS and find the optimal values of adaptive modeling parameters;

  3. 3.

    The optimal ANFIS topology based on PSO–DA can provide a balance between the exploitation/exploration abilities through modeling and optimizing;

  4. 4.

    Comparison of the prediction results of the employed models shows that the predictive performance of the proposed model is high and the error level is low (R2 = 0.92, MAPE = 1.67, MEDAE = 0.14, EF = 0.97);

  5. 5.

    The proposed model fulfills the goal of the present study to predict the corrosion current density of reinforced concrete structures in the coastal areas by reducing the error of generalization and optimizing the performance of the ANFIS model;

  6. 6.

    The efficient rules are constructed by various combinations of linguistic terms for the prediction of corrosion current density;

  7. 7.

    A useful model was proposed by as new approach to the current international corrosion management models;

  8. 8.

    Scientific and industrial aspects were considerd to predict the corrosion current density in marine concrete structures;

  9. 9.

    The sensitivity analysis revealed that C5, C10, C8, C7, C6, C2, C9, C3, C1, and C4 are the effective factors on the C11 in reinforced concrete structures. Knowing the extent and order of effectiveness of the parameters, played an important role in improving the performance of such structures.

  10. 10.

    Proposed rules can be used as criteria for risk based inspection, enhanced the documentation and data management, and considered as a new approach to expressing the importance of recording quality with industrial data and documentation.

For future work, parallel computing will be applied in the ANFIS model to reduce the computational time. Also, we would like to consider the impact of other meta-heuristics algorithms in ANFIS training. Further research to be undertaken to use a complex machine learning approach (e.g., deep learning) for corrosion rate prediction.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Download references

Acknowledgements

Not applicable.

Funding

This project was supported by Niroo Research institute (NRI), info@nri.ac.ir.

Author information

Authors and Affiliations

Authors

Contributions

GRK designed the model and the computational framework and analyzed the data. ZR performed literature survey regarding RC structure. GKH and ZR wrote the manuscript. ME supervised the work. HB performed the measurements. All authors read and approved the final manuscript.

Authors’ information

Gholam Reza Khayati is an Associate Professor in the School of Materials Science and Engineering at Shahid Bahonar University. Zahra Rajabi is a Ph.D. student in the school of Materials Science and Engineering at Shahid Bahonar University. Maryam Ehteshamzadeh is a Full Professor in the School of Materials Science and Engineering at Shahid Bahonar University. Hadi Beirami is researcher in Metalnastri Anticorrosion Systems, Via Padova, 3, 20063, Cernusco sul Naviglio, Milano, Italy.

Corresponding author

Correspondence to Zahra Rajabi.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Journal information: ISSN 1976-0485 / eISSN 2234-1315

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khayati, G.R., Rajabi, Z., Ehteshamzadeh, M. et al. A Hybrid Particle Swarm Optimization with Dragonfly for Adaptive ANFIS to Model the Corrosion Rate in Concrete Structures. Int J Concr Struct Mater 16, 28 (2022). https://doi.org/10.1186/s40069-022-00517-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40069-022-00517-9

Keywords