Leveraging Urban Growth Models (UGM) for Sustainable Urban Planning and Climate Resilient Cities: A Bibliometric Analysis

Rapid technological, social, political, demographic, and urban developments continue to change the scale of cities worldwide. It is therefore essential to model urban growth to minimize future uncertainties, and critical to investigate how cities grow and which factors researchers identify as influencing that growth. Using bibliometric methods, this study identifies the publishing patterns and expansion potential of Urban Growth Modelling (UGM) research, providing a better understanding of the field and possible future research paths. All published articles on the 'Urban Growth model' in Scopus were identified and analyzed using the bibliometrix R package and VOSviewer software. A total of 218 publications from 1983 to 2023 were identified, published in 103 journals and 25 book chapters and contributed by 557 authors, with a Collaboration Index of 2.7 and 2.56 authors per document. The high-frequency keywords used in recent years are urban growth, land use, remote sensing, urban planning, cellular automaton, urban development, modeling, urbanization, Geographic Information System (GIS), and land use change. Research papers published in Computers, Environment, and Urban Systems are the most cited, with 1047 total citations and an h-index of 13. The most active country was China, with a total of 38 documents. The most cited paper for UGM research is 'Modeling urban growth in Atlanta using logistic regression', which received over 378 citations. The study's findings offer milestones, a starting point for important research productivity data, and an understanding of how UGM research has evolved. This will help to estimate and assess the rate and location of urbanization and its consequences before and after development, before cities become locked into unsuitable and unsustainable pathways.


INTRODUCTION
As per the World Urbanization Prospects (WUP) Report 2018, 41% of the population of lower-middle-income countries lived in cities, a share expected to rise to 59% by 2050. Although the pace of urbanization in lower-middle-income countries is slower than in developed countries, these countries are expected to have the fastest urban growth between now and 2050. This trend will undoubtedly continue in the future. [1] Rapid technological, social, political, demographic, and urban developments have contributed to changes in the scale of major cities worldwide. Many factors have triggered these developments, e.g., the industrial revolution, capital gains, and population growth, all of which have played an essential part in the urbanization of cities. [2] This unprecedented urbanization is due to the most significant demographic transition of this century, from rural to urban areas, and to the growth of peri-urban regions driven by economic opportunities and people's choice to live near urban areas. [3,4] The inevitable outcome of this transition is the spatial extension of towns and cities beyond their limits and into their peripheries and hinterland to accommodate the growing population (Mohammadian, Tavakoli, and Khani, 2017). This rapid expansion of urban areas has placed high pressure on urban land resources. [8][9][10][11][12][13] This massive and complex city system involves many interactions between diverse agents and elements. [14] Therefore, researchers believe that modeling these factors is critical to better understanding the spatial pattern, mechanism, and effects of urban expansion, which will help frame effective land-use policies. [6,15] Urban growth pattern analysis helps to identify the potential urban growth drivers that lead to the spatial expansion of areas beyond their jurisdictional limits. Urban growth factors vary widely among empirical studies.
[6,16] Therefore, the urban system requires integrated tools that help to guide and forecast urban growth, given its complex and dynamic nature. [17][20][21] Because some of these models cannot incorporate socioeconomic variables, do not indicate multicollinearity among independent factors, and use only relative weights, scholars have integrated two or more models to enhance efficacy and generate reliable urban growth models and growth patterns. Furthermore, some researchers have pointed out that bio-inspired algorithms can simulate and predict complex urban growth processes. [22,23] The Cellular Automata (CA) model, one of the bio-inspired models, is considered the first analytical model in urban growth prediction and is widely used in studies that evaluate complex urban development. However, CA models can simulate nominal data but not the quantitative driving forces of urban expansion. If CA is combined with other spatial-quantitative models, such as the Markov Chain (MC), the Multi-Criteria Evaluation (MCE) technique, Logistic Regression (LR), the Land Transformation Model (LTM), and Agent-Based Modeling (ABM), more accurate predictions and modeling become possible. Generally, MCE techniques have been used primarily to rank different spatial factors. However, the value of each spatial element is calculated based on the experience of decision-makers. [24] As a result, land use patterns proposed from different combinations of weights can be highly disparate, because the dynamic characteristics of spatial factors often go unobserved. The Weight of Evidence (WOE) method was therefore developed to overcome this disadvantage of the MCE method, because it depends chiefly on the nature of the data rather than on the subjective decisions of individuals.
[25] The ABM can model the actions or decisions of individual agents and model processes based on influencing criteria closely related to a particular agent. Consequently, new factors must be loaded into the system each time the environment is simulated. It also assumes that agent behavior will remain the same in the prediction year, which limits the simulation when conditions change through interaction with other social, economic, political, and environmental factors. [26] Furthermore, over the last 16 years, SLEUTH (Slope, Land use, Exclusion, Urban, Transportation, and Hillshade), a growth-oriented model, has been successfully applied worldwide to forecast Land Use Land Cover (LULC) change. However, this model does not provide information regarding human behavior and ecological impacts. Another drawback is that the simulation is constrained by particular variables that cannot be altered or developed. [27] Artificial Neural Networks (ANN), Decision Trees (DT), and Support Vector Machines (SVM) are examples of Machine Learning (ML) models that are widely used in land-use applications. They can learn and fit the complex nature of components by iterating over training and target data when the problem comprises large amounts of input and output data.
[14,28,29] ML models are often combined with CA, LR, and FR models to increase simulation accuracy. Simulating or modeling urban land expansion cannot be viewed as a linear problem because it involves complex interactions. Hence, various heuristic bio-inspired algorithms, such as the Genetic Algorithm (GA), [30,31] Simulated Annealing (SA), [32] Particle Swarm Optimization (PSO), [33][34][35] Ant Colony Optimization (ACO), [36,37] the Artificial Fish Swarm Algorithm (AFSA), [38] the Memetic Algorithm (MA), [39] the bat-inspired algorithm, [40] Artificial Bee Colony (ABC), and the shuffled leaf algorithm, have been designed to achieve the optimal pattern of urban land use based on particular objectives and within required constraints. Given the multivariable nature of the problem and the many modeling options, it is necessary to present research trends in this area in a classified way so that researchers can make an informed choice when selecting an appropriate growth model for their studies. Bibliometric methods make it possible to showcase these research trends.
Given the extensive research on urban growth modeling conducted worldwide, this study concentrates on presenting the findings on a single platform. The main objective of this research is to provide a thorough analysis of the urban growth modeling techniques indexed in the Scopus database from 1983 to 2023, in order to estimate and assess the rate of publication on urban growth modeling and to identify future research directions and techniques for planned and controlled urbanization, its location impact, and the effects before and after development, before growth becomes stuck on unsuitable and unsustainable paths. This paper presents a bibliometric analysis of urban growth modeling to explore the various urban growth models used over a forty-year span and to characterize the most cited documents, publications by country, the most cited journal sources, keyword occurrence, the conceptual structure map of keywords, scientific collaboration among authors, thematic evolution of keywords over four sub-periods, and collaboration between countries. The main contribution of this study is to provide insights into the published urban growth models, with their pros and cons, based on various machine learning techniques indexed in the Scopus database from 1983 to 2023.

DATA AND METHODS
Numerous methods exist for a literature review, including systematic literature review, content analysis, meta-analysis, and bibliometric analysis. To understand the impact of an article, bibliometric analysis includes statistical analysis of published articles and quantification of article citations. Bibliometric analysis is a primary requirement for a good, high-quality literature review. It is a highly effective method for in-depth review and study of academic publications that employs mathematical and statistical techniques. Bibliometric analysis is a numerical examination of various types of literature on a specific topic, covering all aspects from which one can obtain information, such as leading authors, countries, journals, funding organizations, publishers, publisher city, Web of Science index, and research area.

Data Source and Research Platform
The Scopus database was chosen because it combines expertly curated abstract and citation databases with enriched data and links scholarly literature from various disciplines. The search term was "urban growth modeling", which included many different search paths. Initially, 301 results were discovered. After applying filters for the English language and for subject areas related to social science, environmental science, earth and planetary science, computer science, engineering, multidisciplinary, and decision science, 218 papers were found to be pertinent. These data were then converted into the raw data formats that serve as input to the bibliometric analysis. Figure 1 shows the methodology adopted for the study.
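The screening step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' actual workflow: the record layout (`title`, `language`, `subject_areas`) and the sample records are hypothetical, and only the filter criteria come from the text.

```python
# Subject areas named in the text as inclusion criteria.
ALLOWED_AREAS = {
    "social science",
    "environmental science",
    "earth and planetary science",
    "computer science",
    "engineering",
    "multidisciplinary",
    "decision science",
}


def screen(records):
    """Keep English-language records in at least one allowed subject area."""
    kept = []
    for record in records:
        areas = {area.lower() for area in record["subject_areas"]}
        if record["language"].lower() == "english" and areas & ALLOWED_AREAS:
            kept.append(record)
    return kept


# Hypothetical records, for illustration only (not taken from the dataset).
sample = [
    {"title": "A", "language": "English", "subject_areas": ["Engineering"]},
    {"title": "B", "language": "French", "subject_areas": ["Engineering"]},
    {"title": "C", "language": "English", "subject_areas": ["Medicine"]},
    {"title": "D", "language": "English",
     "subject_areas": ["Medicine", "Social Science"]},
]
screened = screen(sample)
```

Applied to the real export, the same two conditions would reduce the 301 initial hits to the 218 documents analyzed here, provided the export uses comparable language and subject-area fields.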
The bibliometric analysis was carried out with Biblioshiny, a web-based data analysis framework written in R. With raw data from Scopus or Web of Science, this platform supports a variety of bibliometric analyses, such as statistical analysis, data preprocessing, co-occurrence matrix construction, co-citation analysis, coupling analysis, co-word analysis, thematic map evaluation, factorial analysis, historiography, collaboration networks, collaboration world maps, and cluster analysis.

Characteristics of the Meta Data
The retrieved documents were published between 1983 and 2023 in 128 sources by 557 authors, with a Collaboration Index of 2.70. According to the analysis, the total citation score for all documents was 6,078. The majority of the documents, 156 (71.56%), were articles, followed by 41 conference papers (18.80%) and 17 book chapters (7.79%), as shown in Table 1. Figure 2 depicts the annual trends in publications and citations over the study period.
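The summary indicators reported in this paper follow from simple ratios of the counts above. The short Python check below reproduces them from the reported totals; the variable names are illustrative. The Collaboration Index is not recomputed because it requires the number of multi-authored documents, which is not stated here.

```python
# Counts as reported in the text.
documents = 218
authors = 557
total_citations = 6078
doc_types = {"article": 156, "conference paper": 41, "book chapter": 17}

# Authors per document and average citations per document.
authors_per_document = authors / documents
citations_per_document = total_citations / documents

# Share of each document type, in percent of all 218 documents.
shares = {name: 100 * count / documents for name, count in doc_types.items()}

print(round(authors_per_document, 2))    # 2.56
print(round(citations_per_document, 2))  # 27.88
print({name: round(share, 2) for name, share in shares.items()})
```

Rounded to two decimals, the conference paper and book chapter shares come out as 18.81% and 7.80%; the 18.80% and 7.79% quoted in the text appear to be truncated rather than rounded.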

Most Cited Documents
The top 10 cited documents are presented in Table 2. The most cited paper for UGM research is 'Modeling urban growth in Atlanta using logistic regression', which received over 378 citations. That study found that, despite lacking temporal dynamics, logistic regression was spatially explicit, suited to multi-scale research, and, most crucially, allowed a far better comprehension of the processes influencing the development and growth of the urban spatial pattern. [42] It is followed by 'Urban Growth Pattern Modeling: A Case Study of Wuhan City, PR China', with 358 citations. [43] The latter study proposed an approach to modelling urban growth patterns. [44]

Output analysis of publications by the top 10 countries
A total of 45 countries contributed to the 218 papers. China tops the list with 38 publications, followed by the United States of America (USA) with 32 papers and India with 18 papers. These three countries are the most productive in UGM research. Additionally, China ranks first in both the number of documents and citations (Table 3).

Output analysis of the top 10 most cited journal sources
The 218 documents on UGM were published in 103 journals, and the top 10 journals by number of publications are listed in Table 4. Computers, Environment, and Urban Systems published the largest number of papers on urban growth modeling (15), followed by ISPRS International Journal of Geo-Information with nine papers and International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives with six papers.

Keyword Occurrence Analysis
Keyword occurrence analysis identifies the keywords used in the titles of published research papers by authors working in urban growth modeling worldwide. Figure 3 depicts the result of the Keywords Plus analysis, which reveals the most used keywords in the UGM literature and thereby enables the identification of research themes and topics that have been heavily studied and documented in the Scopus database during the past 40 years.
The keyword occurrence analysis shows that the most frequently used keywords were "urban growth" (182 times), "land use" (55 times), "remote sensing" (44 times), "urban planning" (43 times), "cellular automaton" (41 times), "urban development" (39 times), "modeling" (38 times), "urbanization" (36 times), "GIS" (29 times), and "land use change" (29 times). Further, in recent work the keyword "urban growth" occurs mainly with cellular automaton, GIS, and urban planning. This demonstrates researchers' interest in utilizing cutting-edge methods and technologies to simulate urban growth for improved land use and urban planning. Even so, there is little interest in using artificial intelligence and machine learning; potentially, this will be the focus of future studies.
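As a rough illustration of how such occurrence counts arise, the Python sketch below tallies keywords over a list of documents. It is an assumption-laden example rather than the Biblioshiny implementation: the input format (one keyword list per document) and the sample keywords are hypothetical, and each keyword is counted once per document after lowercasing.

```python
from collections import Counter


def keyword_counts(documents):
    """Count in how many documents each normalised keyword occurs."""
    counts = Counter()
    for keywords in documents:
        # A set ensures a keyword repeated within one document counts once.
        counts.update({kw.strip().lower() for kw in keywords})
    return counts


# Hypothetical keyword lists, for illustration only.
docs = [
    ["Urban growth", "GIS", "Cellular automaton"],
    ["urban growth", "Remote sensing"],
    ["Urban Growth", "GIS", "gis "],
]
counts = keyword_counts(docs)
top_two = sorted(counts.items(), key=lambda item: (-item[1], item[0]))[:2]
```

Here `top_two` lists the two most frequent keywords with their document counts, ties broken alphabetically; synonym merging (for example "cellular automata" and "cellular automaton") would need an additional mapping step.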

Conceptual structure map analysis
Figure 4 shows the topic dendrogram based on factorial analysis of Keywords Plus, representing the conceptual structure of the current literature on UGM and capturing article content for a deeper understanding of the scientific concepts in UGM research over the past 40 years. The conceptual structure among reported keywords was visualized using three methods: Correspondence Analysis (CA), Multiple Correspondence Analysis (MCA), and Multidimensional Scaling (MDS), as seen in Figure 4 A, B, and C. The CA analysis shows that the red cluster has the most keywords, i.e., growth modeling, urban planning, urban growth, sustainable development, satellite imagery, remote sensing, land use, urban growth modeling, cellular automata, geographic information system, spatiotemporal analysis, numerical model, urbanization, land cover, regression analysis, urban development, urban area, and land use change, which reflects researchers' attention to the UGM subject theme of the study. The MCA analysis also shows that the red cluster has the most keywords. In contrast, MDS shows that the blue cluster has the most fundamental words, i.e., growth modeling, land use change, numerical analysis, land cover, spatiotemporal analysis, cellular automaton, regression analysis, urban development, urbanization, urban growth, and urban planning.

Co-authorship analysis between countries
Co-authorship analysis shows the contribution of authors from different countries to published research articles, which helps identify shared research interest in the same theme across countries. VOSviewer was used to visualize the social networks between co-authors and countries based on Links (L) and Total Link Strength (TLS), with a link threshold of 4 occurrences. Of the initial 45 countries, only 14 met the threshold and were grouped into 4 clusters with L=60 and TLS=124. The United States ranked first (L=8; TLS=28), followed by China (L=7; TLS=24), Iran (L=6; TLS=11), Australia (L=5; TLS=10), the United Kingdom (L=6; TLS=7), and the Netherlands (L=5; TLS=7), among others (Figure 5). The co-authorship analysis indicates that most work is carried out only in developed nations, while developing and underdeveloped nations are not part of the network. Networking with these nations is therefore necessary to create better sustainable urban planning by understanding each country's best-case scenario and experience with urban growth.
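To make the two network measures concrete, the following Python sketch derives Links and Total Link Strength from per-document country lists. It assumes full counting, where each co-authored document adds one to the strength of the link between every pair of its countries; the function names and sample data are hypothetical, and the sketch does not reproduce VOSviewer's thresholds or clustering.

```python
from collections import defaultdict
from itertools import combinations


def link_weights(documents):
    """Map each country pair to the number of documents they co-authored."""
    weights = defaultdict(int)
    for countries in documents:
        for a, b in combinations(sorted(set(countries)), 2):
            weights[(a, b)] += 1
    return dict(weights)


def node_stats(weights):
    """Return {country: (links, total_link_strength)}."""
    links = defaultdict(int)
    strength = defaultdict(int)
    for (a, b), weight in weights.items():
        for country in (a, b):
            links[country] += 1          # one more distinct partner
            strength[country] += weight  # add the strength of this link
    return {country: (links[country], strength[country]) for country in links}


# Hypothetical affiliations per document, for illustration only.
docs = [["USA", "China"], ["USA", "China"], ["USA", "Iran"], ["India"]]
weights = link_weights(docs)
stats = node_stats(weights)
```

In this toy network the USA has two links with a total strength of three, while India, which has no international co-authorship, does not appear in the network at all.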

Thematic analysis
Figure 6 presents thematic maps portraying prevailing and emerging themes in UGM based on keyword analysis. The thematic evolution analysis identified two thematic areas over four sub-periods of time slices (1983-2008; 2009-2014; 2015-2017; 2018-2020, and 2021-2023), as presented in Figure 6A. Moreover, thirty-three thematic areas were identified over these sub-periods based on article titles, using the thematic evolution of reported topics that caught readers' attention and showing the extent to which topics are related (Figure 6B).

Collaboration analysis between countries on UGM
R software was used to uncover new knowledge and determine the collaboration between researchers within the top 50 countries. The analysis shows that the USA, China, and Germany are located in clusters four, one, and five, respectively, with betweenness values of 235.48, 137.63, and 108.76, as can be seen in Table 5.
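Betweenness measures how often a country lies on the shortest collaboration paths between other countries. The sketch below computes unnormalised betweenness for an undirected, unweighted graph with Brandes' algorithm. It is a generic illustration: the adjacency-dictionary format and the toy network are assumptions, and it is not expected to reproduce the values in Table 5, which depend on the full network and on the R package's settings.

```python
from collections import deque


def betweenness(graph):
    """Unnormalised betweenness centrality (Brandes) for an undirected graph.

    `graph` maps every node to a list of neighbours and must be symmetric.
    """
    score = {v: 0.0 for v in graph}
    for source in graph:
        order = []                              # nodes by increasing distance
        preds = {v: [] for v in graph}          # shortest-path predecessors
        paths = {v: 0 for v in graph}           # number of shortest paths
        dist = {v: -1 for v in graph}
        paths[source], dist[source] = 1, 0
        queue = deque([source])
        while queue:                            # breadth-first search
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    paths[w] += paths[v]
                    preds[w].append(v)
        dependency = {v: 0.0 for v in graph}
        for w in reversed(order):               # accumulate from farthest node
            for v in preds[w]:
                dependency[v] += paths[v] / paths[w] * (1 + dependency[w])
            if w != source:
                score[w] += dependency[w]
    # Each unordered pair was counted from both endpoints.
    return {v: value / 2 for v, value in score.items()}


# Toy collaboration network: the USA bridges three otherwise unlinked partners.
toy = {"USA": ["China", "Germany", "Iran"],
       "China": ["USA"], "Germany": ["USA"], "Iran": ["USA"]}
centrality = betweenness(toy)
```

In the toy network every shortest path between two partner countries passes through the USA, so it receives a betweenness of 3 (one per pair) and all other countries receive 0.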

RESULTS AND DISCUSSION
Bibliometrics has had a tremendous impact in recent years, whether in governing, legislating, or seeking to better comprehend particular scientific subjects. To conduct the study, data were retrieved from Scopus. This database provides h-index ratings for journals, which are needed to track citations and determine the impact of publications. [45] A total of 218 documents, published from 1983 to 2023, were selected from the Scopus database. The study focused on a comprehensive analysis of global publication output and outlined possible future directions for researchers just venturing into the field of UGM by providing information on the growth and development of the literature, on active authors, journals, countries, and institutions, and a complete keyword analysis of the terms most frequently used in UGM research.
The findings showed that research surrounding UGM steadily increased and reached its highest peak between 2020 and 2022.
The retrieved publications on UGM received many citations, with an average of 27.88 per document, indicating a wide readership among scholars. Most UGM documents were published as research articles (71.56%), followed by full conference papers (18.80%) and book chapters (7.79%). These findings show that researchers in the UGM field prefer publishing their work as research articles rather than other types of documents.
The journal 'Computers, Environment, and Urban Systems' has been very active in publishing documents on UGM over the past years. Its published documents had a relatively high scientific contribution and ranking, with around 15 documents receiving 1047 citations. Thus, based on the analysis and evidence reported, researchers seem particularly interested in publishing their work as research papers in these domains. The most cited paper for UGM research is 'Modeling urban growth in Atlanta using logistic regression', which received over 378 citations.
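The abstract reports an h-index of 13 for this journal within the UGM dataset, meaning that 13 of its documents have at least 13 citations each. A minimal Python sketch of that definition is given below; the citation counts in the example are hypothetical and are not the journal's actual per-document citations.

```python
def h_index(citations):
    """Largest h such that at least h documents have h or more citations."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


# Hypothetical citation counts for five documents, for illustration only.
example = h_index([10, 8, 5, 4, 3])
```

For the hypothetical list, four documents have at least four citations each but there are not five with at least five, so `example` is 4.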
The keyword occurrence analysis mainly highlighted 'urban growth', 'land use', 'remote sensing', 'urban planning', 'cellular automaton', 'urban development', 'modeling', 'urbanization', 'GIS', and 'land use change', among others. These top 10 keywords can be used to identify future research hotspots for UGM. Conceptual structure map analysis using three methods (CA, MCA, and MDS) shows the distribution of topics and reflects researchers' attention to the UGM subject theme.
In addition, the thematic evolution analysis shows that research on urban growth and urban development decision-making has received more attention during the most recent time slice. The articles 'Modeling urban growth in Atlanta using logistic regression' by Hu Z et al., 2007, [42] and 'Urban Growth Pattern Modeling: A Case Study of Wuhan City, PR China' by Cheng J et al., 2003, [43] attracted the interest of most scientists and had the highest citations. These articles originally introduced UGM to the international urban planning community.
Based on the Total Link Strength (TLS), the United States, China, Iran, and Australia showed high occurrence in UGM research. Despite its many advantages, our study has limitations: it is based only on the Scopus database. Future bibliometric studies might consider using other databases, such as Web of Science and Google Scholar, to provide a more comprehensive overview of research productivity in UGM.

IMPLICATIONS
The present paper highlights the trends in urban growth modeling based on researchers' thinking on research directions, tools, techniques, and networking countries. The study finds that 'urban growth', 'land use', 'remote sensing', 'urban planning', 'cellular automaton', 'urban development', 'modeling', 'urbanization', 'GIS', and 'land use change' are the words most frequently used by researchers. However, very few have used artificial intelligence and machine learning; potentially, this will be the focus of future studies.
Even though models based on bottom-up and top-down approaches are already available, problems remain, as found in the literature. Most studies examined the individual influence of growth drivers rather than the combined influence of different growth drivers and their impact on human development. In a massive and complex environment, spatial interactions among growth drivers need to be explored, yet they are not well explored in available studies.
The spatial structure of an urban system is primarily the result of the spatial interaction of driving forces at local, regional, national, and global levels. [15] However, previous empirical studies have limitations; for example, the driving forces considered for urban expansion are only at the city scale, without considering the regional aspect. A city's growth is also influenced by nearby cities, i.e., a regional-level effect. There is also spatial autocorrelation among independent growth factors, yet only a few researchers have used the geographically weighted regression method to incorporate that effect. Most urban simulation studies use logistic or binary logistic regression models, which cannot represent a non-linear relationship among independent growth drivers. [6] Most recent studies on urban growth simulation are limited to six to seven driving factors of urbanization. Also, models used for urban growth simulation consider the probability of only one dynamic growth factor at a time. However, in the real urban world there is complicated interaction and competition among urban components, so a change in one driving factor also changes other related growth factors. Hence, it is essential to capture this complex and dynamic effect through simulation by coupling human and natural impacts with other factors. [46] Furthermore, very few urban growth simulation studies have considered the potential of socioeconomic and climatic variables. Because of long-term land-use dynamics, future climate change will considerably affect city growth, and hence it is necessary to study the impact of extreme weather events.
Traditional models, such as CA models, are limited in their ability to incorporate quantitative driving forces. Therefore, combining CA with other quantitative models, such as MC, MCE, LR, LTM, and ABM, is essential. However, these analytical techniques are unable to model more complex data, so ML models may be used to simulate potential land-use change. ML models can be combined with CA, LR, and FR models to increase simulation accuracy. [14,28,29] A land parcel can be likened to a small child: its behavior is shaped mainly by its surroundings, and it eventually mimics some of them. Further, researchers can use bio-inspired algorithms to solve complex, high-dimensional, multi-modal problems.
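The idea of a land parcel imitating its surroundings, moderated by a quantitative driver, can be illustrated with a toy cellular automaton. The Python sketch below is not any of the reviewed models: the transition rule (share of urban cells in a 3x3 neighbourhood multiplied by a suitability score, compared with a threshold), the threshold value, and the grids are all assumptions chosen for illustration. In a coupled CA-LR model the suitability score would typically come from a fitted logistic regression.

```python
def ca_step(grid, suitability, threshold=0.3):
    """One synchronous update of a toy urban-growth cellular automaton.

    grid        : rows of 0 (non-urban) or 1 (urban)
    suitability : rows of floats in [0, 1], same shape as grid (assumed input)
    A non-urban cell becomes urban when the urban share of its 3x3
    neighbourhood times its suitability reaches the threshold.
    """
    rows, cols = len(grid), len(grid[0])
    new_grid = [row[:] for row in grid]
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] == 1:
                continue  # urban cells stay urban in this sketch
            neighbours = [
                grid[i][j]
                for i in range(max(0, r - 1), min(rows, r + 2))
                for j in range(max(0, c - 1), min(cols, c + 2))
                if (i, j) != (r, c)
            ]
            urban_share = sum(neighbours) / len(neighbours)
            if urban_share * suitability[r][c] >= threshold:
                new_grid[r][c] = 1
    return new_grid


# Toy example: an urban strip on the left edge and one low-suitability cell.
grid = [[1, 0, 0],
        [1, 0, 0],
        [1, 0, 0]]
suitability = [[1.0, 1.0, 1.0],
               [1.0, 0.5, 1.0],
               [1.0, 1.0, 1.0]]
next_grid = ca_step(grid, suitability)
```

In this example the cells adjacent to the urban strip convert, except the centre cell, whose low suitability keeps it non-urban; the right-hand column has no urban neighbours yet and stays unchanged.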
Finally, calibration and validation of the model are essential. Scholars have mainly used 12 types of calibration and validation techniques for study areas ranging from 88.5 sq. km for Urmia, Iran [49] to 9.6 million sq. km for mainland China. [46] Most researchers calibrated their models on 10 to 20 years of spatial and non-spatial data and forecasted 10 to 40 years ahead at a resolution of 30 m, using neighborhood window sizes varying from 3x3 m [10] to 11x11 m. [50] Fine-resolution data and a 5x5 window size have given satisfactory results, [12,51] so researchers can choose calibration and validation techniques based on budget, resources, and type of assignment. Urban growth simulation studies reveal mainly nineteen types of scenarios that researchers may consider for their studies, such as sustainable development, conservation priorities, perspective of carrying capacity, business as usual, coalescence, diffusion, protection of cropland, impact of climate change, open space conservation, unconstrained development scenario, constrained development scenario, without zoning, zone based on the type of land use, zone based on administrative division, zone based on development planning subdivision, zone based on urbanized suitability, intensified residential development, encouragement of industrial and commercial development, optimum allocation of land use and perspective of land acquisition based on bargaining between farmers and governments. Over time, researchers have modeled urban growth using analytical models, machine learning algorithms, and bio-inspired algorithms. Now, the time has come to adopt artificial intelligence to model urban growth for a sustainable future.

Figure 2: Annual Trend of Publication on UGM from 1983 to 2023.

Figure 3: Word clouds illustrating high-frequency words in the UGM.

Figure 4: Conceptual structure map of UGM themes using Correspondence Analysis (A), Multiple Correspondence Analysis (B), and Multidimensional Scaling (C) (n = 25 keywords).

Figure 6: Representative thematic evolution diagram based on (A) Keywords Plus and (B) article title.

Table 1: Statistics of UGM Publications from 1983-2023.
Sheladiya and Patel: UGM for Sustainable Urban Planning and Climate Resilient Cities

Table 2: Top 10 most cited publications on UGM.
They subsequently refer to these notes for reference in other publications.
TC: Total Citations; MCP: Multiple Country Publication; SCP: Single Country Publication.