JAIST - Japan Advanced Institute of Science and Technology

03/27/2026 | Press release | Distributed by Public on 03/26/2026 19:02

Generative AI-Powered Forecasting for Sustainable Urban Development

A memory-aware multi-conditional generative model for predicting future urban layouts

Researchers introduce a novel generative AI-driven framework, MMCN (Memory-aware Multi-Conditional generation Network), for forecasting future urban layouts by jointly considering building density, building height, transportation networks, and historical development patterns. Leveraging a generative architecture-enhanced diffusion model with multi-conditional control, semantic prompt fusion, and spatial memory embedding, MMCN offers a novel approach to modeling complex urban evolution. This framework provides a powerful tool to explore sustainable urban development, demonstrating AI's transformative potential in urban design.

Environmental sustainability in urbanization has become a critical global concern as cities expand at unprecedented rates. Urban design faces the challenge of making long-term decisions about infrastructure, building development, transportation networks, and land use, all of which shape the future structure and sustainability of cities. These decisions are inherently complex, as urban growth emerges from the interaction of multiple factors, including building density, building height, road networks, and historical development patterns, which evolve together over time. Traditional urban design methods often struggle to capture these interconnected dynamics, making accurate forecasting of urban development impossible.

In response to this challenge, artificial intelligence (AI) has emerged as a promising tool for modeling complex spatial patterns and supporting data-driven urban planning. Yet, many existing generative AI-based models produce fragmented predictions because they may have difficulty in effectively integrating multiple urban development factors or maintaining spatial continuity across large areas.

To address these limitations, researchers at the Japan Advanced Institute of Science and Technology (JAIST) and Waseda University, Japan, developed a novel AI-driven framework called the Memory-aware Multi-Conditional generation Network (MMCN). The research team was led by Associate Professor Haoran Xie (JAIST and Waseda University) and included Doctoral Student Xusheng Du from JAIST and Professor Zhen Xu from Tianjin University, China, among others. Their study was published online on March 2, 2026, and will be published in Volume 141 of the top journal in urban design, Sustainable Cities and Society, on May 1, 2026.
Explaining the motivation behind the study, Dr. Xie said, "We aimed to bridge the gap between current AI capabilities and the practical needs of urban planners by developing a predictive model capable of forecasting future urban layouts while simultaneously considering multiple urban development factors and historical evolution patterns, as inspired by the actual decision-making workflow from professional planners."

The MMCN model relies on multi-temporal spatial data, including building layouts, building density, building height, and transportation networks, which were standardized into 512 × 512-pixel patches for model training. Especially, this model adopted the urban layout data of Shenzhen due to it being the most rapidly developing city in China. The network architecture combines a diffusion model with a multi-conditional control mechanism, allowing diverse urban factors to guide the generation process. A semantic prompt fusion module encodes information from each input type, while a spatial memory embedding component preserves contextual information from neighboring regions, ensuring continuity across patches. Multiple conditional generation branches integrated with the diffusion model form the core generative model, enabling the production of realistic, coherent urban layouts that remain consistent with historical patterns. Data training uses denoising and edge-stitching loss functions to enhance reconstruction accuracy and smooth transitions across patch boundaries. This approach allows MMCN to model complex interactions among urban variables and generate spatially consistent forecasts of urban development.

Experimental results demonstrated the framework's effectiveness. MMCN outperformed baseline methods such as Pix2Pix, CycleGAN, and Instruct-Pix2Pix, achieving a Structural Similarity Index (SSIM) of 0.885 and a Boundary Intersection over Union (IoU) of 0.642, indicating strong structural fidelity and spatial continuity. Qualitative analysis further confirmed that MMCN generates realistic, coherent urban layouts with continuous road networks and well-organized building clusters, whereas baseline models often produce fragmented roads, duplicated structures, or disconnected patterns. These findings highlight the importance of combining multi-factor conditioning, spatial memory mechanisms, and learning from historical patterns within a unified generative framework. Additional cross-city experiments using data from Shanghai and Tianjin in China further demonstrated the model's ability to produce stable and consistent urban layout predictions under diverse spatial conditions.

Beyond technical performance, MMCN offers practical benefits for urban design. By simulating potential growth scenarios, the framework allows planners to evaluate the long-term consequences of development strategies, supporting more informed and sustainable decisions. This aligns with the Sustainable Development Goals, particularly those focused on creating resilient and inclusive cities.

Looking ahead, the researchers envision several enhancements. Integrating climate models could enable assessment of environmental impacts, while including socio-economic data, could support more comprehensive forecasts. "Interactive planning tools built on MMCN could facilitate community and stakeholder engagement in urban design, promoting collaborative planning," said Dr. Xie. He added, "Expanding the dataset to include cities with diverse morphologies would improve the model's generalizability, making it applicable across different urban contexts worldwide."

In conclusion, MMCN represents a significant advancement in AI-assisted urban design, offering a novel approach to forecasting urban layout evolution by integrating multiple spatial factors and historical patterns. By producing accurate, spatially coherent predictions, it provides a powerful tool for guiding cities toward more resilient, livable, and sustainable futures in an increasingly urbanized world.

Image:

Image title: MMCN Framework for AI-Driven Urban Layout Forecasting
Image caption: Overview of the Memory-aware Multi-Conditional generation Network (MMCN) framework for forecasting future urban layouts. The system integrates multiple modules, including a spatial memory module that captures contextual information from neighboring regions, a multi-prompt fusion module that combines urban condition inputs such as building density, building height, and road networks, and a multi-conditional control module that guides a diffusion-based generative model. Together, these components enable the model to generate spatially coherent urban layout predictions while maintaining continuity across adjacent areas.
Image credit: Associate Professor Haoran Xie from JAIST
Image source link: N/A
License type: Original content
Usage restrictions: Credit must be given to the creator.

Image:

Image title: Generative AI-Based Prediction of Future Urban Layout Patches with MMCN
Image caption: Illustration of how MMCN predicts future urban layout patches. The model receives multiple inputs, including neighboring urban patches, historical building layouts, density maps, height maps, and road networks, which are used as prompts and control conditions for a diffusion-based model. Using these inputs, the model can generate a layout for a target region and stitches the generated patch together with surrounding areas to produce a spatially consistent urban map.
Image credit: Associate Professor Haoran Xie from JAIST
Image source link: N/A
License type: Original content
Usage restrictions: Credit must be given to the creator.

Reference

Title of original paper: AI-driven urban evolution forecasting: A unified memory-aware
multi-conditional generation framework for sustainable development planning
Authors: Xusheng Du, Chengyuan Li, Qingpeng Li, Yuxin Lu, Yimeng Xu, Ye Zhang,
Zhen Xu, and Haoran Xie
Journal: Sustainable Cities and Society
DOI: 10.1016/j.scs.2026.107272

Additional information for EurekAlert

Latest Article Publication Date: 01 May 2026
Method of Research: Computational simulation/modeling
Subject of Research: Not Applicable
Conflicts of Interest Statement: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper

Funding information

This work was supported by JST SPRING, Japan Grant Number JPMJSP2102, JST BOOST Program Japan Grant Number JPMJBY24D6, the National Key R&D Program of China under Grant 2024YFC3808104-01, and the National Natural Science Foundation of China under Grant 52508023.

March 24, 2026

JAIST - Japan Advanced Institute of Science and Technology published this content on March 27, 2026, and is solely responsible for the information contained herein. Distributed via Public Technologies (PUBT), unedited and unaltered, on March 27, 2026 at 01:02 UTC. If you believe the information included in the content is inaccurate or outdated and requires editing or removal, please contact us at [email protected]