K Kai

Xukai Zhao

Tsinghua University · PhD Student

I explore the future of Urban Studies in the age of Artificial Intelligence (AI). Currently a Ph.D. candidate at Tsinghua University, I develop computational AI frameworks to tackle complex urban challenges that traditional methods struggle to address.

My research leverages Natural Language Processing (NLP) and Computer Vision (CV) to process multi-modal big data, transforming unstructured noise into measurable, explainable insights. I am dedicated to integrating advanced AI techniques into the core of urban research.

Xukai Zhao
Scroll down for more

Skills

Technical

NLP Agentic pipeline CV RAG Knowledge graph Multimodal learning Urban big data

Tools

Python ArcGIS Pro Adobe Photoshop Adobe Illustrator AutoCAD Rhino SketchUp

IELTS

Overall Score 7.5
Listening 8.5
Reading 8.5
Writing 7.0
Speaking 6.5

Publications

Figure placeholder for Perception and Visitation: Multi Scale Measurement and Driving Mechanisms of Tourism Attractiveness across 143 Cities
Feb 2026 · Submitted to Nature Cities
Multi-agent

From Perception to Visitation Multi-Scale Measurement and Driving Mechanisms of Tourism Attractiveness across 143 Chinese Cities

Xukai Zhao, He Huang, Nan Bai, et al.

Multi-agentTourism attractivenessUser-generated contentLarge language model
Abstract

Tourism is a driver of urban vitality and sustainable development. As tourism attractiveness emerges from interactions among tourists, attractions, and supporting conditions, effective management requires understanding how attraction and supporting conditions influence tourists’ subjective evaluations (perceived attractiveness), and how these aspects together shape realized visitation (revealed attractiveness). However, existing studies rarely examine this full chain simultaneously across large-scale systems. Addressing this gap, we propose a tourist-centered framework to measure and interpret tourism attractiveness across 2,128 heritage attractions in 143 Chinese cities. Using 610,136 reviews, we quantify perceived attractiveness by aggregating eight cultural values extracted using a large language model pipeline (tolerance-1 accuracy=0.862), while proxying revealed attractiveness through review volume. We analyze both metrics at attraction, intra-city, and inter-city scales to investigate spatiotemporal patterns, and integrate attraction-, supporting-condition-, and city-level variables using XGBoost and SHAP to identify driving mechanisms. Results show that (1) both perceived and revealed attractiveness metrics exhibit a multi-core pattern clustered around Beijing, Shanghai, and Chengdu; (2) revealed attractiveness exhibits a distance-to-city-center decay and a polarized long-tail distribution; (3) major holidays increase visitation but diminish perceived attractiveness; (4) attraction-level attributes, especially official designation, visual quality, and ticket pricing, dominate both metrics but operate through different mechanisms; (5) positively perceived social, economic, scientific, historic and aesthetical values increase visitation, whereas political value is negatively associated; and (6) accessibility and spatial agglomeration contribute positively. Practically, multi-scale benchmarking and quadrant analyses identify underperforming attractions, while attraction-scale value profiles and mechanistic insights provide traceable evidence to support targeted strategies for sustainable tourism development.

Figure placeholder for A multi agent large language model workflow for analyzing perceived cultural values from social media
Oct 2025 · Submitted to Information Fusion
Agent and RAG

A multi agent large language model workflow for analyzing perceived cultural values from social media

Xukai Zhao, Yuxing Lu, He Huang, et al.

Multi-agentKnowledge GraphRAGTourism
Abstract

Heritage attractions are key urban assets that embody cultural values, shape local identity, and support economic development through tourism. Understanding the values the public perceives in these assets, and the reasons behind these perceptions from a bottom-up, public-centered perspective, is essential for evidence-based urban and heritage policy. Social media data (SMD) presents opportunities for such analysis, yet current methods struggle to scale to national coverage and to convert unstructured text into operational, decision-ready urban knowledge. To address these challenges, we introduce a multi-agent large language model workflow that identifies and quantifies eight cultural values from SMD and extracts supporting evidence to construct Cultural Heritage Perception Graphs (CHPGs), which structure and reveal the underlying reasons. Analyzing 284,030 online comments on 703 heritage attractions across 141 Chinese cities, we map how different cultural values are expressed, co-occur, and vary spatially across cities. The results highlight the prominence of aesthetical and social dimensions in public discourse, alongside more selective and context-dependent expressions of the other value dimensions. At the city level, overall perception scores correlate positively with comment volume, tourism revenue, and GDP per capita, revealing four distinct city profiles that can guide targeted management strategies. In practice, our CHPG-powered retrieval-augmented generation (RAG) system translates unstructured SMD into interactive, evidence-based insights, helping bridge the gap between public feedback and actionable decision making and supporting heritage management that is more responsive to public perception.

Figure placeholder for KARMA: Leveraging Multi Agent LLMs for Automated Knowledge Graph Enrichment
Sep 2025 · NeurIPS 2025 Spotlight
Multi-agent

KARMA: Leveraging Multi Agent LLMs for Automated Knowledge Graph Enrichment

Yuxing Lu, Wei Wu, Xukai Zhao, et al.

Multi-agentKG EnrichmentAutomation
Abstract

Maintaining comprehensive and up-to-date knowledge graphs (KGs) is critical for modern AI systems, but manual curation struggles to scale with the rapid growth of scientific literature. This paper presents KARMA, a novel framework employing multi-agent large language models (LLMs) to automate KG enrichment through structured analysis of unstructured text. Our approach employs nine collaborative agents, spanning entity discovery, relation extraction, schema alignment, and conflict resolution that iteratively parse documents, verify extracted knowledge, and integrate it into existing graph structures while adhering to domain-specific schema. Experiments on 1,200 PubMed articles from three different domains demonstrate the effectiveness of KARMA in knowledge graph enrichment, with the identification of up to 38,230 new entities while achieving 83.1% LLM-verified correctness and reducing conflict edges by 18.6% through multi-layer assessments.

Figure placeholder for Towards Doctor Like Reasoning: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients
Sep 2025 · NeurIPS 2025 Poster
Agent and RAG

Towards Doctor Like Reasoning: Medical RAG Fusing Knowledge with Patient Analogy through Textual Gradients

Yuxing Lu, Gecheng Fu, Wei Wu, Xukai Zhao, et al.

Medical RAGTextGradMulti Agent
Abstract

Existing medical RAG systems mainly leverage knowledge from medical knowledge bases, neglecting the crucial role of experiential knowledge derived from similar patient cases - a key component of human clinical reasoning. To bridge this gap, we propose DoctorRAG, a RAG framework that emulates doctor-like reasoning by integrating both explicit clinical knowledge and implicit case-based experience. DoctorRAG enhances retrieval precision by first allocating conceptual tags for queries and knowledge sources, together with a hybrid retrieval mechanism from both relevant knowledge and patient. In addition, a Med-TextGrad module using multi-agent textual gradients is integrated to ensure that the final output adheres to the retrieved knowledge and patient query. Comprehensive experiments on multilingual, multitask datasets demonstrate that DoctorRAG significantly outperforms strong baseline RAG models and gains improvements from iterative refinements. Our approach generates more accurate, relevant, and comprehensive responses, taking a step towards more doctor-like medical reasoning systems.

Figure placeholder for ClinicalRAG: Enhancing Clinical Decision Support through Heterogeneous Knowledge Retrieval
Aug 2024 · ACL 2024 Workshop
RAG

ClinicalRAG: Enhancing Clinical Decision Support through Heterogeneous Knowledge Retrieval

Yuxing Lu, Xukai Zhao, Jinzhuo Wang.

Clinical RAGHeterogeneous RetrievalMulti Agent
Abstract

Large Language Models (LLMs) have revolutionized text generation across diverse domains, showcasing an ability to mimic human-like text with remarkable accuracy. Yet, these models frequently encounter a significant hurdle: producing hallucinations, a flaw particularly detrimental in the healthcare domain where precision is crucial. In this paper, we introduce ClinicalRAG, a novel multi-agent pipeline to rectify this issue by incorporating heterogeneous medical knowledge—both structured and unstructured—into LLMs to bolster diagnosis accuracy. ClinicalRAG can extract related medical entities from user inputs and dynamically integrate relevant medical knowledge during the text generation process. Comparative analyses reveal that ClinicalRAG significantly outperforms knowledge-deficient methods, offering enhanced reliability in clinical decision support. This advancement marks a pivotal proof-of-concept step towards mitigating misinformation risks in healthcare applications of LLMs.

Figure placeholder for MetaBench: A Multi task Benchmark for Assessing LLMs in Metabolomics
Oct 2025 · arXiv:2510.14944
Benchmark

MetaBench: A Multi task Benchmark for Assessing LLMs in Metabolomics

Yuxing Lu, Xukai Zhao, J. Ben Tamo, et al.

BenchmarkMetabolomicsEvaluation
Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities on general text; however, their proficiency in specialized scientific domains that require deep, interconnected knowledge remains largely uncharacterized. Metabolomics presents unique challenges with its complex biochemical pathways, heterogeneous identifier systems, and fragmented databases. To systematically evaluate LLM capabilities in this domain, we introduce MetaBench, the first benchmark for metabolomics assessment. Curated from authoritative public resources, MetaBench evaluates five capabilities essential for metabolomics research: knowledge, understanding, grounding, reasoning, and research. Our evaluation of 25 open- and closed-source LLMs reveals distinct performance patterns across metabolomics tasks: while models perform well on text generation tasks, cross-database identifier grounding remains challenging even with retrieval augmentation. Model performance also decreases on long-tail metabolites with sparse annotations. With MetaBench, we provide essential infrastructure for developing and evaluating metabolomics AI systems, enabling systematic progress toward reliable computational tools for metabolomics research.

Figure placeholder for Urban planning in the age of large language models: Assessing OpenAI o1's performance and capabilities across 556 tasks
Jul 2025 · Computers, Environment and Urban Systems (IF=8.3)
Benchmark

Urban planning in the age of large language models: Assessing OpenAI o1's performance and capabilities across 556 tasks

Xukai Zhao, He Huang, Tao Yang, et al.

OpenAI o1Urban PlanningLarge Language ModelPerformance evaluation
Abstract

Integrating Large Language Models (LLMs) into urban planning presents significant opportunities to enhance efficiency and support data-driven city development strategies. Despite their potential, the specific capabilities of LLMs within the urban planning context remain underexplored, and the field lacks standardized benchmarks for systematic evaluation. This study presents the first comprehensive evaluation focused on OpenAI o1's performance and capabilities in urban planning, systematically benchmarking it against GPT-3.5 and GPT-4o using an original open-source benchmark comprising 556 tasks across five critical categories: urban planning documentation, examinations, routine data analysis, AI algorithm support, and thesis writing. Through rigorous testing and manual analysis of 170,627 words of generated output, OpenAI o1 consistently outperformed its counterparts, achieving an average performance score of 84.08 % compared to 69.30 % for GPT-4o and 45.27 % for GPT-3.5. Our findings highlight o1's strengths in domain knowledge mastery, basic operational competence, and coding capabilities, demonstrating its potential applications in information retrieval, urban data analytics, planning decision support, educational assistance, and LLM-based agent development. However, significant limitations were identified, including inability in urban design, susceptibility to fabricating information, moderate academic writing quality, challenges in high-level professional examinations, and spatial reasoning, and limited support for specialized or emerging AI algorithms. Future optimizations should prioritize enhancing multimodal integration, implementing robust validation mechanisms, adopting adaptive learning strategies, and enabling domain-specific fine-tuning to meet urban planners' specialized needs. These advancements would enable LLMs to better support the evolving demands of urban planning, allowing professionals to focus more on strategic decision-making and the creative aspects of the field.

Figure placeholder for Exploring Temporal Spatial Patterns and Nonlinear Driving Mechanism of Park Perceptions: A Multi Source Big Data Study
Mar 2025 · Sustainable Cities and Society (JCR Q1, IF=12.0)
NLP

Exploring Temporal Spatial Patterns and Nonlinear Driving Mechanism of Park Perceptions: A Multi Source Big Data Study

Xukai Zhao, He Huang, Guangsi Lin, et al.

RoBERTaSocial media dataSHAPUrban Parks
Abstract

To fully realize the benefits of parks, they must be both accessible and usable, with those excelling in these aspects often perceived as more attractive. Traditional surveys for evaluating perceived park accessibility, usability, and attractiveness are expensive and time-consuming, prompting the adoption of social media data as a viable alternative. This study fine-tuned the Chinese-RoBERTa-wwm-ext model on a specially curated dataset to measure perceived accessibility, usability, and attractiveness across 270 parks in Beijing and Guangzhou through 153,872 online comments. We conducted statistical analyses to uncover temporal patterns and incorporate park perception scores into the 2SFCA method for spatial distribution analysis. Additionally, we utilized XGBoost, SHAP, and PDP to investigate the nonlinear driving mechanisms behind these perceptions. Key findings include: (1) Park visitation demonstrates a strong seasonal pattern, with central urban parks consistently outperforming suburban ones; (2) Central subdistricts might face reduced park services due to high population demands; (3) Accessibility is significantly influenced by ticket pricing and transportation availability, especially bus stations; (4) Usability is optimal at a moderate density of sports and fitness facilities (22 per km2) and proximity to residential areas; (5) Attractiveness benefits from closeness to the Central Business District and amenities such as toilets and restaurants, with a critical park size threshold of 9 km2. These public-oriented analyses identify areas for improvement and factors shaping public perceptions, providing valuable guidance for strategic decision-making and effective urban management.

Figure placeholder for Assessing and Interpreting Perceived Park Accessibility, Usability and Attractiveness through Texts and Images from Social Media
Feb 2025 · Sustainable Cities and Society (JCR Q1, IF=12.0)
Multi-model

Assessing and Interpreting Perceived Park Accessibility, Usability and Attractiveness through Texts and Images from Social Media

Xukai Zhao, Yuxing Lu, Wenwen Huang, et al.

CLIPMultimodalSHAPSocial media data
Abstract

Understanding public perceptions of urban parks is essential for their effective management. While conventional survey methods are resource-intensive, Social Media Data (SMD) offers a cost-effective alternative to gathering public insights. However, using SMD for park perception assessment remains challenges, particularly in integrating text and image analysis and filtering irrelevant content to identify influencing factors across various dimensions. Based on a manually curated dataset, this study introduces the Park Dual-modal Perception (PDP) model, a cutting-edge approach combining SMD text and image analysis to evaluate perceived park accessibility, usability, and attractiveness with an average accuracy of 86.81 %, outperforming the commonly used BERT model by 8.26 %. Utilizing SMD from 130 parks in Guangzhou, the model effectively quantifies the three dimensions, generating visual scoring maps to identify parks with lower perceived scores at the urban scale. Further incorporation of SHapley Additive exPlanations (SHAP) within the PDP model filters 82.79 % of irrelevant words and extracts 158 thematic words and 954 associated words, providing targeted suggestions for park improvements. Our findings indicate that (1) factors such as distance, travel time, ticket prices, and proximity to commercial amenities critically influence park accessibility. (2) Park usability hinges on park's ability to serve diverse groups and provide well-maintained, multifunctional facilities. (3) Park attractiveness is closely linked with the cultural and regulatory characteristics of ecosystem services. Our methodology combines assessment and interpretation of public perceptions at both city and park scales. It aids decision-makers in identifying low-rated parks and understanding the underlying reasons, thereby facilitating more informed urban planning decisions.

Figure placeholder for How to Quantify Multidimensional Perception of Urban Parks?
Dec 2024 · Urban Forestry and Urban Greening (JCR Q1, IF=6.7)
NLP

How to quantify multidimensional perception of urban parks? Integrating deep learning-based social media data analysis with questionnaire survey methods

Wenwen Huang, Xukai Zhao, Guangsi Lin, et al.

PerceptionERNIEText ClassificationMultidimensional perception
Abstract

Urban parks are places where people regularly connect with nature and each other. Quantifying perceptions of urban parks presents significant changes. Recently, social media data has been increasingly used for studying landscape perceptions, preferences, and management. With the advent of deep learning techniques, the performance of NLP tasks has seen considerable improvement. We posed research questions at the methodological level: How could deep learning-based NLP methods be constructed to assess the multidimensional perception (MDP) of urban parks? How could the assessment performance of this method be validated? In this study, we constructed an MDP of urban parks assessment model based on ERNIE and subsequently conducted a questionnaire survey. By comparing the differences and similarities between the two data sets, we verified the model's assessment performance and proposed the application potential of deep learning-based methods. The findings indicated: (1) our model effectively obtained and assessed sentiment information from online reviews about park accessibility, safety, aesthetics, attractiveness, maintenance, and usability with an accuracy rate exceeding 80 %. (2) The questionnaire survey data confirmed the model's high efficacy, showing consistency in accessibility, aesthetics, and maintenance, but inconsistency in attractiveness and usability due to differences in data expression and timeliness. (3) Deep learning-based NLP methods significantly enhanced sentiment analysis of social media data, showing great potential for practical applications. The results could enhance the performance of sentiment analysis on social media data, serving as a decision-aid tool for park managers and policymakers, and providing valuable insights and guidance for park construction and management.

Figure placeholder for Research on the Perception Evaluation of Urban Green Spaces Using Panoramic Images and Deep Learning
Jul 2024 · Landscape Architecture Frontiers
CV

Research on the Perception Evaluation of Urban Green Spaces Using Panoramic Images and Deep Learning

Xukai Zhao, Guangsi Lin.

Panoramic imagesViTSemantic SegmentationImage classification
Abstract

Visual quality assessment of urban green spaces is a majortopic in landscape architecture research, yet traditional methodsface limitations in practice. The rapid development of artificialintelligence and street-view big data offers opportunities foradvancing green space perception studies. However, the lack offull street view image coverage of green spaces in China poseschallenges for related research. Focusing on public landscapeperception evaluation, this research took Zhuiiang Park inGuangzhou, China as a case study. The research team utilizeoa convenient image collection method by panoramic cameraand an effective processing workflow, and then employed the Segformer-B5 semantic segmentation model and the ViT-base-p16image classification model to calculate four objective evaluationmetrics (green view index, sky view factor, road visibility index, andartificial structure visibility index) and four subiective evaluationmetrics (attractiveness, richness, naturalness, and depression)for visual quality assessment. Based on the spatial distributionresults of these metrics,comprehensive analvses were conductedand low-score areas were identified,Research results indicatethat vegetation and water features significantly enhance parkattractiveness and positive perceptions, while excessive sky andartificial structures produce negative effects; oppressive artificial landscapes and constrained architectural views also lower overalllandscape quality. The image collection and visual perceptionevaluation methods proposed in this study provide a scientific basisfor the renovation and management of urban green spaces.

Figure placeholder for Multiscale Scoring Model for Enhanced Urban Perception Evaluation
Mar 2024 · ICASSP 2024 (CCF-B)
CV

Multiscale Scoring Model for Enhanced Urban Perception Evaluation

Xukai Zhao, Yuxing Lu, Jinzhuo Wang.

ViTPlace PulseImage classificationPerception evaluation
Abstract

Effective urban management, renewal, and development rely on identifying low-quality areas within the city. However, previous studies have been limited by low-volume handcraft surveys and a dearth of data sources, making it difficult to understand human perception within the urban environment. In this paper, we propose a powerful yet simple scoring model to perform street view image recognition and evaluation which utilizes both global information and feature-level semantic information of street elements, resulting in a high-precision perception model on 6 indexes (Beautiful, Lively, Safe, Wealthy, Boring, and Depressing) from Place Pulse 2.0 dataset. The model is then independently applied to a large-scale and fine-grained evaluation task of 4,384 street view images in Shameen Region, Guangzhou, providing valuable perception details and decision-making support for urban planning for the local government. We believe our work will accelerate the digitization and intelligent transformation of municipal engineering.

Figure placeholder for An integrated deep learning approach for assessing the visual qualities of built environments utilizing street view images
Dec 2023 · EAAI (JCR Q1, IF=8.0)
CV

An integrated deep learning approach for assessing the visual qualities of built environments utilizing street view images

Xukai Zhao, Yuxing Lu, Guangsi Lin.

ConvNeXtSegformer-B5Grad CAMStreet View
Abstract

Investigating residents' visual preferences and perception of built environments is crucial in visual landscape assessment (VLA). While traditional methods face challenges in large-scale applications, the advancement of deep learning techniques and the availability of street view images (SVIs) present new opportunities. However, existing approaches for assessing SVIs' visual qualities are of lower precision, and the link between objective visual elements and subjective perceptions of SVIs remains unclear. In this study, we propose a novel deep learning approach, “SegFormer-B5 + ConvNeXt-B + RF”, which achieves an average accuracy of 78.47% in predicting six subjective perceptions (beautiful, boring, depressing, lively, safe, and wealthy) within the Place Pulse 2.0 dataset. This provides an effective tool for assessing citizens' visual perceptions of urban environments. Subsequently, to demonstrate its practical application, we conducted a case study using 36,620 SVIs from the Tianhe District of Guangzhou. Perception maps were constructed based on four objective metrics and six subjective metrics. Results showed a correlation between the spatial distribution of objective visual elements and subjective perceptions, with city centers generally perceived more positively than suburbs. Our application of SHapley Additive exPlanation (SHAP) and Class Activation Map (CAM) visualizations yielded interpretable insights consistent with eye-tracking studies, highlighting human focus on artificial objects, attractive and unattractive elements, and heterogeneous landscapes. It's noteworthy that urban planners and decision-makers in other cities can apply our approach to generate perception maps that identify low-quality areas. SHAP and CAM visualizations further assist in understanding which aspects draw human attention in these areas. This knowledge is crucial for urban designers to implement targeted renewal strategies, ultimately contributing to the creation of sustainable and living-friendly cities.

Figure placeholder for Medical knowledge enhanced prompt learning for diagnosis classification from clinical text
Jun 2023 · ACL 2023 Workshop
NLP

Medical knowledge enhanced prompt learning for diagnosis classification from clinical text

Yuxing Lu, Xukai Zhao, Jinzhuo Wang.

Prompt LearningMedical NLP
Abstract

Artificial intelligence based diagnosis systems have emerged as powerful tools to reform traditional medical care. Each clinician now wants to have his own intelligent diagnostic partner to expand the range of services he can provide. When reading a clinical note, experts make inferences with relevant knowledge. However, medical knowledge appears to be heterogeneous, including structured and unstructured knowledge. Existing approaches are incapable of uniforming them well. Besides, the descriptions of clinical findings in clinical notes, which are reasoned to diagnosis, vary a lot for different diseases or patients. To address these problems, we propose a Medical Knowledge-enhanced Prompt Learning (MedKPL) model for diagnosis classification. First, to overcome the heterogeneity of knowledge, given the knowledge relevant to diagnosis, MedKPL extracts and normalizes the relevant knowledge into a prompt sequence. Then, MedKPL integrates the knowledge prompt with the clinical note into a designed prompt for representation. Therefore, MedKPL can integrate medical knowledge into the models to enhance diagnosis and effectively transfer learned diagnosis capacity to unseen diseases using alternating relevant disease knowledge. The experimental results on two medical datasets show that our method can obtain better medical text classification results and can perform better in transfer and few-shot settings among datasets of different diseases.

Figure placeholder for Exploration of non linear influence mechanisms of traditional courtyard forms on thermal comfort
Feb 2025 · Sustainable Cities and Society (JCR Q1, IF=12.0)
Other

Exploration of non linear influence mechanisms of traditional courtyard forms on thermal comfort

Wenke Wang, Yang Shi, Jie Zhang, Xukai Zhao

Thermal ComfortCourtyardNonlinear
Abstract

Traditional courtyard architecture offers superior climate regulation function through its unique enclosed spatial form, which is significant in mitigating global warming, urban heat islands, and promoting sustainable urban development. However, existing research has not thoroughly explored the comprehensiveness of climate zones, the systematicity of morphological elements, and the non-linear mechanisms. This study investigates traditional courtyards in Beijing using an integrated approach combining field survey, ENVI-met numerical simulation, machine learning, and SHAP interpretation methods to reveal the non-linear influence mechanisms of courtyard morphology on thermal comfort. The results show that: 1) The XGBoost model achieves high accuracy in fitting study data, with R2 values of 0.99 and 0.85 for training and test sets, respectively; 2) The contribution of the built environment to thermal comfort in winter and summer (70 % and 52 %, respectively) is significantly higher than that of the natural environment, especially in winter; 3) Morphological features affect thermal comfort in winter and summer in a mostly synergistic way, but there are differences in the curve trend, threshold distribution, and the range of benefits, e.g., the benefits of Sky View Factor (SVF) on thermal comfort in winter and summer can be up to 2.0 °C and 1.5 °C, respectively; (4) The enhancement of thermal comfort of traditional courtyards in Beijing can be achieved through parameters such as aspect ratio (HAR), shape index (SI), and building coverage (BDG), but the implementation process is susceptible to limitations such as cultural values. The conclusions of this study make up for the shortcomings of current research areas, influencing elements, and research methods and help promote the sustainable development of courtyards in similar climatic and cultural contexts.

Figure placeholder for Evaluating the effectiveness of community gardens by a quantitative systematic framework
Apr 2022 · Sustainable Cities and Society (JCR Q1, IF=12.0)
Other

Evaluating the effectiveness of community gardens by a quantitative systematic framework

Jiesi Wang, Guanting Zhang, Xukai Zhao

Spatial optimizationEvaluation FrameworkDecision support systems
Abstract

Community gardens play a significant role in urban agriculture and provide a valuable resource for urban regeneration. Few studies quantitatively evaluate their impacts, which are valuable for policymakers to improve urban sustainability by site optimization. In this study, we set up a multimetric framework to quantitatively evaluate the effectiveness of community gardens at census tract resolution. Attractiveness is the external determinant of their effectiveness and reflects residents’ motivation to participate in garden activities, while the cover rate and serviceability represent two internal determinants. This framework is then applied to analyze the changes in community gardens in St. Louis, Missouri in the US from 2010 to 2019. The results reveal that tract attractiveness is positively correlated with the poverty rate in 2010 but negatively correlated in 2019, suggesting a significant temporal change in garden impacts. Collectively, our findings provide a multimetric framework for the spatial and temporal analysis of community gardens and represent a valuable decision support system to improve urban sustainability.

Intelligent Perception Analysis Platform

Explore my other site

I also developed an Intelligent Perception Analysis Platform to demonstrate my park perception models for social media data analysis.

Try the models
Intelligent Perception Analysis Platform screenshot

Honors and Awards

Scholarships and honors

  1. Sep 2025
    Tsinghua University Future Scholar Scholarship
    清华大学未来学者奖学金
  2. Nov 2024
    National Scholarship for Graduate Students
    硕士研究生国家奖学金
  3. Nov 2024
    Jingtang He Science and Technology Innovation Award
    何镜堂科技创新奖
  4. Nov 2023
    Longfor Scholarship
    龙湖奖学金
  5. Nov 2021
    Soochow University 2021 Model Student
    苏州大学2021年学生标兵
  6. Oct 2021
    National Endeavor Scholarship, Special Prize for Academic Excellence
    国家励志奖学金、学习优秀特等奖
  7. Nov 2020
    National Endeavor Scholarship, Special Prize for Academic Excellence
    国家励志奖学金、学习优秀特等奖
  8. Nov 2019
    National Endeavor Scholarship, Diandian Scholarship
    国家励志奖学金、点点奖学金

Academic activities

  1. Feb 2026
    International Academic Conference on Sustainable Cultural Heritage and Historic Cities (Oral)
    可持续的文化遗产与历史城市国际学术交流会・平遥
  2. Dec 2025
    NeurIPS 2025 (Spotlight and Poster)
    San Diego, United States
  3. Nov 2025
    Smart, Interdisciplinary, Green — The Future of Urban Development
    清华大学“智能·交叉·绿色——城市建设的未来”学术论坛・北京
  4. Jun 2025
    PhD Academic Forum of THU (Excellent Paper)
    清华大学第博士生学术论坛优秀论文・北京
  5. Sep 2024
    IFLA 2024 (Oral)
    Istanbul, Turkey
  6. Sep 2024
    PhD Academic Forum of THU (Excellent Paper)
    清华大学第博士生学术论坛优秀论文・北京
  7. May 2024
    ICASSP 2024 (Poster)
    Seoul, South Korea
  8. May 2024
    Youth Geoscience Forum (Poster)
    青年地学论坛・厦门
  9. Nov 2023
    Annual Conference of Territorial Spatial Planning (First prize)
    国土空间规划学术年会一等奖・广州
  10. Nov 2023
    PhD Academic Forum of THU (Excellent Paper)
    清华大学第博士生学术论坛优秀论文・北京

Competitions and design awards

Competition drawing for 18th Challenge Cup Guangdong Competition
May 2025
Grand Prize, 18th Challenge Cup Guangdong Competition
Competition drawing for Milan Design Week University Design Exhibition Additional competition drawing for Milan Design Week University Design Exhibition
May 2024
First Prize, Milan Design Week University Design Exhibition
Jun 2023
First Place, National Community Garden Design and Construction Competition
Apr 2023
Gold Award, Guangzhou Garden Expo Student Design Competition
May 2021
CHSLA 2nd award
Dec 2020

Patents

Jan 2026
A multi agent large language model system for perceived cultural heritage value analysis and retrieval
Published
Xukai Zhao, He Huang
Feb 2025
A method and device for scoring park social media reviews and extracting core phrases
Published
Guangsi Lin, Xukai Zhao, et al.
Feb 2025
A method for extracting focal objects in park images based on visual saliency prediction
Published
Guangsi Lin, Xukai Zhao, et al.
Jan 2024
A multimodal scoring method and system coupling park social media texts and images
Granted
Xukai Zhao, Guangsi Lin, et al.
Dec 2023
A built environment perception evaluation method based on street view images, system, device, and medium
Published
Xukai Zhao, Guangsi Lin, et al.

Peer Reviewer

Technology in Society journal cover
Technology in Society
JCR Q1 · IF = 12.5
Sustainable Cities and Society journal cover
Sustainable Cities and Society
JCR Q1 · IF = 12.0
Expert Systems with Applications journal cover
Expert Systems with Applications
JCR Q1 · IF = 7.5
Urban Forestry & Urban Greening journal cover
Urban Forestry & Urban Greening
JCR Q1 · IF = 6.7
Cities journal cover
Cities
JCR Q1 · IF = 6.6
Applied Geography journal cover
Applied Geography
JCR Q1 · IF = 5.4

Visitor Locations