Yiyang Bian โ˜•๏ธ
Yiyang Bian

Ph.D. Student in Computer Science

About Me

I am a Ph.D. student in Computer Science at the University of California, Riverside, where I am currently working in the Center of Geospatial Sciences. I am delighted to work with Prof. Amr Magdy, mainly engaged in spatial data science, data management, query optimization, and machine learning.

During my masterโ€™s study, I was very fortunate to participate in research work in the database laboratory led by Prof. Yinghui Wu.

Pleas click the button down below to get my latest Curriculum Vitae!

Download CV
Interests
  • Database
  • Data Management
  • Machine Learning
  • Information Retrieval
Education
  • Ph.D Computer Science

    University of California, Riverside

  • M.S Computer Science

    Case Western Reserve University

  • B.S Computer Science

    Central China Normal University

๐Ÿ“š My Research
My current research focuses on Geo-spatial Retrieval-Augmented Generation (RAG), where I explore how large language models can be enriched with spatial data and reasoning. Unlike traditional RAG that only leverages textual knowledge, Geo-spatial RAG requires integrating geographic context, spatial databases, and semantic retrieval to answer location-aware and context-rich questions. I am particularly interested in designing frameworks that combine spatial indexing, semantic embeddings, and conversational query reformulation to achieve more accurate, transparent, and interactive responses. This includes addressing challenges such as spatio-temporal constraints, user intent disambiguation, and trustworthy enrichment of queries. More broadly, my research lies at the intersection of data management, geospatial AI, and trustworthy machine learning, with the long-term goal of creating systems that bridge the gap between complex spatial data landscapes and human-centered decision-making. I am always excited to discuss collaborations and new ideas in this space ๐Ÿ˜ƒ
Recent Publications
(2025). MODis: Generating Skyline Datasets for Data Science Models. In EDBT 2025.
(2024). ModsNet: Performance-aware Top-k Model Search using Exemplar Datasets. In VLDB.
(2023). Selecting Top-k Data Science Models by Example Dataset. In CIKM.
(2022). CRUX: Crowdsourced Materials Science Resource and Workflow Exploration. In CIKM.
Recent News

๐ŸŽ‰ I will be a Ph.D student!

Starting from the Fall of 2024, I will join the University of California, Riverside to begin my PhD career, and I will join the Spatial Data Laboratory led by Prof. Amr Madgy.

โœ… CIKM 2023

I am very honored to receive the CIKM 2022 conference travel award. See you in Atlanta in October!