☕️

Qian Ma

Ph.D. Student in Computer Science

Biography

I am a Ph.D. student in Computer Science at the DAMI Lab, Rensselaer Polytechnic Institute, advised by Prof. Yao Ma. My research focuses on multimodal learning and reasoning, vision-language models, multimodal RAG, knowledge-based visual question answering (KB-VQA), and reliable foundation-model evaluation. I also have expertise in graph learning, graph foundation models, graph self-supervised learning, spatio-temporal forecasting, time-series modeling, and urban computing.

Download CV

Education

Ph.D. Student in Computer Science

Jan. 2024
Jan. 2028 (expected)

Rensselaer Polytechnic Institute

M.Sc. in Multimedia Information Technology, Distinction

2022
2024

City University of Hong Kong

B.Eng. in Software Engineering - Systems and Technology

2018
2022

University of Electronic Science and Technology of China

Interests

Multimodal Learning and Reasoning Vision-Language Models Multimodal RAG Knowledge-Based Visual Question Answering Reliable Foundation-Model Evaluation Graph Learning and Graph Foundation Models Spatio-Temporal Forecasting

Research

My current research focuses on multimodal learning and reasoning, vision-language models (VLMs), multimodal RAG, knowledge-based visual question answering (KB-VQA), and reliable foundation-model evaluation.

I am interested in how foundation models ground non-textual structure, visual entities, and external knowledge. Before moving toward multimodal RAG and KB-VQA, I worked on graph learning, graph self-supervised learning, graph foundation models, spatio-temporal forecasting, time-series modeling, and urban computing.

Publications

Qian Ma, S M Rayeed, Charles V. Stewart, Qiong Wu, Yao Ma. Identifying and Resolving Pitfalls of Knowledge-Based VQA Benchmarks: Auditing, Repairing, and Augmenting. ECCV 2026, to appear.
Qian Ma, Qiong Wu, Zhengyi Zhou, Yao Ma. Ground Then Rank: Revisiting Knowledge-Based VQA with Training-Free Entity Identification. ACL 2026 Findings, to appear.
Qian Ma, Haitao Mao, Jingzhe Liu, Zhehua Zhang, Chunlin Feng, Yu Song, Yihan Shao, Yao Ma. Do Neural Scaling Laws Exist on Graph Self-Supervised Learning? LoG 2024.
Qian Ma, Hongliang Chi, Hengrui Zhang, Kay Liu, Zhiwei Zhang, Lu Cheng, Suhang Wang, Philip S. Yu, Yao Ma. Overcoming Pitfalls in Graph Contrastive Learning Evaluation: Toward Comprehensive Benchmarks. ACM SIGKDD Explorations, 2025.
Qian Ma, Haitao Mao, Zhehua Zhang, Qiong Wu, Zhengyi Zhou, Yao Ma. Cross-Domain GraphWalker: Harnessing LLMs for Graph Structure Learning. Under review.
Qian Ma, Zijian Zhang, Xiangyu Zhao, Haoliang Li, Hongwei Zhao, Yiqi Wang, Zitao Liu, Wanyu Wang. Rethinking Sensors Modeling: Hierarchical Information Enhanced Traffic Forecasting. CIKM 2023.
Zijian Zhang, Xiangyu Zhao, Qidong Liu, Chunxu Zhang, Qian Ma, et al. PromptST: Prompt-Enhanced Spatio-Temporal Multi-Attribute Prediction. CIKM 2023.

See my Google Scholar profile for updates.

Misc

Conference notes, travel reflections, and small research-life updates will live here.

Gallery

A space for conference photos, posters, and snapshots from research travel.

Contact

Affiliation

Department of Computer Science

Rensselaer Polytechnic Institute

Troy, NY, United States

Connect

maq5@rpi.edu

Phone

+1 518-308-3995

Trending Searches

No results found