About Mingjie
Mingjie Tang works on LLM systems and algorithms. He previously served as tech lead at Ant Group and as a member of the technical staff at Hortonworks/Cloudera. His research interests include database systems, distributed machine learning, big data computation, and distributed deep learning.
He received his PhD in Computer Science from Purdue University, West Lafayette, Indiana, under the supervision of Professor Walid G. Aref. His doctoral research focused on distributed systems for spatial computation, machine learning, and artificial intelligence.
Industry Experience
| 2025.11 - present | Researcher, Consultant, iQuest Research Lab |
| 2018.10 - 2022.10 | AI Engineer, Ant Group, CA, USA |
| 2016.9 - 2018.10 | Member of tech staff, Hortonworks, CA, USA |
| 2015.5 - 2015.8 | Research Intern, IBM Research Almaden, CA, USA |
| 2012.5 - 2012.8 | Software Engineer Intern, Microsoft, Seattle, USA |
Education
| 2010.9 - 2016.9 | PhD, Purdue University, IN, USA |
| 2010.9 - 2012.12 | M.S., Purdue University, IN, USA |
| 2007.9 - 2010.7 | M.S., University of Chinese Academy of Sciences, Beijing, China |
| 2003.8 - 2007.7 | B.S., Sichuan University, Chengdu, China |
Publication (Recent)
Google Scholar, DBLP — View all publications →
2026
-
DLRover-LM: LLM Pre-Training Framework with Thousands of Accelerators in AntGroup
Ziling Huang, Zhengmao Ye, Qingsong Cai, Zelong Huang, Bo Sang, Haitao Zhang, Jian Sha, Tingfeng Lan, Hui Lu, Yuanchun Zhou, Mingjie Tang
Proceedings of the IEEE International Conference on Data Engineering (ICDE '26), May 2026.
-
Are Your LLM-based Text-to-SQL Models Secure? Exploring SQL Injection via Backdoor Attacks
Meiyu Lin, Haichuan Zhang, Jiale Lao, and Renyuan Li, Yuanchun Zhou, Carl Yang, Yang Cao, Mingjie Tang
Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD ’26), Jun 2026.
2025
-
Tuning Less, Prompting More: A Cost-Effective and Flexible Training and Inference Pipeline for Natural Language Transformation
Shuyun Yang, Zhengmao Ye, Yan Zhang, and Lei Duan, Mingjie Tang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP ’25), Nov 2025.
-
Concise reasoning via reinforcement learning
Mehdi Fatemi, Banafsheh Rafiee, Mingjie Tang, Kartik Talamadupula
-
mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs
Zhengmao Ye, Dengchun Li, Zetao Hu, Tingfen Lan, Sha Jian, Sicong Zheng, Lei Duan, Jie Zuo, Hui Lu, Yuanchun Zhou, Mingjie Tang
Proceedings of Very Large Data Bases Conference (VLDB), 2025.
2024
-
DLRover-RM: Resource Optimization for Deep Recommendation Models Training in Cloud
Qinlong Wang, Tingfeng Lan, Yinghao Tang, Bo Sang, Ziling Huang, Yihen Du, Haitao Zhang, Shajian, Ke Zhang, Hui Lu, Yuanchun Zhou, Mingjie Tang
Proceedings of Very Large Data Bases Conference (VLDB), 2024.
-
BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks
Zhiyuan Cheng, Zhaoyi Liu, Tengda Guo, Shiwei Feng, Dongfang Liu,Mingjie Tang, Xiangyu Zhang
International Conference on Machine Learning (ICML), 2024.
-
A Demonstration of GPTuner: A GPT-Based Manual-Reading Database Tuning System
Jiale Lao, Yibo Wang, Yufei Li, Jianping Wang, Yunjia Zhang, Zhiyuan Cheng, Wanghu Chen, Yuanchun Zhou, Mingjie Tang, Jianguo Wang
Demo Proceedings of ACM Conference on Management of Data (SIGMOD), 2024.
-
GPTuner: A Manual-Reading Database Tuning System via GPT-Guided Bayesian Optimization
Jiale Lao, Yibo Wang, Yufei Li, Jianping Wang, Yunjia Zhang, Zhiyuan Cheng, Wanghu Chen, Mingjie Tang, Jianguo Wang
Proceedings of Very Large Data Bases Conference (VLDB), 2024.
🏆 Selected for SIGMOD Research Highlight Awards 2025!
-
Couler: Unified Machine Learning Workflow Optimization in Cloud
Xiaoda Wang, Yuan Tang, Tengda Guo, Bo Sang,Jingji Wu, Jian Sha, Ke Zhang, Jiang Qian, Mingjie Tang
40th IEEE International Conference on Data Engineering (ICDE) 2024
2023
-
Cougar: A General Framework for Jobs Optimization In Cloud
Bo Sang, Shuwei Gu, Xiaojun Zhan, Mingjie Tang, Jian Liu, Xuan Chen, Jie Tan, Haoyuan Ge, Ke Zhang, Ruoyi Ruan, Wei Yan
39th IEEE International Conference on Data Engineering (ICDE) 2023