Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes
CIDR 2023
The Conference on Innovative Data Systems Research
Zui Chen, Zihui Gu, Lei Cao, Ju Fan, Samuel Madden, Nan Tang
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration
SIGMOD 2023
ACM SIGMOD Conference on Management of Data
Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Guoliang Li, Xiaoyong Du, Jia Xiaofeng, Song Gao
Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks [pdf]
VLDB 2023
The 49th International Conference on Very Large Data Bases
Jinfeng Peng, Derong Shen, Nan Tang, Tieying Liu, Yue Kou, Tiezheng Nie, Hang Cui, Ge Yu
PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training [pdf]
EMNLP 2022
The 2022 Conference on Empirical Methods in Natural Language Processing
Zihui Gu, Ju Fan, Nan Tang, Preslav Nakov, Xiaoman Zhao, and Xiaoyong Du
Domain Adaptation for Deep Entity Resolution [pdf]
SIGMOD 2022
ACM SIGMOD Conference on Management of Data
Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Chengliang Chai, Guoliang Li, Ruixue Fan, Xiaoyong Du
DADER: Hands-Off Entity Resolution with Domain Adaptation [pdf]
VLDB 2022 (demo)
The 48th International Conference on Very Large Data Bases
Jianhong Tu, Xiaoyue Han, Ju Fan, Nan Tang, Chengliang Chai, Guoliang Li, Xiaoyong Du
Synthesizing Privacy Preserving Entity Resolution Datasets [pdf]
ICDE 2021
The 38th IEEE International Conference on Data Engineering
Xuedi Qin, Chengliang Chai, Nan Tang, Jian Li, Yuyu Luo, Guoliang Li, Yaoyu Zhu
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation [pdf]
VLDB 2021
The 47th International Conference on Very Large Data Bases
Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Sam Madden, Mourad Ouzzani
Deep Learning for Blocking in Entity Matching: A Design Space Exploration [pdf]
VLDB 2021
The 47th International Conference on Very Large Data Bases
Saravanan Thirumuruganathan, Han Li, Nan Tang, Mourad Ouzzani, Yash Govind, Derek Paulsen, Glenn Fung, AnHai Doan
Data Curation with Deep Learning [Vision] [pdf]
EDBT 2020
The 23rd International Conference on Extending Database
Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani, AnHai Doan
Raha: A Configuration-Free Error Detection System [pdf]
SIGMOD 2019
ACM SIGMOD Conference on Management of Data
Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang
Explaining Entity Resolution Predictions : Where are we and What needs to be done? [pdf]
HILDA 2019
Workshop on Human-In-the-Loop Data Analytics (Co-located with SIGMOD)
Saravanan Thirumuruganathan, Mourad Ouzzani, Nan Tang
Unsupervised String Transformation Learning for Entity Consolidation [pdf]
ICDE 2019
The 35th IEEE International Conference on Data Engineering
Dong Deng, Wenbo Tao, Ziawasch Abedjan, Ahmed Elmagarmid, Ihab F. Ilyas, Guoliang Li, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang