RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes
Zan Ahmad Naeem, Mohammad Shahmeer Ahmad, Mohamed Y. Eltabakh, Mourad Ouzzani, Nan Tang
Proceedings of the VLDB Endowment (2024)
Cross Modal Data Discovery over Structured and Unstructured Data Lakes
Mohamed Y. Eltabakh, Mayuresh Kunjir, Ahmed Elmagarmid, Mohammad Shahmeer Ahmad
Proceedings of the VLDB Endowment (2023)
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation
Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Sam Madden, Mourad Ouzzani
Proceedings of the VLDB Endowment (2021)
Deep Learning for Blocking in Entity Matching: A Design Space Exploration
Saravanan Thirumuruganathan, Han Li, Nan Tang, Mourad Ouzzani, Yash Govind, Derek Paulsen, Glenn Fung, AnHai Doan
Proceedings of the VLDB Endowment (2021)
DeepER–Deep Entity Resolution
Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq Joty, Mourad Ouzzani, Nan Tang
Proceedings of the VLDB Endowment (2018)
Rayyan—a web and mobile app for systematic reviews
Mourad Ouzzani, Hossam Hammady, Zbys Fedorowicz, Ahmed Elmagarmid
Systematic Reviews (2016)