AI foundations and systems

Deep learning, transformers and graph neural networks: a linear algebra perspective

Abdelkader Baggag, Yousef Saad
Numerical Algorithms (2025)

Distortion-aware Brushing for Reliable Cluster Analysis in Multidimensional Projections

Hyeon Jeon, Michael Aupetit, Soohyun Lee, Kwon Ko, Youngtaek Kim, Ghulam Jilani Quadri
IEEE Transactions on Visualization and Computer Graphics (2025)

ArnoldiGCL: Graph Contrastive Learning via Learnable Arnoldi-Based Guided Spectral Chebyshev Polynomial Filters

Mustafa Coşkun, Abdelkader Baggag, Mehmet Koyutürk
Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2025)

HCT-QA: A Benchmark for Question Answering on Human-Centric Tables

Mohammad Shahmeer Ahmad, Zan A Naeem, Michael Aupetit, Ahmed Elmagarmid, Mohamed Eltabakh, Xiasong Ma, Mourad Ouzzani, Chaoyi Ruan
arXiv preprint arXiv:2504.20047 (2025)

Measuring the Validity of Clustering Validation Datasets

Hyeon Jeon, Michael Aupetit, DongHwa Shin, Aeri Cho, Seokhyeon Park, Jinwook Seo
IEEE Transaction on Pattern Analysis and Machine Intelligence (2025)

Fanar: An Arabic-Centric Multimodal Generative AI Platform

Fanar Team: Ummar Abbas, Mohammad Shahmeer Ahmad, Firoj Alam, Enes Altinisik, Ehsannedin Asgari, Yazan Boshmaf, Sabri Boughorbel, Sanjay Chawla, Shammur Chowdhury, Fahim Dalvi, Kareem Darwish, Nadir Durrani, Mohamed Elfeky, Ahmed Elmagarmid, Mohamed Eltabakh, Masoomali Fatehkia, Anastasios Fragkopoulos, Maram Hasanain, Majd Hawasly, Mus’ab Husaini, Soon-Gyo Jung, Ji Kim Lucas, Walid Magdy, Safa Messaoud, Abubakr Mohamed, Tasnim Mohiuddin, Basel Mousi, Hamdy Mubarak, Ahmad Musleh, Zan Naeem, Mourad Ouzzani, Dorde Popovic, Amin Sadeghi, Husrev Taha Sencar, Mohammed Shinoy, Omar Sinan, Yifan Zhang, Ahmed Ali, Yassine El Kheir, Xiaosong Ma, Chaoyi Ruan
arXiv preprint arXiv:2501.13944 (2025)

RetClean: Retrieval-Based Data Cleaning Using LLMs and DataLakes

Zan Ahmad Naeem, Mohammad Shahmeer Ahmad, Mohamed Y Eltabakh, Mourad Ouzzani, Nan Tang
Proceedings of the VLDB Endowment (2024)

A pragmatic perspective on AI transparency at workplace

Ghanim Al-Sulaiti, Mohammad Amin Sadeghi, Lokendra Chauhan, Ji Lucas, Sanjay Chawla, Ahmed Elmagarmid
AI and Ethics (2024)

Classes are Not Clusters: Improving Label-Based Evaluation of Dimensionality Reduction

Hyeon Jeon, Yun-Hsin Kuo, Michael Aupetit, Kwan-Liu Ma, Jinwook Seo
IEEE Transactions on Visualization and Computer Graphics (2024)

Cross Modal Data Discovery over Structured and Unstructured Data Lakes

Mohamed Y. Eltabakh, Mayuresh Kunjir, Ahmed Elmagarmid, Mohammad Shahmeer Ahmad
Proceedings of the VLDB Endowment (2023)

RPT: relational pre-trained transformer is almost all you need towards democratizing data preparation

Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Sam Madden, Mourad Ouzzani
Proceedings of the VLDB Endowment (2021)

Deep Learning for Blocking in Entity Matching: A Design Space Exploration

Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq Joty, Mourad Ouzzani, Nan Tang
Proceedings of the VLDB Endowment (2021)

Steering Distortions to Preserve Classes and Neighbors in Supervised Dimensionality Reduction

Benoît Colange, Jaakko Peltonen, Michael Aupetit, Denys Dutykh, Sylvain Lespinats
Proceedings of NeurIPS (2020)

Toward Perception-Based Evaluation of Clustering Techniques for Visual Analytics

Michael Aupetit, Michael Sedlmair, Mostafa M. Abbas, Abdelkader Baggag, Halima Bensmail
Proceedings of the IEEE Visualization Conference (2019)

DeepER–Deep Entity Resolution

Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq Joty, Mourad Ouzzani, Nan Tang
Proceedings of the VLDB Endowment (2018)

Multidimensional Projection for Visual Analytics: Linking Techniques with Distortions, Tasks, and Layout Enrichment

Luis Gustavo Nonato, Michael Aupetit
IEEE Transactions on Visualization and Computer Graphics (2019)

Rayyan—a web and mobile app for systematic reviews

Mourad Ouzzani, Hossam Hammady, Zbys Fedorowicz, Ahmed Elmagarmid
Systematic Reviews (2016)