All Publications

I strive to make my research accessible, reproducible, and available for consultation. If a paper and/or code repository is not directly available from the list below, feel free to reach out.
ReferenceDatePaperCode

C. Liu, M. Russo, M. Cafarella, L. Cao, P. B. Chen, Z. Chen, M. Franklin, T. Kraska, S. Madden, R. Shahout, G. Vitagliano : International Conference on Innovative Data Systems Research (CIDR) , 2025

2025

M. Hameed, G. Vitagliano, F. Panse, F. Naumann : Proceedings of the International Conference on Extending Database Technology (EDBT) , 2024

2024

OpenProceedings

M. Hameed, G. Vitagliano, F. Naumann : MORPHER: Structural Transformation of ill-formed Rows. Proceedings of the International Conference on Information and Knowledge Management (CIKM) , 2023

2023

G. Vitagliano, M. Hameed, F. Naumann : Structural embedding of data files with MaGRiTTE. Table Representation Learning Workshop at NeurIPS (TRL@NeurIPS) , 2022

2022

TRL@NeurIPS

GitHub

G. Vitagliano, M. Hameed, L. Jiang, L. Reisener, E. Wu, F. Naumann : Pollock: A Data Loading Benchmark. PVLDB 16(8):1870–1882 , 2022

2022

PVLDB

GitHub

G. Vitagliano, L. Reisener, L. Jiang, M. Hameed, F. Naumann : Mondrian: Spreadsheet Layout Detection. Proceedings of the International Conference on Management of Data (SIGMOD) , 2022

2022

ACM

Web demo

M. Hameed, G. Vitagliano, L. Jiang, F. Naumann : SURAGH: Syntactic Pattern Matching to Identify Ill-Formed Records. Proceedings of the International Conference on Extending Database Technology (EDBT) , 2022

2022

OpenProceedings

GitHub

L. Jiang, G. Vitagliano, M. Hameed, F. Naumann : Aggregation Detection in CSV Files. Proceedings of the International Conference on Extending Database Technology (EDBT) , 2022

2022

OpenProceedings

GitHub

G. Vitagliano, L. Jiang, F. Naumann : Detecting Layout Templates in Complex Multiregion Files. PVLDB 15(3):646-658 , 2021

2021

PVDLB

GitHub

L. Jiang, G. Vitagliano, F. Naumann : Structure Detection in Verbose CSV Files. Proceedings of the International Conference on Extending Database Technology (EDBT) , 2021

2021

EDBT

GitHub

L. Jiang, G. Vitagliano, F. Naumann : A Scoring-based Approach for Data Preparator Suggestion. Lernen, Wissen, Daten, Analysen (LWDA) , 2019

2019

HPI

GitHub