All Publications

I strive to make my research accessible, reproducible, and available for consultation. If a paper and/or code repository is not directly available from the list below, feel free to reach out.
ReferenceDatePaperCode

M. Hameed, G. Vitagliano, F. Naumann : MORPHER: Structural Transformation of ill-formed Rows. Proceedings of the International Conference on Information and Knowledge Management (CIKM) , 2023

2023

G. Vitagliano, M. Hameed, F. Naumann : Structural embedding of data files with MaGRiTTE. Table Representation Learning Workshop at NeurIPS (TRL@NeurIPS) , 2022

2022

TRL@NeurIPS

GitHub

G. Vitagliano, M. Hameed, L. Jiang, L. Reisener, E. Wu, F. Naumann : Pollock: A Data Loading Benchmark. PVLDB 16(8):1870–1882 , 2022

2022

PVLDB

GitHub

G. Vitagliano, L. Reisener, L. Jiang, M. Hameed, F. Naumann : Mondrian: Spreadsheet Layout Detection. Proceedings of the International Conference on Management of Data (SIGMOD) , 2022

2022

ACM

Web demo

M. Hameed, G. Vitagliano, L. Jiang, F. Naumann : SURAGH: Syntactic Pattern Matching to Identify Ill-Formed Records. Proceedings of the International Conference on Extending Database Technology (EDBT) , 2022

2022

OpenProceedings

GitHub

L. Jiang, G. Vitagliano, M. Hameed, F. Naumann : Aggregation Detection in CSV Files. Proceedings of the International Conference on Extending Database Technology (EDBT) , 2022

2022

OpenProceedings

GitHub

G. Vitagliano, L. Jiang, F. Naumann : Detecting Layout Templates in Complex Multiregion Files. PVLDB 15(3):646-658 , 2021

2021

PVDLB

GitHub

L. Jiang, G. Vitagliano, F. Naumann : Structure Detection in Verbose CSV Files. Proceedings of the International Conference on Extending Database Technology (EDBT) , 2021

2021

EDBT

GitHub

L. Jiang, G. Vitagliano, F. Naumann : A Scoring-based Approach for Data Preparator Suggestion. Lernen, Wissen, Daten, Analysen (LWDA) , 2019

2019

HPI

GitHub