Bibtex file with the publications listed below.

2025

  1. Designing Hierarchies for Optimal Hyperbolic Embedding Melika Ayoughi, Max Spengler, Pascal Mettes, and Paul Groth In Proceedings of the 22nd European Semantic Web Conference (ESWC) 2025 [Abs] [Link] [DOI:10.1007/978-3-031-94575-5_20] [Code]
  2. The Five Facets of Data Quality Assessment Sedir Mohammed, Lisa Ehrlinger, Hazar Harmouch, Felix Naumann, and Divesh Srivastava SIGMOD Record 2025 [Link]
  3. Evaluation of unsupervised static topic models’ emergence detection ability Xue Li, Ciro D. Esposito, Paul Groth, Jonathan Sitruk, Balazs Szatmari, and Nachoem Wijnberg PeerJ Computer Science 2025 [Link] [DOI:10.7717/peerj-cs.2875] [Data]
  4. Infrastructure, Intermediaries, and Artificial Intelligence: A Rejoinder to Commentaries on “From Data Creator to Data Reuser: Distance Matters" Christine L. Borgman, and Paul Groth Harvard Data Science Review 2025 [DOI:10.1162/99608f92.c17c3adb]
  5. From Data Creator to Data Reuser: Distance Matters Christine L. Borgman, and Paul Groth Harvard Data Science Review 2025 [DOI:10.1162/99608f92.35d32cfc]
  6. The effects of mismatched train and test data cleaning pipelines on regression models: lessons for practice James Nevin, Michael Lees, and Paul Groth PeerJ Computer Science 2025 [Abs] [Link] [DOI:10.7717/peerj-cs.2793]
  7. The effects of data quality on machine learning performance on tabular data Sedir Mohammed, Lukas Budach, Moritz Feuerpfeil, Nina Ihde, Andrea Nathansen, Nele Noack, Hendrik Patzlaff, Felix Naumann, and Hazar Harmouch Information Systems 2025 [Abs] [Link] [DOI:https://doi.org/10.1016/j.is.2025.102549]
  8. Step-by-Step Data Cleaning Recommendations to Improve ML Prediction Accuracy Sedir Mohammed, Felix Naumann, and Hazar Harmouch In Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025 2025 [Link] [DOI:10.48786/EDBT.2025.43]
  9. Data Systems Education: Curriculum Recommendations, Course Syllabi, and Industry Needs Daphne Miedema, Toni Taipalus, Vangel V. Ajanovski, Abdussalam Alawini, Martin Goodfellow, Michael Liut, Svetlana Peltsverger, and Tiffany Young In 2024 Working Group Reports on Innovation and Technology in Computer Science Education 2025 [Abs] [Link] [DOI:10.1145/3689187.3709609]
  10. ANYMATCH – Efficient Zero-Shot Entity Matching with a Small Language Model Zeyu Zhang, Paul Groth, Iacer Calixto, and Sebastian Schelter In Workshop on Preparing Good Data for Generative AI: Challenges and Approaches at AAAI 2025 [Link] [Code]
  11. ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events Duygu Sezen Islakoglu, and Jan-Christoph Kalo In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics 2025 [arXiv] [Link]
  12. FAIR Research Objects and computational workflows Stian Soiland-Reyes 2025 [Abs] [Link]
  13. Proceedings of the Special Session on Harmonising Generative AI and Semantic Web Technologies (HGAIS 2024) co-located with the 23rd International Semantic Web Conference (ISWC 2024), Baltimore, Maryland, November 13, 2024 2025 [Link]
  14. Contrasting Global and Local Representations for Human Activity Recognition using Graph Neural Networks Andrés Tello, and Victoria Degeler In Proceedings of the 40th ACM/SIGAPP Symposium on Applied Computing 2025 [Abs] [Link] [DOI:10.1145/3672608.3707743]
  15. Rethinking Computing Systems in the Era of Climate Crisis: A Call for a Sustainable Computing Continuum Ella Peltonen, Suzan Bayhan, David Bermbach, Sebastian Buschjager, Victoria Degeler, Aaron Yi Ding, Ozlem Durmaz Incel, Dewant Katare, Mikkel Baun Kjargaard, Sam Leroux, Toktam Mahmoodi, Zoltan Adam Mann, Nirvana Meratnia, Andy D. Pimentel, Jan S. Rellermeyer, Etienne Riviere, Dolly Sapra, Gurkan Solmaz, and Bram van der Waaij IEEE Internet Computing 2025 [DOI:10.1109/MIC.2025.3566642]
  16. 3K: Knowledge-Enriched Digital Twin Framework Erkan Karabulut, Paul Groth, and Victoria Degeler In Proceedings of the 14th International Conference on the Internet of Things 2025 [Abs] [Link] [DOI:10.1145/3703790.3703834]
  17. A Deep Dive Into Cross-Dataset Entity Matching with Large and Small Language Models Zeyu Zhang, Paul Groth, Iacer Calixto, and Sebastian Schelter In Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025 2025 [Link] [DOI:10.48786/EDBT.2025.75] [Code]

2024

  1. Exploiting Subgraphs and Attributes for Representation Learning on Knowledge Graphs Daniel Fernando Daza Cruz 2024 [Abs] [DOI:10.5463/thesis.823]
  2. The Ramifications of Data Handling for Computational Models James Graham Nevin 2024 [Link]
  3. Understanding the Impact of Entity Linking on the Topology of Entity Co-occurrence Networks for Social Media Analysis James Nevin, Pengyu Zhang, Dimitar Dimitrov, Michael Lees, Paul Groth, and Stefan Dietze In Knowledge Engineering and Knowledge Management (EKAW) 2024 [Link] [DOI:10.1007/978-3-031-77792-9_5] [Code]
  4. Large Language Model for Ontology Learning in Drinking Water Distribution Network Domain Y. Huang, E. Karabulut, and V. Degeler In Proceedings of the 24th International Conference on Knowledge Engineering and Knowledge Management Posters, Demos, Workshops, and Tutorials (EKAW‑PDWT 2024) 2024 [Link]
  5. A benchmark for the detection of metalinguistic disagreements between LLMs and knowledge graphs Bradley Allen, and Paul Groth In Proceedings of the Special Session on Harmonising Generative AI and Semantic Web Technologies (HGAIS 2024) 2024 [Link]
  6. Influence Beyond Similarity: A Contrastive Learning Approach to Object Influence Retrieval Teresa Liberatore, Paul Groth, Monika Kackovic, and Nachoem Wijnberg In Knowledge Engineering and Knowledge Management (EKAW) 2024 [Link] [DOI:10.1007/978-3-031-77792-9_3] [Code]
  7. TIGER: Temporally Improved Graph Entity Linker Pengyu Zhang, Congfeng Cao, and Paul Groth In 27th European Conference on Artificial Intelligence (ECAI 24) 2024 [Link] [DOI:10.3233/faia240933] [Code]
  8. DiTEC: Digital Twin for Evolutionary Changes in Water Distribution Networks Victoria Degeler, Mostafa Hadadian, Erkan Karabulut, Alexander Lazovik, Hester Loo, Andrés Tello, and Huy Truong In Leveraging Applications of Formal Methods, Verification and Validation. Application Areas 2024 [Abs] [Link] [Code]
  9. CYCLE: Cross-Year Contrastive Learning in Entity-Linking Pengyu Zhang, Congfeng Cao, Klim Zaporojets, and Paul Groth In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM 24) 2024 [Abs] [Link] [DOI:10.1145/3627673.3679702] [Code]
  10. Testing prompt engineering methods for knowledge extraction from text Fina Polat, Ilaria Tiddi, and Paul Groth Semantic Web 2024 [Link] [DOI:10.3233/sw-243719]
  11. A Sparsity Principle for Partially Observable Causal Representation Learning Danru Xu, Dingling Yao, Sébastien Lachapelle, Perouz Taslakian, Julius Kügelgen, Francesco Locatello, and Sara Magliacane International Conference on Machine Learning (ICML) 2024 [Link]
  12. How different is different? Systematically identifying distribution shifts and their impacts in NER datasets Xue Li, and Paul Groth Language Resources and Evaluation 2024 [Link] [DOI:10.1007/s10579-024-09754-8]
  13. Towards Federated LLM-Powered CEP Rule Generation and Refinement Majid Lotfian Delouee, Daria G. Pernes, Victoria Degeler, and Boris Koldehofe In The 18th ACM International Conference on Distributed and Event-Based Systems (DEBS’24) 2024 [Abs] [Link]
  14. SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection Bradley P. Allen, Fina Polat, and Paul Groth In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024) 2024 [Link] [Code]
  15. Towards Interactively Improving ML Data Preparation Code via "Shadow Pipelines" Stefan Grafberger, Paul Groth, and Sebastian Schelter In Proceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning 2024 [Abs] [Link] [DOI:10.1145/3650203.3663327]
  16. Towards Efficient Data Wrangling with LLMs using Code Generation Xue Li, and Till Döhmen In Proceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning 2024 [Abs] [Link] [DOI:10.1145/3650203.3663334]
  17. Prompt Tuned Embedding Classification for Industry Sector Allocation Valentin Buchner, Lele Cao, Jan-Christoph Kalo, and Vilhelm Von Ehrenheim In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 6: Industry Track) 2024 [Abs] [Link] [DOI:10.18653/v1/2024.naacl-industry.10]
  18. Retrieval-based Question Answering with Passage Expansion Using a Knowledge Graph Benno Kruit, Yiming Xu, and Jan-Christoph Kalo In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) 2024 [Abs] [Link]
  19. Large-Scale Multipurpose Benchmark Datasets For Assessing Data-Driven Deep Learning Approaches For Water Distribution Networks Andrés Tello, Huy Truong, Alexander Lazovik, and Victoria Degeler In Engineering Proceedings 2024 [Abs] [Link] [Data]
  20. Evaluating Class Membership Relations in Knowledge Graphs using Large Language Models Bradley P. Allen, and Paul T. Groth In Proceedings of European Semantic Web Conference Special Track on Large Language Models for Knowledge Engineering 2024 [Link]
  21. Directions Towards Efficient and Automated Data Wrangling with Large Language Models Zeyu Zhang, Paul Groth, Iacer Calixto, and Sebastian Schelter In 2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW) 2024 [Link] [DOI:10.1109/ICDEW61823.2024.00044]
  22. SchemaPile: A Large Collection of Relational Database Schemas Till Döhmen, Radu Geacu, Madelon Hulsebos, and Sebastian Schelter Proc. ACM Manag. Data 2024 [Abs] [Link] [DOI:10.1145/3654975]
  23. Data Debugging with Shapley Importance over Machine Learning Pipelines Bojan Karlaš, David Dao, Matteo Interlandi, Sebastian Schelter, Wentao Wu, and Ce Zhang In The Twelfth International Conference on Learning Representations 2024 [Link]
  24. Multi-View Causal Representation Learning with Partial Observability Dingling Yao, Danru Xu, Sébastien Lachapelle, Sara Magliacane, Perouz Taslakian, Georg Martius, Julius Kügelgen, and Francesco Locatello In The Twelfth International Conference on Learning Representations 2024 [Link] [ :tada: Spotlight Presentation ]
  25. Evaluating FAIR Digital Object and Linked Data as distributed object systems Stian Soiland-Reyes, Carole Goble, and Paul Groth PeerJ Computer Science 2024 [Abs] [Link] [DOI:10.7717/peerj-cs.1781] [Data]
  26. Ontologies in digital twins: A systematic literature review Erkan Karabulut, Salvatore F. Pileggi, Paul Groth, and Victoria Degeler Future Generation Computer Systems 2024 [Link] [DOI:10.1016/j.future.2023.12.013] [Data]
  27. Driving Towards Efficiency: Adaptive Resource-aware Clustered Federated Learning in Vehicular Networks Ahmad Khalil, Majid Lotfian Delouee, Victoria Degeler, Tobias Meuser, Antonio Fernandez Anta, and Boris Koldehofe In The 22nd Mediterranean Communication and Computer Networking Conference (MedComNet’24) 2024 [Abs] [Link]
  28. Empirical ontology design patterns and shapes from Wikidata Valentina Anita Carriero, Paul Groth, and Valentina Presutti Semantic Web 2024 [Link] [DOI:10.3233/sw-243613]
  29. Assisted design of data science pipelines Sergey Redyuk, Zoi Kaoudi, Sebastian Schelter, and Volker Markl The VLDB Journal 2024 [Link] [DOI:10.1007/s00778-024-00835-2]
  30. Table Representation Learning Madelon Hulsebos 2024 [Link]
  31. Domain Generalization in Time Series Forecasting Songgaojun Deng, Olivier Sprangers, Ming Li, Sebastian Schelter, and Maarten Rijke ACM Trans. Knowl. Discov. Data 2024 [Abs] [Link] [DOI:10.1145/3643035]
  32. Large-Scale Forecasting of Electric Vehicle Charging Demand Using Global Time Series Modeling Tijmen Etten, Victoria Degeler, and Ding Luo In Proceedings of the 10th International Conference on Vehicle Technology and Intelligent Transport Systems 2024 [Link] [DOI:10.5220/0012555400003702]
  33. Editorial for the Special Issue on Knowledge Engineering Paul Groth, Eva Blomqvist, and Juan F. Sequeda Journal of Web Semantics 2024 [Link] [DOI:10.1016/j.websem.2024.100840]
  34. Automated Data Cleaning Can Hurt Fairness in Machine Learning-based Decision Making Shubha Guha, Falaah Arif Khan, Julia Stoyanovich, and Sebastian Schelter IEEE Transactions on Knowledge and Data Engineering 2024 [Link] [DOI:10.1109/TKDE.2024.3365524]
  35. Standardizing Knowledge Engineering Practices with a Reference Architecture Bradley P. Allen, and Filip Ilievski Transactions on Graph Data and Knowledge 2024 [Link] [DOI:10.4230/TGDK.2.1.5]
  36. Zero-Shot Topic Classification of Column Headers: Leveraging LLMs for Metadata Enrichment Margherita Martorana, Tobias Kuhn, Lise Stork, and Jacco Ossenbruggen 2024 [Link] [DOI:10.3233/SSW240006]
  37. Red Onions, Soft Cheese and Data: From Food Safety to Data Traceability for Responsible AI Stefan Grafberger, Zeyu Zhang, Sebastian Schelter, and Ce Zhang IEEE Data Engineering Bulletin 2024 [Link]
  38. Too Good To Be True: accuracy overestimation in (re)current practices for Human Activity Recognition Andrés Tello, Victoria Degeler, and Alexander Lazovik In 2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops) 2024 [DOI:10.1109/PerComWorkshops59983.2024.10503465]
  39. Graph Neural Networks for Pressure Estimation in Water Distribution Systems Huy Truong, Andrés Tello, Alexander Lazovik, and Victoria Degeler Water Resources Research 2024 [Link]

2023

  1. BioBLP: a modular framework for learning on multimodal biomedical knowledge graphs Daniel Daza, Dimitrios Alivanistos, Payal Mitra, Thom Pijnenburg, Michael Cochez, and Paul Groth Journal of Biomedical Semantics 2023 [Link] [DOI:10.1186/s13326-023-00301-y] [Code] [Data]
  2. APP-CEP: Adaptive Pattern-level Privacy Protection in Complex Event Processing Systems Majid Lotfian Delouee, Victoria Degeler, Peter Amthor, and Boris Koldehofe In 10th International Conference on Information Systems Security and Privacy (ICISSP’24) 2023 [Abs] [Link]
  3. Large Language Models and Knowledge Graphs: Opportunities and Challenges Jeff Z. Pan, Simon Razniewski, Jan-Christoph Kalo, Sneha Singhania, Jiaoyan Chen, Stefan Dietze, Hajira Jabeen, Janna Omeliyanenko, Wen Zhang, Matteo Lissandrini, Russa Biswas, Gerard Melo, Angela Bonifati, Edlira Vakaj, Mauro Dragoni, and Damien Graux Transactions on Graph Data and Knowledge 2023 [Link] [DOI:10.4230/TGDK.1.1.2]
  4. Evaluating the Knowledge Base Completion Potential of GPT Blerta Veseli, Simon Razniewski, Jan-Christoph Kalo, and Gerhard Weikum In Findings of the Association for Computational Linguistics: EMNLP 2023 2023 [Abs] [Link] [DOI:10.18653/v1/2023.findings-emnlp.426]
  5. A-NeSI: A Scalable Approximate Method for Probabilistic Neurosymbolic Inference Emile Krieken, Thiviyan Thanapalasingam, Jakub M. Tomczak, Frank Van Harmelen, and Annette Ten Teije In Thirty-seventh Conference on Neural Information Processing Systems 2023 [arXiv] [Link]
  6. Adapting Neural Link Predictors for Data-Efficient Complex Query Answering Erik Arakelyan, Pasquale Minervini, Daniel Daza, Michael Cochez, and Isabelle Augenstein In Thirty-seventh Conference on Neural Information Processing Systems 2023 [arXiv] [Link]
  7. Observatory: Characterizing Embeddings of Relational Tables Tianji Cong, Madelon Hulsebos, Zhenjie Sun, Paul Groth, and H. V. Jagadish Proceedings of the VLDB Endowment 2023 [Link] [DOI:10.14778/3636218.3636237]
  8. Knowledge Engineering Using Large Language Models Bradley P. Allen, Lise Stork, and Paul Groth Transactions on Graph Data and Knowledge 2023 [Link] [DOI:10.4230/TGDK.1.1.3]
  9. Preface: LM-KBC Challenge 2023 Sneha Singhania, Jan-Christoph Kalo, Simon Razniewski, Jeff Z. Pan In Joint proceedings of the 1st workshop on Knowledge Base Construction from Pre-Trained Language Models (KBC-LM) and the 2nd challenge on Language Models for Knowledge Base Construction (LM-KBC) 2023 [Link]
  10. Do Instruction-tuned Large Language Models Help with Relation Extraction? Xue Li, Fina Polat, and Paul Groth In KBC-LM’23: Knowledge Base Construction from Pre-trained Language Models workshop at ISWC 2023 2023 [Link] [Code]
  11. Knowledge-centric Prompt Composition for Knowledge Base Construction from Pre-trained Language Models Xue Li, Anthony Hughes, Majlinda Llugiqi, Fina Polat, Paul Groth, and Fajar J. Ekaputra In KBC-LM’23: Knowledge Base Construction from Pre-trained Language Models workshop at ISWC 2023 2023 [Link] [Code]
  12. Semantic Association Rule Learning from Time Series Data and Knowledge Graphs Erkan Karabulut, Victoria Degeler, and Paul Groth1 In SemIIM’23: 2nd International Workshop on Semantic Industrial Information Modelling co-located with 22nd International Semantic Web Conference (ISWC 2023) 2023 [arXiv] [Link]
  13. Mlwhatif: What If You Could Stop Re-Implementing Your Machine Learning Pipeline Analyses over and Over? Stefan Grafberger, Shubha Guha, Paul Groth, and Sebastian Schelter Proc. VLDB Endow. 2023 [Abs] [Link] [DOI:10.14778/3611540.3611606] [Code]
  14. Improving Graph-to-Text Generation Using Cycle Training Fina Polat, Ilaria Tiddi, Paul Groth, and Piek Vossen In Proceedings of the 4th Conference on Language, Data and Knowledge 2023 [Link]
  15. Harnessing the Web and Knowledge Graphs for Automated Impact Investing Scoring Qingzhi Hu, Daniel Daza, Laurens Swinkels, Kristina Usaite, Robbert-Jan Hoen, and Paul Groth In KDD Fragile Earth Workshop 2023 [Link] [DOI:10.48550/arXiv.2308.02622]
  16. An approach for analysing the impact of data integration on complex network diffusion models James Nevin, Paul Groth, and Michael Lees Journal of Complex Networks 2023 [Link] [DOI:10.1093/comnet/cnad025] [Code]
  17. Self-Contained Entity Discovery from Captioned Videos Melika Ayoughi, Pascal Mettes, and Paul Groth ACM Trans. Multimedia Comput. Commun. Appl. 2023 [Abs] [Link] [DOI:10.1145/3583138]
  18. Data journeys: Explaining AI workflows through abstraction Enrico Daga, and Paul Groth Semantic Web 2023 [Abs] [Link] [DOI:10.3233/sw-233407]
  19. Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines Stefan Grafberger, Paul Groth, and Sebastian Schelter Proc. ACM Manag. of Data 2023 [Abs] [Link] [DOI:10.1145/3589273]
  20. GitTables: A Large-Scale Corpus of Relational Tables Madelon Hulsebos, Çagatay Demiralp, and Paul Groth Proc. ACM Manag. Data 2023 [Abs] [Link] [DOI:10.1145/3588710]
  21. AQuA-CEP: Adaptive Quality-Aware Complex Event Processing in the Internet of Things Majid Lotfian Delouee, Boris Koldehofe, and Viktoriya Degeler In The 17th ACM International Conference on Distributed Event-Based Systems (DEBS 2023) 2023 [Abs] [Link]
  22. An Analysis of Machine Learning-Based Semantic Matchmaking Erkan Karabulut, and Rute C. Sofia IEEE Access 2023 [DOI:10.1109/ACCESS.2023.3259360] [Code]
  23. How to Make an Outlier? Studying the Effect of Presentational Features on the Outlierness of Items in Product Search Results Fatemeh Sarvi, Mohammad Aliannejadi, Sebastian Schelter, and Maarten Rijke In Proceedings of the 2023 Conference on Human Information Interaction and Retrieval 2023 [Abs] [Link] [DOI:10.1145/3576840.3578278]
  24. The Mysterious User of Research Data: Knitting Together Science and Technology Studies with Information and Computer Science Kathleen Gregory, Paul Groth, Andrea Scharnhorst, and Sally Wyatt 2023 [Abs] [Link] [DOI:10.1007/978-3-031-11108-2_11]
  25. SemTab 2022: Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, co-located with the 21st International Semantic Web Conference, ISWC 2022, Virtual conference, October 23-27, 2022 2023 [Link]
  26. E2EG: End-to-End Node Classification Using Graph Topology and Text-based Node Attributes Tu Anh Dinh, Jeroen Boef, Joran Cornelisse, and Paul Groth In 2023 IEEE International Conference on Data Mining Workshops (ICDMW) 2023 [DOI:10.1109/ICDMW60847.2023.00142]
  27. Towards Declarative Systems for Data-Centric Machine Learning Stefan Grafberger, Bojan Karlaš, Paul Groth, and Sebastian Schelter In Proceedings of the Data-Centric Machine Learning Research work- shop (DMLR) at ICML, 2023 2023 [Link]
  28. Approximate Answering of Graph Queries Michael Cochez, Dimitrios Alivanistos, Erik Arakelyan, Max Berrendorf, Daniel Daza, Mikhail Galkin, Pasquale Minervini, Mathias Niepert, and Hongyu Ren 2023 [Link] [DOI:10.3233/FAIA230149]
  29. Reasoning beyond Triples: Recent Advances in Knowledge Graph Embeddings Bo Xiong, Mojtaba Nayyeri, Daniel Daza, and Michael Cochez In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, Birmingham, United Kingdom, October 21-25, 2023 2023 [Link] [DOI:10.1145/3583780.3615294]
  30. Reconstructing and Querying ML Pipeline Intermediates Sebastian Schelter In 13th Conference on Innovative Data Systems Research, CIDR 2023, Amsterdam, The Netherlands, January 8-11, 2023 2023 [Link]
  31. Figure of Speech Detection and Generation as a Service in IDN Authoring Support Simon Akkerman, and Frank Nack 2023 [Link] [DOI:10.1007/978-3-031-47658-7_8]
  32. Empowering Machine Learning Development with Service-Oriented Computing Principles Mostafa Hadadian Nejad Yousefi, Viktoriya Degeler, and Alexander Lazovik In Service-Oriented Computing 2023 [Abs]
  33. Results of SemTab 2023 Oktie Hassanzadeh, Nora Abdelmageed, Vasilis Efthymiou, Jiaoyan Chen, Vincenzo Cutrona, Madelon Hulsebos, Ernesto Jiménez-Ruiz, Aamod Khatiwada, Keti Korini, Benno Kruit, Juan Sequeda, and Kavitha Srinivas In Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, SemTab 2023, co-located with the 22nd International Semantic Web Conference, ISWC 2023, Athens, Greece, November 6-10, 2023 2023 [Link]
  34. Introducing the Observatory Library for End-to-End Table Embedding Inference Tianji Cong, Zhenjie Sun, Paul Groth, H. Jagadish, and Madelon Hulsebos In NeurIPS 2023 Second Table Representation Learning Workshop 2023 [Link]
  35. Automated Data Cleaning Can Hurt Fairness in Machine Learning-based Decision Making Shubha Guha, Falaah Arif Khan, Julia Stoyanovich, and Sebastian Schelter In 2023 IEEE 39th International Conference on Data Engineering (ICDE) 2023 [DOI:10.1109/ICDE55515.2023.00303] [Code]
  36. Forget Me Now: Fast and Exact Unlearning in Neighborhood-Based Recommendation Sebastian Schelter, Mozhdeh Ariannezhad, and Maarten Rijke In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval 2023 [Abs] [Link] [DOI:10.1145/3539618.3591989]
  37. Data Integration Landscapes: The Case for Non-optimal Solutions in Network Diffusion Models James Nevin, Paul Groth, and Michael Lees In Computational Science – ICCS 2023 2023 [Abs] [DOI:10.1007/978-3-031-35995-8_35]
  38. Proactively Screening Machine Learning Pipelines with ARGUSEYES Sebastian Schelter, Stefan Grafberger, Shubha Guha, Bojan Karlas, and Ce Zhang In Companion of the 2023 International Conference on Management of Data 2023 [Abs] [Link] [DOI:10.1145/3555041.3589682] [ :tada: 2nd Place Demo ]
  39. Seventh Workshop on Data Management for End-to-End Machine Learning (DEEM) Matthias Boehm, Madelon Hulsebos, Shreya Shankar, and Paroma Varma In Companion of the 2023 International Conference on Management of Data 2023 [Abs] [Link] [DOI:10.1145/3555041.3590819]
  40. Models and Practice of Neural Table Representations Madelon Hulsebos, Xiang Deng, Huan Sun, and Paolo Papotti In Companion of the 2023 International Conference on Management of Data 2023 [Abs] [Link] [DOI:10.1145/3555041.3589411]
  41. Provenance Tracking for End-to-End Machine Learning Pipelines Stefan Grafberger, Paul Groth, and Sebastian Schelter In Companion Proceedings of the ACM Web Conference 2023 2023 [Link] [DOI:10.1145/3543873.3587557]
  42. A Simulation Environment and Reinforcement Learning Method for Waste Reduction Sami Jullien, Mozhdeh Ariannezhad, Paul Groth, and Maarten Rijke Transactions on Machine Learning Research 2023 [Link]
  43. Knowledge Graphs and their Role in the Knowledge Engineering of the 21st Century (Dagstuhl Seminar 22372) Paul Groth, Elena Simperl, Marieke Erp, and Denny Vrandečić Dagstuhl Reports 2023 [Link] [DOI:10.4230/DagRep.12.9.60]
  44. Poster: Towards Pattern-Level Privacy Protection in Distributed Complex Event Processing Majid Lotfian Delouee, Boris Koldehofe, and Viktoriya Degeler In Proceedings of the 17th ACM International Conference on Distributed and Event-Based Systems 2023 [Abs] [Link] [DOI:10.1145/3583678.3603278]
  45. Parameter Efficient Node Classification on Homophilic Graphs Lucas Prieto, Jeroen Den Boef, Paul Groth, and Joran Cornelisse Transactions on Machine Learning Research 2023 [Link] [Code]

2022

  1. Relational graph convolutional networks: a closer look Thiviyan Thanapalasingam, Lucas Berkel, Peter Bloem, and Paul Groth PeerJ Computer Science 2022 [Link] [DOI:10.7717/peerj-cs.1073]
  2. Question Answering with Additive Restrictive Training (QuAART): Question Answering for the Rapid Development of New Knowledge Extraction Pipelines Corey A. Harper, Ron Daniel, and Paul Groth In Knowledge Engineering and Knowledge Management (EKAW) 2022 [Abs] [Link] [DOI:10.1007/978-3-031-17105-5_4]
  3. Serenade - Low-Latency Session-Based Recommendation in e-Commerce at Scale Barrie Kersbergen, Olivier Sprangers, and Sebastian Schelter In Proceedings of the 2022 International Conference on Management of Data 2022 [Abs] [Link] [DOI:10.1145/3514221.3517901]
  4. Towards Data-Centric What-If Analysis for Native Machine Learning Pipelines Stefan Grafberger, Paul Groth, and Sebastian Schelter In Proceedings of the Sixth Workshop on Data Management for End-To-End Machine Learning 2022 [Abs] [Link] [DOI:10.1145/3533028.3533303]
  5. Responsible Data Management Julia Stoyanovich, Serge Abiteboul, Bill Howe, H. V. Jagadish, and Sebastian Schelter Communications of the ACM 2022 [Abs] [Link] [DOI:10.1145/3488717]
  6. SlotGAN: Detecting Mentions in Text via Adversarial Distant Learning Daniel Daza, Michael Cochez, and Paul Groth In Proceedings of the Sixth Workshop on Structured Prediction for NLP 2022 [Abs] [Link] [DOI:10.18653/v1/2022.spnlp-1.4]
  7. CITRIS: Causal Identifiability from Temporal Intervened Sequences Phillip Lippe, Sara Magliacane, Sindy Löwe, Yuki M. Asano, Taco Cohen, and Efstratios Gavves In Proceedings of the 39th International Conference on Machine Learning, ICML 2022 [arXiv]
  8. Methods Included Michael R. Crusoe, Sanne Abeln, Alexandru Iosup, Peter Amstutz, John Chilton, Nebojša Tijanić, Hervé Ménager, Stian Soiland-Reyes, Bogdan Gavrilović, Carole Goble, and The CWL Community Communications of the ACM 2022 [Abs] [Link] [DOI:10.1145/3486897]
  9. Making Canonical Workflow Building Blocks Interoperable across Workflow Languages Stian Soiland-Reyes, Genís Bayarri, Pau Andrio, Robin Long, Douglas Lowe, Ania Niewielska, Adam Hospital, and Paul Groth Data Intelligence 2022 [Abs] [Link] [DOI:10.1162/dint_a_00135]
  10. Letter from the Special Issue Editor Sebastian Schelter IEEE Data Engineering Bulletin (Special issue on Directions Towards GDPR-Compliant Data Systems and Applications) 2022 [Link]
  11. Defining a Knowledge Graph Development Process Through a Systematic Review Gytundefined Tamašauskaitundefined, and Paul Groth ACM Transactios on Software Engineering and Methodology 2022 [Abs] [Link] [DOI:10.1145/3522586]
  12. Packaging research artefacts with RO-Crate Stian Soiland-Reyes, Peter Sefton, Mercè Crosas, Leyla Jael Castro, Frederik Coppens, José M. Fernández, Daniel Garijo, Björn Grüning, Marco La Rosa, Simone Leo, and al. Data Science 2022 [Link] [DOI:10.3233/DS-210053]
  13. Data distribution debugging in machine learning pipelines Stefan Grafberger, Paul Groth, Julia Stoyanovich, and Sebastian Schelter The VLDB Journal 2022 [Link] [DOI:10.1007/s00778-021-00726-w]
  14. Structure-based knowledge acquisition from electronic lab notebooks for research data provenance documentation Max Schröder, Susanne Staehlke, Paul Groth, J. Barbara Nebe, Sascha Spors, and Frank Krüger Journal of Biomedical Semantics 2022 [Link] [DOI:10.1186/s13326-021-00257-x]
  15. Screening Native Machine Learning Pipelines with ArgusEyes Sebastian Schelter, Stefan Grafberger, Shubha Guha, Olivier Sprangers, Bojan Karlas, and Ce Zhang In 12th Conference on Innovative Data Systems Research, CIDR 2022, Chaminade, CA, USA, January 9-12, 2022 2022 [Link]
  16. The Semantic Web - 19th International Conference, ESWC 2022, Hersonissos, Crete, Greece, May 29 - June 2, 2022, Proceedings 2022 [Link] [DOI:10.1007/978-3-031-06981-9]
  17. Towards improving Wikidata reuse with emerging patterns Valentina Anita Carriero, Paul Groth, and Valentina Presutti In Proceedings of the 3rd Wikidata Workshop 2022 co-located with the 21st International Semantic Web Conference (ISWC2022), Virtual Event, Hanghzou, China, October 2022 2022 [Link]
  18. Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching co-located with the 20th International Semantic Web Conference (ISWC 2021), Virtual conference, October 27, 2021 Ernesto Jiménez-Ruiz, Vasilis Efthymiou, Jiaoyan Chen, Vincenzo Cutrona, Oktie Hassanzadeh, Juan Sequeda, Kavitha Srinivas, Nora Abdelmageed, Madelon Hulsebos, Daniela Oliveira, and Catia Pesquita 2022 [Link]
  19. AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning Biwei Huang, Fan Feng, Chaochao Lu, Sara Magliacane, and Kun Zhang In International Conference on Learning Representations 2022 [Link] [ :tada: Spotlight Presentation ]
  20. Towards Parameter-Efficient Automation of Data Wrangling Tasks with Prefix-Tuning David Vos, Till Döhmen, and Sebastian Schelter In NeurIPS 2022 First Table Representation Workshop 2022 [Link]
  21. GitSchemas: A Dataset for Automating Relational Data Preparation Tasks Till Döhmen, Madelon Hulsebos, Christian Beecks, and Sebastian Schelter In 2022 IEEE 38th International Conference on Data Engineering Workshops (ICDEW) 2022 [Link] [DOI:10.1109/ICDEW55742.2022.00016]
  22. Making Table Understanding Work in Practice Madelon Hulsebos, Sneha Gathani, James Gale, Isil Dillig, Paul Groth, and Demiralp In 12th Conference on Innovative Data Systems Research, CIDR 2022, Chaminade, CA, USA, January 9-12, 2022 2022 [Link]

2021

  1. The non-linear impact of data handling on network diffusion models James Nevin, Michael Lees, and Paul Groth Patterns 2021 [Link] [DOI:10.1016/j.patter.2021.100397]
  2. GraphPOPE: Retaining Structural Graph Information Using Position-aware Node Embeddings Jeroen Den Boef, Joran Cornelisse, and Paul Groth In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG 2021) 2021 [Link]
  3. Quality Assessment of Knowledge Graph Hierarchies using KG-BERT Kinga Szarkowska, Veronique Moore, Pierre-Yves Vandenbussche, and Paul Groth In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG 2021) 2021 [Link]
  4. Perspectives on automated composition of workflows in the life sciences Anna-Lena Lamprecht, Magnus Palmblad, Jon Ison, Veit Schwämmle, Mohammad Sadnan Al Manir, Ilkay Altintas, Christopher J. O. Baker, Ammar Ben Hadj Amor, Salvador Capella-Gutierrez, Paulos Charonyktakis, Michael R. Crusoe, Yolanda Gil, Carole Goble, Timothy J. Griffin, Paul Groth, Hans Ienasescu, Pratik Jagtap, Matúš Kalaš, Vedran Kasalica, Alireza Khanteymoori, Tobias Kuhn, Hailiang Mei, Hervé Ménager, Steffen Möller, Robin A. Richardson, Vincent Robert, Stian Soiland-Reyes, Robert Stevens, Szoke Szaniszlo, Suzan Verberne, Aswin Verhoeven, and Katherine Wolstencroft F1000Research 2021 [Link] [DOI:10.12688/f1000research.54159.1]
  5. SemEval-2021 Task 8: MeasEval – Extracting Counts and Measurements and their Related Contexts Corey Harper, Jessica Cox, Curt Kohler, Antony Scerri, Ron Daniel Jr., and Paul Groth In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021) 2021 [Abs] [Link] [DOI:10.18653/v1/2021.semeval-1.38] [ :tada: SemEval 2021 Best Task Paper ]
  6. Further with Knowledge Graphs: Proceedings of the 17th International Conference on Semantic Systems, 6–9 September 2021, Amsterdam, The Netherlands 2021 [Link] [DOI:10.3233/SSW53]
  7. Reinforcement Learning–Based Collective Entity Alignment with Adaptive Features Weixin Zeng, Xiang Zhao, Jiuyang Tang, Xuemin Lin, and Paul Groth ACM Trans. Inf. Syst. 2021 [Abs] [Link] [DOI:10.1145/3446428]
  8. Learnings from a Retail Recommendation System on Billions of Interactions at bol.com B. Kersbergen, and S. Schelter In 2021 IEEE 37th International Conference on Data Engineering (ICDE) 2021 [Link] [DOI:10.1109/ICDE51399.2021.00277]
  9. Inductive Entity Representations from Text via Link Prediction Daniel Daza, Michael Cochez, and Paul Groth In Proceedings of The Web Conference 2021 [arXiv] [DOI:10.1145/3442381.3450141] [Code]
  10. Letter from the Special Issue Editor Sebastian Schelter IEEE Data Engineering Bulletin (Special issue on Data validation for machine learning models and applications) 2021 [Link]
  11. Complex Query Answering with Neural Link Predictors Erik Arakelyan, Daniel Daza, Pasquale Minervini, and Michael Cochez In International Conference on Learning Representations (ICLR) 2021 [arXiv] [Link] [ :tada: Outstanding Paper Award ICLR 2021 ]
  12. Taming Technical Bias in Machine Learning Pipelines Sebastian Schelter, and Julia Stoyanovich IEEE Data Engineering Bulletin (Special Issue on Interdisciplinary Perspectives on Fairness and Artificial Intelligence Systems) 2021 [Link]
  13. Talking datasets – Understanding data sensemaking behaviours Laura Koesten, Kathleen Gregory, Paul Groth, and Elena Simperl International Journal of Human-Computer Studies 2021 [Abs] [arXiv] [Link] [DOI:10.1016/j.ijhcs.2020.102562]
  14. The Challenges of Cross-Document Coreference Resolution for Email Xue Li, Sara Magliacane, and Paul Groth In Proceedings of the 11th on Knowledge Capture Conference 2021 [Abs] [Link] [DOI:10.1145/3460210.3493573]
  15. Supporting Ontology Maintenance with Contextual Word Embeddings and Maximum Mean Discrepancy Natasha Shroff, Pierre-Yves Vandenbussche, Véronique Moore, and Paul Groth In Joint Proceedings of the 2nd International Workshop on Deep Learning meets Ontologies and Natural Language Processing (DeepOntoNLP 2021) & 6th International Workshop on Explainable Sentiment Mining and Emotion Detection (X-SENTIMENT 2021) co-located with co-located with 18th Extended Semantic Web Conference 2021, Hersonissos, Greece, June 6th - 7th, 2021 (moved online) 2021 [Link]
  16. Proceedings of Machine Learning with Symbolic Methods and Knowledge Graphs co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021), Virtual, September 17, 2021 Mehwish Alam, Mehdi Ali, Paul Groth, Pascal Hitzler, Jens Lehmann, Heiko Paulheim, Achim Rettinger, Harald Sack, Afshin Sadeghi, and Volker Tresp 2021 [Link]
  17. Verifiably Safe Exploration for End-to-End Reinforcement Learning Nathan Hunt, Nathan Fulton, Sara Magliacane, Trong Nghia Hoang, Subhro Das, and Armando Solar-Lezama In Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control 2021 [Abs] [Link] [DOI:10.1145/3447928.3456653] [ :tada: Best Paper Award ACM HSCC 2021 ]
  18. Summary of Tutorials at The Web Conference 2021 Robert West, Smriti Bhagat, Paul Groth, Marinka Zitnik, Francisco M. Couto, Pasquale Lisena, Albert Meroño-Peñuela, Xiangyu Zhao, Wenqi Fan, Dawei Yin, Jiliang Tang, Linjun Shou, Ming Gong, Jian Pei, Xiubo Geng, Xingjie Zhou, Daxin Jiang, Benjamin Ricaud, Nicolas Aspert, Volodymyr Miz, Jennifer Dy, Stratis Ioannidis, undefinedlkay Yıldız, Rezvaneh Rezapour, Samin Aref, Ly Dinh, Jana Diesner, Alexey Drutsa, Dmitry Ustalov, Nikita Popov, Daria Baidakova, Shubhanshu Mishra, Arjun Gopalan, Da-Cheng Juan, Cesar Ilharco Magalhaes, Chun-Sung Ferng, Allan Heydon, Chun-Ta Lu, Philip Pham, George Yu, Yicheng Fan, Yueqi Wang, Florian Laurent, Yanick Schraner, Christian Scheller, Sharada Mohanty, Jiawei Chen, Xiang Wang, Fuli Feng, Xiangnan He, Irene Teinemaa, Javier Albert, Dmitri Goldenberg, Flavian Vasile, David Rohde, Olivier Jeunen, Amine Benhalloum, Otmane Sakhi, Yu Rong, Wenbing Huang, Tingyang Xu, Yatao Bian, Hong Cheng, Fuchun Sun, Junzhou Huang, Shobeir Fakhraei, Christos Faloutsos, Onur Çelebi, Martin Müller, Manuel Schneider, Olesia Altunina, Wolfram Wingerath, Benjamin Wollmer, Felix Gessert, Stephan Succo, Norbert Ritter, Evann Courdier, Tudor Mihai Avram, Dragan Cvetinovic, Levan Tsinadze, Johny Jose, Rose Howell, Mario Koenig, Michaël Defferrard, Krishnaram Kenthapadi, Ben Packer, Mehrnoosh Sameki, and Nashlie Sephus In Companion Proceedings of the Web Conference 2021 2021 [Abs] [Link] [DOI:10.1145/3442442.3453701]
  19. HedgeCut: Maintaining Randomised Trees for Low-Latency Machine Unlearning Sebastian Schelter, Stefan Grafberger, and Ted Dunning In Proceedings of the 2021 International Conference on Management of Data 2021 [Abs] [Link] [DOI:10.1145/3448016.3457239]
  20. MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines Stefan Grafberger, Shubha Guha, Julia Stoyanovich, and Sebastian Schelter In Proceedings of the 2021 International Conference on Management of Data 2021 [Abs] [Link] [DOI:10.1145/3448016.3452759]
  21. Automating Data Quality Validation for Dynamic Data Ingestion Sergey Redyuk, Zoi Kaoudi, Volker Markl, and Sebastian Schelter In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021 2021 [Link] [DOI:10.5441/002/edbt.2021.07]
  22. JENGA - A Framework to Study the Impact of Data Errors on the Predictions of Machine Learning Models Sebastian Schelter, Tammo Rukat, and Felix Biessmann In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021 2021 [Link] [DOI:10.5441/002/edbt.2021.63]
  23. Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines Stefan Grafberger, Julia Stoyanovich, and Sebastian Schelter In 11th Conference on Innovative Data Systems Research, CIDR 2021, Virtual Event, January 11-15, 2021, Online Proceedings 2021 [Link]

2020

  1. Towards Olfactory Information Extraction from Text: A Case Study on Detecting Smell Experiences in Novels Ryan Brate, Paul Groth, and Marieke Erp In Proceedings of the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2020 [Abs] [Link]
  2. Dataset Reuse: Toward Translating Principles to Practice Laura Koesten, Pavlos Vougiouklis, Elena Simperl, and Paul Groth Patterns 2020 [Link] [DOI:10.1016/j.patter.2020.100136]
  3. Effective distributed representations for academic expert search Mark Berger, Jakub Zavrel, and Paul Groth In Proceedings of the First Workshop on Scholarly Document Processing at EMNLP 2020 [Abs] [Link]
  4. Dataset search: a survey Adriane Chapman, Elena Simperl, Laura Koesten, George Konstantinidis, Luis-Daniel Ibáñez, Emilia Kacprzak, and Paul Groth The VLDB Journal 2020 [arXiv] [Link] [DOI:10.1007/s00778-019-00564-x]
  5. Introduction – FAIR data, systems and analysis Paul Groth, and Michel Dumontier Data Science 2020 [Link] [DOI:10.3233/DS-200029]
  6. Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning Ke Yang, Biao Huang, Julia Stoyanovich, and Sebastian Schelter In Workshop on Human-In-the-Loop Data Analytics (HILDA’20) 2020 [Link] [DOI:10.1145/3398730.3399194]
  7. Towards Entity Spaces Marieke Erp, and Paul Groth In Proceedings of The 12th Language Resources and Evaluation Conference 2020 [Abs] [Link]
  8. Lost or Found? Discovering Data Needed for Research Kathleen Gregory, Paul Groth, Andrea Scharnhorst, and Sally Wyatt Harvard Data Science Review 2020 [Link] [DOI:10.1162/99608f92.e38165eb]
  9. PANDAcap: A Framework for Streamlining Collection of Full-System Traces Manolis Stamatogiannakis, Herbert Bos, and Paul Groth In EuroSec 2020 [Link] [DOI:10.1145/3380786.3391396] [Code]
  10. Estimating the imageability of words by mining visual characteristics from crawled image data Marc A. Kastner, Ichiro Ide, Frank Nack, Yasutomo Kawanishi, Takatsugu Hirayama, Daisuke Deguchi, and Hiroshi Murase Multimedia Tools and Applications 2020 [Link] [DOI:10.1007/s11042-019-08571-4]
  11. FAIR Data Reuse – the Path through Data Citation Paul Groth, Helena Cousijn, Tim Clark, and Carole Goble Data Intelligence 2020 [Link] [DOI:10.1162/dint_a_00030]
  12. Message Passing Query Embedding Daniel Daza, and Michael Cochez In ICML Workshop - Graph Representation Learning and Beyond 2020 [arXiv] [Link]
  13. The state of altmetrics: a tenth anniversary celebration Altmetric Engineering, Stacy Konkiel, Jason Priem, Euan Adie, Gemma Derrick, Fereshteh Didegah, Paul Groth, Cameron Neylon, Shenmeng Xu, Zohreh Zahedi, Timothy Bowman, Vanash M Patel, Robin Haunschild, Lutz Bornmann, Mike Taylor, Liesa Ross, Yin-Leng Theng, Saeed-Ul Hassan, and Naif R. Aljohani 2020 [Link] [DOI:10.6084/M9.FIGSHARE.13010000.V2]
  14. CSSA’20: Workshop on Combining Symbolic and Sub-Symbolic Methods and Their Applications Mehwish Alam, Paul Groth, Pascal Hitzler, Heiko Paulheim, Harald Sack, and Volker Tresp In Proceedings of the 29th ACM International Conference on Information & Knowledge Management 2020 [Abs] [Link] [DOI:10.1145/3340531.3414072]
  15. ICIDS2020 Panel: Building the Discipline of Interactive Digital Narratives Mark Bernstein, Mirjam Palosaari Eladhari, Hartmut Koenitz, Sandy Louchart, Frank Nack, Chris Martens, Giulia Carla Rossi, Anne-Gwenn Bosser, and David E. Millard In Interactive Storytelling 2020 [Abs] [DOI:10.1007/978-3-030-62516-0_1]
  16. Technical Perspective: Query Optimization for Faster Deep CNN Explanations Sebastian Schelter ACM SIGMOD Record 2020 [Link]
  17. Apache Mahout: Machine Learning on Distributed Dataflow Systems Robin Anil, Gokhan Capan, Isabel Drost-Fromm, Ted Dunning, Ellen Friedman, Trevor Grant, Shannon Quinn, Paritosh Ranjan, Sebastian Schelter, and Özgür Yılmazel Journal of Machine Learning Research 2020 [Link]
  18. Semantic Systems. In the Era of Knowledge Graphs - 16th International Conference on Semantic Systems, SEMANTiCS 2020, Amsterdam, The Netherlands, September 7-10, 2020, Proceedings Eva Blomqvist, Paul Groth, Victor Boer, Tassilo Pellegrini, Mehwish Alam, Tobias Käfer, Peter Kieseberg, Sabrina Kirrane, Albert Meroño-Peñuela, and Harshvardhan J. Pandit 2020 [Link] [DOI:10.1007/978-3-030-59833-4]
  19. A longitudinal analysis of university rankings Friso Selten, Cameron Neylon, Chun-Kai Huang, and Paul Groth Quantitative Science Studies 2020 [Link] [DOI:10.1162/qss_a_00052]

2019

  1. How Relevant Is Your Choice? Lobke Kolhoff, and Frank Nack In ICIDS 2019. Lecture Notes in Computer Science, vol 11869 2019 [Abs]
  2. Transfer Learning for Biomedical Named Entity Recognition with BioBERT Anthi Symeonidou, Viachaslau Sazonau, and Paul Groth In Proceedings of the Posters and Demo Track of the 15th International Conference on Semantic Systems co-located with 15th International Conference on Semantic Systems (SEMANTiCS 2019), Karlsruhe, Germany, September 9th - to - 12th, 2019. 2019 [Link]
  3. Understanding data search as a socio-technical practice Kathleen M Gregory, Helena Cousijn, Paul Groth, Andrea Scharnhorst, and Sally Wyatt Journal of Information Science 2019 [Abs] [Link] [DOI:10.1177/0165551519837182]
  4. Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines Kathleen Gregory, Paul Groth, Helena Cousijn, Andrea Scharnhorst, and Sally Wyatt Journal of the Association for Information Science and Technology 2019 [Abs] [Link] [DOI:10.1002/asi.24165]
  5. End-to-End Learning for Answering Structured Queries Directly over Text Paul T. Groth, Antony Scerri, Ron Daniel, and Bradley P. Allen In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG2019) Co-located with the 16th Extended Semantic Web Conference 2019 (ESWC 2019), Portoroz, Slovenia, June 2, 2019. 2019 [arXiv] [Link]

2018

  1. Open Information Extraction on Scientific Text: An Evaluation Paul T. Groth, Michael Lauruhn, Antony Scerri, and Ron Daniel In Proceedings of the 27th International Conference on Computational Linguistics, COLING 2018, Santa Fe, New Mexico, USA, August 20-26, 2018 2018 [Link]
  2. Elsevier’s Healthcare Knowledge Graph and the Case for Enterprise Level Linked Data Standards Alex DeJong, Radmila Bord, Will Dowling, Rinke Hoekstra, Ryan Moquin, Charlie O, Mevan Samarasinghe, Paul Snyder, Craig Stanley, Anna Tordai, Michael Trefry, and Paul T. Groth In Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, USA, October 8th - to - 12th, 2018. 2018 [Link]
  3. Use of Internal Testing Data to Help Determine Compensation for Crowdsourcing Tasks Michael Lauruhn, Paul T. Groth, Corey A. Harper, and Helena F. Deus In Proceedings of the 2nd International Workshop on Augmenting Intelligence with Humans\--in-\-the-\-Loop co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, California, October 9th, 2018. 2018 [Link]