publications
Selected recent publications. Generated by jekyll-scholar.
Bibtex file with the publications listed below.
2026
-
A Multivariate Statistical Framework for Detection, Classification and Pre-localization of Anomalies in Water Distribution Networks Expert Systems with Applications 2026 [Abs] [Link] [DOI:https://doi.org/10.1016/j.eswa.2026.131450]
2025
-
Minimizing Hyperbolic Embedding Distortion with LLM-Guided Hierarchy Restructuring In Proceedings of the 13th Knowledge Capture Conference 2025 2025 [Abs] [Link] [DOI:10.1145/3731443.3771357]
-
Pattern-Level Privacy Protection in Event-Based Systems SN Computer Science 2025 [Abs] [Link] [DOI:10.1007/s42979-025-04574-1]
-
DiTEC-WDN: A Large-Scale Dataset of Hydraulic Scenarios across Multiple Water Distribution Networks Scientific Data 2025 [Abs] [Link] [DOI:10.1038/s41597-025-06026-0]
-
LM-KBC 2025: 4th Challenge on Knowledge Base Construction from Pre-trained Language Models In Joint Proceedings of the 3rd Workshop on Knowledge Base Construction from Pre-Trained Language Models and the 4th Challenge on Language Models for Knowledge Base Construction (KBC-LM+LM-KBC 2025) 2025 [Link]
-
Learning Semantic Association Rules from Internet of Things Data Neurosymbolic Artificial Intelligence 2025 [Abs] [Link] [DOI:10.1177/29498732251377518] [Code]
-
mlidea: Interactively Improving ML Data Preparation Code via "Shadow Pipelines" Proc. VLDB Endow. 2025 [Abs] [Link] [DOI:10.14778/3750601.3750671]
-
A survey of large language models for data challenges in graphs Expert Systems with Applications 2025 [Abs] [arXiv] [Link] [DOI:https://doi.org/10.1016/j.eswa.2025.129643] [Code]
-
PyAerial: Scalable association rule mining from tabular data SoftwareX 2025 [Abs] [Link] [DOI:10.1016/j.softx.2025.102341] [Code]
-
The Librarian Skillset and the Needs of Artificial Intelligence Cataloging & Classification Quarterly 2025 [Link] [DOI:10.1080/01639374.2025.2539787]
-
BASIL DB: bioactive semantic integration and linking database Journal of Biomedical Semantics 2025 [Link] [DOI:10.1186/s13326-025-00336-3]
-
Designing Hierarchies for Optimal Hyperbolic Embedding In Proceedings of the 22nd European Semantic Web Conference (ESWC) 2025 [Abs] [Link] [DOI:10.1007/978-3-031-94575-5_20] [Code]
-
Evaluation of unsupervised static topic models’ emergence detection ability PeerJ Computer Science 2025 [Link] [DOI:10.7717/peerj-cs.2875] [Data]
-
Infrastructure, Intermediaries, and Artificial Intelligence: A Rejoinder to Commentaries on “From Data Creator to Data Reuser: Distance Matters" Harvard Data Science Review 2025 [DOI:10.1162/99608f92.c17c3adb]
-
From Data Creator to Data Reuser: Distance Matters Harvard Data Science Review 2025 [DOI:10.1162/99608f92.35d32cfc]
-
The effects of mismatched train and test data cleaning pipelines on regression models: lessons for practice PeerJ Computer Science 2025 [Abs] [Link] [DOI:10.7717/peerj-cs.2793]
-
Towards Graph Foundation Models for Water Distribution Networks 2025 [Abs] [Link] [DOI:10.15131/SHEF.DATA.29921231]
-
Proceedings of the Special Session on Harmonising Generative AI and Semantic Web Technologies (HGAIS 2024) co-located with the 23rd International Semantic Web Conference (ISWC 2024), Baltimore, Maryland, November 13, 2024 2025 [Link]
-
Contrasting Global and Local Representations for Human Activity Recognition using Graph Neural Networks In Proceedings of the 40th ACM/SIGAPP Symposium on Applied Computing 2025 [Abs] [Link] [DOI:10.1145/3672608.3707743]
-
Rethinking Computing Systems in the Era of Climate Crisis: A Call for a Sustainable Computing Continuum IEEE Internet Computing 2025 [DOI:10.1109/MIC.2025.3566642]
-
3K: Knowledge-Enriched Digital Twin Framework In Proceedings of the 14th International Conference on the Internet of Things 2025 [Abs] [Link] [DOI:10.1145/3703790.3703834]
-
A Deep Dive Into Cross-Dataset Entity Matching with Large and Small Language Models In Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025 2025 [Link] [DOI:10.48786/EDBT.2025.75] [Code]
-
The effects of data quality on machine learning performance on tabular data Information Systems 2025 [Abs] [Link] [DOI:https://doi.org/10.1016/j.is.2025.102549]
-
Step-by-Step Data Cleaning Recommendations to Improve ML Prediction Accuracy In Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025 2025 [Link] [DOI:10.48786/EDBT.2025.43]
-
Data Systems Education: Curriculum Recommendations, Course Syllabi, and Industry Needs In 2024 Working Group Reports on Innovation and Technology in Computer Science Education 2025 [Abs] [Link] [DOI:10.1145/3689187.3709609]
2024
-
Exploiting Subgraphs and Attributes for Representation Learning on Knowledge Graphs 2024 [Abs] [DOI:10.5463/thesis.823]
-
Understanding the Impact of Entity Linking on the Topology of Entity Co-occurrence Networks for Social Media Analysis In Knowledge Engineering and Knowledge Management (EKAW) 2024 [Link] [DOI:10.1007/978-3-031-77792-9_5] [Code]
-
Large Language Model for Ontology Learning in Drinking Water Distribution Network Domain In Proceedings of the 24th International Conference on Knowledge Engineering and Knowledge Management Posters, Demos, Workshops, and Tutorials (EKAW‑PDWT 2024) 2024 [Link]
-
A benchmark for the detection of metalinguistic disagreements between LLMs and knowledge graphs In Proceedings of the Special Session on Harmonising Generative AI and Semantic Web Technologies (HGAIS 2024) 2024 [Link]
-
Influence Beyond Similarity: A Contrastive Learning Approach to Object Influence Retrieval In Knowledge Engineering and Knowledge Management (EKAW) 2024 [Link] [DOI:10.1007/978-3-031-77792-9_3] [Code]
-
TIGER: Temporally Improved Graph Entity Linker In 27th European Conference on Artificial Intelligence (ECAI 24) 2024 [Link] [DOI:10.3233/faia240933] [Code]
-
CYCLE: Cross-Year Contrastive Learning in Entity-Linking In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM 24) 2024 [Abs] [Link] [DOI:10.1145/3627673.3679702] [Code]
-
Testing prompt engineering methods for knowledge extraction from text Semantic Web 2024 [Link] [DOI:10.3233/sw-243719]
-
A Sparsity Principle for Partially Observable Causal Representation Learning International Conference on Machine Learning (ICML) 2024 [Link]
-
How different is different? Systematically identifying distribution shifts and their impacts in NER datasets Language Resources and Evaluation 2024 [Link] [DOI:10.1007/s10579-024-09754-8]
-
Towards Interactively Improving ML Data Preparation Code via "Shadow Pipelines" In Proceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning 2024 [Abs] [Link] [DOI:10.1145/3650203.3663327]
-
Towards Efficient Data Wrangling with LLMs using Code Generation In Proceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning 2024 [Abs] [Link] [DOI:10.1145/3650203.3663334]
-
Prompt Tuned Embedding Classification for Industry Sector Allocation In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 6: Industry Track) 2024 [Abs] [Link] [DOI:10.18653/v1/2024.naacl-industry.10]
-
Evaluating Class Membership Relations in Knowledge Graphs using Large Language Models In Proceedings of European Semantic Web Conference Special Track on Large Language Models for Knowledge Engineering 2024 [Link]
-
Directions Towards Efficient and Automated Data Wrangling with Large Language Models In 2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW) 2024 [Link] [DOI:10.1109/ICDEW61823.2024.00044]
-
SchemaPile: A Large Collection of Relational Database Schemas Proc. ACM Manag. Data 2024 [Abs] [Link] [DOI:10.1145/3654975]
-
Data Debugging with Shapley Importance over Machine Learning Pipelines In The Twelfth International Conference on Learning Representations 2024 [Link]
-
Multi-View Causal Representation Learning with Partial Observability In The Twelfth International Conference on Learning Representations 2024 [Link] [
Spotlight Presentation ]
-
Evaluating FAIR Digital Object and Linked Data as distributed object systems PeerJ Computer Science 2024 [Abs] [Link] [DOI:10.7717/peerj-cs.1781] [Data]
-
Ontologies in digital twins: A systematic literature review Future Generation Computer Systems 2024 [Link] [DOI:10.1016/j.future.2023.12.013] [Data]
-
Empirical ontology design patterns and shapes from Wikidata Semantic Web 2024 [Link] [DOI:10.3233/sw-243613]
-
Assisted design of data science pipelines The VLDB Journal 2024 [Link] [DOI:10.1007/s00778-024-00835-2]
-
Domain Generalization in Time Series Forecasting ACM Trans. Knowl. Discov. Data 2024 [Abs] [Link] [DOI:10.1145/3643035]
-
Large-Scale Forecasting of Electric Vehicle Charging Demand Using Global Time Series Modeling In Proceedings of the 10th International Conference on Vehicle Technology and Intelligent Transport Systems 2024 [Link] [DOI:10.5220/0012555400003702]
-
Editorial for the Special Issue on Knowledge Engineering Journal of Web Semantics 2024 [Link] [DOI:10.1016/j.websem.2024.100840]
-
Automated Data Cleaning Can Hurt Fairness in Machine Learning-based Decision Making IEEE Transactions on Knowledge and Data Engineering 2024 [Link] [DOI:10.1109/TKDE.2024.3365524]
-
Standardizing Knowledge Engineering Practices with a Reference Architecture Transactions on Graph Data and Knowledge 2024 [Link] [DOI:10.4230/TGDK.2.1.5]
-
Zero-Shot Topic Classification of Column Headers: Leveraging LLMs for Metadata Enrichment 2024 [Link] [DOI:10.3233/SSW240006]
-
Red Onions, Soft Cheese and Data: From Food Safety to Data Traceability for Responsible AI IEEE Data Engineering Bulletin 2024 [Link]
-
Too Good To Be True: accuracy overestimation in (re)current practices for Human Activity Recognition In 2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops) 2024 [DOI:10.1109/PerComWorkshops59983.2024.10503465]
-
Graph Neural Networks for Pressure Estimation in Water Distribution Systems Water Resources Research 2024 [Link]
2023
-
BioBLP: a modular framework for learning on multimodal biomedical knowledge graphs Journal of Biomedical Semantics 2023 [Link] [DOI:10.1186/s13326-023-00301-y] [Code] [Data]
-
Large Language Models and Knowledge Graphs: Opportunities and Challenges Transactions on Graph Data and Knowledge 2023 [Link] [DOI:10.4230/TGDK.1.1.2]
-
Evaluating the Knowledge Base Completion Potential of GPT In Findings of the Association for Computational Linguistics: EMNLP 2023 2023 [Abs] [Link] [DOI:10.18653/v1/2023.findings-emnlp.426]
-
Observatory: Characterizing Embeddings of Relational Tables Proceedings of the VLDB Endowment 2023 [Link] [DOI:10.14778/3636218.3636237]
-
Knowledge Engineering Using Large Language Models Transactions on Graph Data and Knowledge 2023 [Link] [DOI:10.4230/TGDK.1.1.3]
-
Preface: LM-KBC Challenge 2023 Sneha Singhania, Jan-Christoph Kalo, Simon Razniewski, Jeff Z. Pan In Joint proceedings of the 1st workshop on Knowledge Base Construction from Pre-Trained Language Models (KBC-LM) and the 2nd challenge on Language Models for Knowledge Base Construction (LM-KBC) 2023 [Link]
-
Mlwhatif: What If You Could Stop Re-Implementing Your Machine Learning Pipeline Analyses over and Over? Proc. VLDB Endow. 2023 [Abs] [Link] [DOI:10.14778/3611540.3611606] [Code]
-
Improving Graph-to-Text Generation Using Cycle Training In Proceedings of the 4th Conference on Language, Data and Knowledge 2023 [Link]
-
Harnessing the Web and Knowledge Graphs for Automated Impact Investing Scoring In KDD Fragile Earth Workshop 2023 [Link] [DOI:10.48550/arXiv.2308.02622]
-
An approach for analysing the impact of data integration on complex network diffusion models Journal of Complex Networks 2023 [Link] [DOI:10.1093/comnet/cnad025] [Code]
-
Self-Contained Entity Discovery from Captioned Videos ACM Trans. Multimedia Comput. Commun. Appl. 2023 [Abs] [Link] [DOI:10.1145/3583138]
-
Data journeys: Explaining AI workflows through abstraction Semantic Web 2023 [Abs] [Link] [DOI:10.3233/sw-233407]
-
Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines Proc. ACM Manag. of Data 2023 [Abs] [Link] [DOI:10.1145/3589273]
-
GitTables: A Large-Scale Corpus of Relational Tables Proc. ACM Manag. Data 2023 [Abs] [Link] [DOI:10.1145/3588710]
-
An Analysis of Machine Learning-Based Semantic Matchmaking IEEE Access 2023 [DOI:10.1109/ACCESS.2023.3259360] [Code]
-
How to Make an Outlier? Studying the Effect of Presentational Features on the Outlierness of Items in Product Search Results In Proceedings of the 2023 Conference on Human Information Interaction and Retrieval 2023 [Abs] [Link] [DOI:10.1145/3576840.3578278]
-
The Mysterious User of Research Data: Knitting Together Science and Technology Studies with Information and Computer Science 2023 [Abs] [Link] [DOI:10.1007/978-3-031-11108-2_11]
-
SemTab 2022: Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, co-located with the 21st International Semantic Web Conference, ISWC 2022, Virtual conference, October 23-27, 2022 2023 [Link]
-
E2EG: End-to-End Node Classification Using Graph Topology and Text-based Node Attributes In 2023 IEEE International Conference on Data Mining Workshops (ICDMW) 2023 [DOI:10.1109/ICDMW60847.2023.00142]
-
Towards Declarative Systems for Data-Centric Machine Learning In Proceedings of the Data-Centric Machine Learning Research work- shop (DMLR) at ICML, 2023 2023 [Link]
-
Reasoning beyond Triples: Recent Advances in Knowledge Graph Embeddings In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, Birmingham, United Kingdom, October 21-25, 2023 2023 [Link] [DOI:10.1145/3583780.3615294]
-
Reconstructing and Querying ML Pipeline Intermediates In 13th Conference on Innovative Data Systems Research, CIDR 2023, Amsterdam, The Netherlands, January 8-11, 2023 2023 [Link]
-
Figure of Speech Detection and Generation as a Service in IDN Authoring Support 2023 [Link] [DOI:10.1007/978-3-031-47658-7_8]
-
Empowering Machine Learning Development with Service-Oriented Computing Principles In Service-Oriented Computing 2023 [Abs]
-
Results of SemTab 2023 In Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, SemTab 2023, co-located with the 22nd International Semantic Web Conference, ISWC 2023, Athens, Greece, November 6-10, 2023 2023 [Link]
-
Introducing the Observatory Library for End-to-End Table Embedding Inference In NeurIPS 2023 Second Table Representation Learning Workshop 2023 [Link]
-
Automated Data Cleaning Can Hurt Fairness in Machine Learning-based Decision Making In 2023 IEEE 39th International Conference on Data Engineering (ICDE) 2023 [DOI:10.1109/ICDE55515.2023.00303] [Code]
-
Forget Me Now: Fast and Exact Unlearning in Neighborhood-Based Recommendation In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval 2023 [Abs] [Link] [DOI:10.1145/3539618.3591989]
-
Data Integration Landscapes: The Case for Non-optimal Solutions in Network Diffusion Models In Computational Science – ICCS 2023 2023 [Abs] [DOI:10.1007/978-3-031-35995-8_35]
-
Proactively Screening Machine Learning Pipelines with ARGUSEYES In Companion of the 2023 International Conference on Management of Data 2023 [Abs] [Link] [DOI:10.1145/3555041.3589682] [
2nd Place Demo ]
-
Seventh Workshop on Data Management for End-to-End Machine Learning (DEEM) In Companion of the 2023 International Conference on Management of Data 2023 [Abs] [Link] [DOI:10.1145/3555041.3590819]
-
Models and Practice of Neural Table Representations In Companion of the 2023 International Conference on Management of Data 2023 [Abs] [Link] [DOI:10.1145/3555041.3589411]
-
Provenance Tracking for End-to-End Machine Learning Pipelines In Companion Proceedings of the ACM Web Conference 2023 2023 [Link] [DOI:10.1145/3543873.3587557]
-
A Simulation Environment and Reinforcement Learning Method for Waste Reduction Transactions on Machine Learning Research 2023 [Link]
-
Knowledge Graphs and their Role in the Knowledge Engineering of the 21st Century (Dagstuhl Seminar 22372) Dagstuhl Reports 2023 [Link] [DOI:10.4230/DagRep.12.9.60]
-
Poster: Towards Pattern-Level Privacy Protection in Distributed Complex Event Processing In Proceedings of the 17th ACM International Conference on Distributed and Event-Based Systems 2023 [Abs] [Link] [DOI:10.1145/3583678.3603278]
2022
-
Relational graph convolutional networks: a closer look PeerJ Computer Science 2022 [Link] [DOI:10.7717/peerj-cs.1073]
-
Question Answering with Additive Restrictive Training (QuAART): Question Answering for the Rapid Development of New Knowledge Extraction Pipelines In Knowledge Engineering and Knowledge Management (EKAW) 2022 [Abs] [Link] [DOI:10.1007/978-3-031-17105-5_4]
-
Serenade - Low-Latency Session-Based Recommendation in e-Commerce at Scale In Proceedings of the 2022 International Conference on Management of Data 2022 [Abs] [Link] [DOI:10.1145/3514221.3517901]
-
Towards Data-Centric What-If Analysis for Native Machine Learning Pipelines In Proceedings of the Sixth Workshop on Data Management for End-To-End Machine Learning 2022 [Abs] [Link] [DOI:10.1145/3533028.3533303]
-
SlotGAN: Detecting Mentions in Text via Adversarial Distant Learning In Proceedings of the Sixth Workshop on Structured Prediction for NLP 2022 [Abs] [Link] [DOI:10.18653/v1/2022.spnlp-1.4]
-
CITRIS: Causal Identifiability from Temporal Intervened Sequences In Proceedings of the 39th International Conference on Machine Learning, ICML 2022 [arXiv]
-
Making Canonical Workflow Building Blocks Interoperable across Workflow Languages Data Intelligence 2022 [Abs] [Link] [DOI:10.1162/dint_a_00135]
-
Letter from the Special Issue Editor IEEE Data Engineering Bulletin (Special issue on Directions Towards GDPR-Compliant Data Systems and Applications) 2022 [Link]
-
Defining a Knowledge Graph Development Process Through a Systematic Review ACM Transactios on Software Engineering and Methodology 2022 [Abs] [Link] [DOI:10.1145/3522586]
-
Data distribution debugging in machine learning pipelines The VLDB Journal 2022 [Link] [DOI:10.1007/s00778-021-00726-w]
-
Structure-based knowledge acquisition from electronic lab notebooks for research data provenance documentation Journal of Biomedical Semantics 2022 [Link] [DOI:10.1186/s13326-021-00257-x]
-
Screening Native Machine Learning Pipelines with ArgusEyes In 12th Conference on Innovative Data Systems Research, CIDR 2022, Chaminade, CA, USA, January 9-12, 2022 2022 [Link]
-
The Semantic Web - 19th International Conference, ESWC 2022, Hersonissos, Crete, Greece, May 29 - June 2, 2022, Proceedings 2022 [Link] [DOI:10.1007/978-3-031-06981-9]
-
Towards improving Wikidata reuse with emerging patterns In Proceedings of the 3rd Wikidata Workshop 2022 co-located with the 21st International Semantic Web Conference (ISWC2022), Virtual Event, Hanghzou, China, October 2022 2022 [Link]
-
Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching co-located with the 20th International Semantic Web Conference (ISWC 2021), Virtual conference, October 27, 2021 2022 [Link]
-
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning In International Conference on Learning Representations 2022 [Link] [
Spotlight Presentation ]
-
Towards Parameter-Efficient Automation of Data Wrangling Tasks with Prefix-Tuning In NeurIPS 2022 First Table Representation Workshop 2022 [Link]
-
GitSchemas: A Dataset for Automating Relational Data Preparation Tasks In 2022 IEEE 38th International Conference on Data Engineering Workshops (ICDEW) 2022 [Link] [DOI:10.1109/ICDEW55742.2022.00016]
-
Making Table Understanding Work in Practice In 12th Conference on Innovative Data Systems Research, CIDR 2022, Chaminade, CA, USA, January 9-12, 2022 2022 [Link]
2021
-
The non-linear impact of data handling on network diffusion models Patterns 2021 [Link] [DOI:10.1016/j.patter.2021.100397]
-
GraphPOPE: Retaining Structural Graph Information Using Position-aware Node Embeddings In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG 2021) 2021 [Link]
-
Quality Assessment of Knowledge Graph Hierarchies using KG-BERT In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG 2021) 2021 [Link]
-
Perspectives on automated composition of workflows in the life sciences F1000Research 2021 [Link] [DOI:10.12688/f1000research.54159.1]
-
SemEval-2021 Task 8: MeasEval – Extracting Counts and Measurements and their Related Contexts In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021) 2021 [Abs] [Link] [DOI:10.18653/v1/2021.semeval-1.38] [
SemEval 2021 Best Task Paper ]
-
Further with Knowledge Graphs: Proceedings of the 17th International Conference on Semantic Systems, 6–9 September 2021, Amsterdam, The Netherlands 2021 [Link] [DOI:10.3233/SSW53]
-
Reinforcement Learning–Based Collective Entity Alignment with Adaptive Features ACM Trans. Inf. Syst. 2021 [Abs] [Link] [DOI:10.1145/3446428]
-
Learnings from a Retail Recommendation System on Billions of Interactions at bol.com In 2021 IEEE 37th International Conference on Data Engineering (ICDE) 2021 [Link] [DOI:10.1109/ICDE51399.2021.00277]
-
Inductive Entity Representations from Text via Link Prediction In Proceedings of The Web Conference 2021 [arXiv] [DOI:10.1145/3442381.3450141] [Code]
-
Letter from the Special Issue Editor IEEE Data Engineering Bulletin (Special issue on Data validation for machine learning models and applications) 2021 [Link]
-
Complex Query Answering with Neural Link Predictors In International Conference on Learning Representations (ICLR) 2021 [arXiv] [Link] [
Outstanding Paper Award ICLR 2021 ]
-
Taming Technical Bias in Machine Learning Pipelines IEEE Data Engineering Bulletin (Special Issue on Interdisciplinary Perspectives on Fairness and Artificial Intelligence Systems) 2021 [Link]
-
Talking datasets – Understanding data sensemaking behaviours International Journal of Human-Computer Studies 2021 [Abs] [arXiv] [Link] [DOI:10.1016/j.ijhcs.2020.102562]
-
The Challenges of Cross-Document Coreference Resolution for Email In Proceedings of the 11th on Knowledge Capture Conference 2021 [Abs] [Link] [DOI:10.1145/3460210.3493573]
-
Supporting Ontology Maintenance with Contextual Word Embeddings and Maximum Mean Discrepancy In Joint Proceedings of the 2nd International Workshop on Deep Learning meets Ontologies and Natural Language Processing (DeepOntoNLP 2021) & 6th International Workshop on Explainable Sentiment Mining and Emotion Detection (X-SENTIMENT 2021) co-located with co-located with 18th Extended Semantic Web Conference 2021, Hersonissos, Greece, June 6th - 7th, 2021 (moved online) 2021 [Link]
-
Proceedings of Machine Learning with Symbolic Methods and Knowledge Graphs co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021), Virtual, September 17, 2021 2021 [Link]
-
Verifiably Safe Exploration for End-to-End Reinforcement Learning In Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control 2021 [Abs] [Link] [DOI:10.1145/3447928.3456653] [
Best Paper Award ACM HSCC 2021 ]
-
Summary of Tutorials at The Web Conference 2021 In Companion Proceedings of the Web Conference 2021 2021 [Abs] [Link] [DOI:10.1145/3442442.3453701]
-
HedgeCut: Maintaining Randomised Trees for Low-Latency Machine Unlearning In Proceedings of the 2021 International Conference on Management of Data 2021 [Abs] [Link] [DOI:10.1145/3448016.3457239]
-
MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines In Proceedings of the 2021 International Conference on Management of Data 2021 [Abs] [Link] [DOI:10.1145/3448016.3452759]
-
Automating Data Quality Validation for Dynamic Data Ingestion In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021 2021 [Link] [DOI:10.5441/002/edbt.2021.07]
-
JENGA - A Framework to Study the Impact of Data Errors on the Predictions of Machine Learning Models In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021 2021 [Link] [DOI:10.5441/002/edbt.2021.63]
-
Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines In 11th Conference on Innovative Data Systems Research, CIDR 2021, Virtual Event, January 11-15, 2021, Online Proceedings 2021 [Link]
2020
-
Dataset Reuse: Toward Translating Principles to Practice Patterns 2020 [Link] [DOI:10.1016/j.patter.2020.100136]
-
Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning In Workshop on Human-In-the-Loop Data Analytics (HILDA’20) 2020 [Link] [DOI:10.1145/3398730.3399194]
-
Lost or Found? Discovering Data Needed for Research Harvard Data Science Review 2020 [Link] [DOI:10.1162/99608f92.e38165eb]
-
PANDAcap: A Framework for Streamlining Collection of Full-System Traces In EuroSec 2020 [Link] [DOI:10.1145/3380786.3391396] [Code]
-
Estimating the imageability of words by mining visual characteristics from crawled image data Multimedia Tools and Applications 2020 [Link] [DOI:10.1007/s11042-019-08571-4]
-
FAIR Data Reuse – the Path through Data Citation Data Intelligence 2020 [Link] [DOI:10.1162/dint_a_00030]
-
CSSA’20: Workshop on Combining Symbolic and Sub-Symbolic Methods and Their Applications In Proceedings of the 29th ACM International Conference on Information & Knowledge Management 2020 [Abs] [Link] [DOI:10.1145/3340531.3414072]
-
ICIDS2020 Panel: Building the Discipline of Interactive Digital Narratives In Interactive Storytelling 2020 [Abs] [DOI:10.1007/978-3-030-62516-0_1]
-
Technical Perspective: Query Optimization for Faster Deep CNN Explanations ACM SIGMOD Record 2020 [Link]
-
Apache Mahout: Machine Learning on Distributed Dataflow Systems Journal of Machine Learning Research 2020 [Link]
-
Semantic Systems. In the Era of Knowledge Graphs - 16th International Conference on Semantic Systems, SEMANTiCS 2020, Amsterdam, The Netherlands, September 7-10, 2020, Proceedings 2020 [Link] [DOI:10.1007/978-3-030-59833-4]
-
A longitudinal analysis of university rankings Quantitative Science Studies 2020 [Link] [DOI:10.1162/qss_a_00052]
2019
-
Transfer Learning for Biomedical Named Entity Recognition with BioBERT In Proceedings of the Posters and Demo Track of the 15th International Conference on Semantic Systems co-located with 15th International Conference on Semantic Systems (SEMANTiCS 2019), Karlsruhe, Germany, September 9th - to - 12th, 2019. 2019 [Link]
-
Understanding data search as a socio-technical practice Journal of Information Science 2019 [Abs] [Link] [DOI:10.1177/0165551519837182]
-
Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines Journal of the Association for Information Science and Technology 2019 [Abs] [Link] [DOI:10.1002/asi.24165]
2018
-
Open Information Extraction on Scientific Text: An Evaluation In Proceedings of the 27th International Conference on Computational Linguistics, COLING 2018, Santa Fe, New Mexico, USA, August 20-26, 2018 2018 [Link]
-
Elsevier’s Healthcare Knowledge Graph and the Case for Enterprise Level Linked Data Standards In Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, USA, October 8th - to - 12th, 2018. 2018 [Link]
-
Use of Internal Testing Data to Help Determine Compensation for Crowdsourcing Tasks In Proceedings of the 2nd International Workshop on Augmenting Intelligence with Humans\--in-\-the-\-Loop co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, California, October 9th, 2018. 2018 [Link]