Bibtex file with the publications listed below.

2022

  1. Making Canonical Workflow Building Blocks Interoperable across Workflow Languages Soiland-Reyes, Stian, Bayarri, Genís, Andrio, Pau, Long, Robin, Lowe, Douglas, Niewielska, Ania, Hospital, Adam, and Groth, Paul Data Intelligence 2022 [Abs] [Link] [DOI:10.1162/dint_a_00135]
  2. Defining a Knowledge Graph Development Process Through a Systematic Review Tamašauskaitundefined, Gytundefined, and Groth, Paul ACM Transactios on Software Engineering and Methodology 2022 [Abs] [Link] [DOI:10.1145/3522586]
  3. Data distribution debugging in machine learning pipelines Grafberger, Stefan, Groth, Paul, Stoyanovich, Julia, and Schelter, Sebastian The VLDB Journal 2022 [Link] [DOI:10.1007/s00778-021-00726-w]
  4. Structure-based knowledge acquisition from electronic lab notebooks for research data provenance documentation Schröder, Max, Staehlke, Susanne, Groth, Paul, Nebe, J. Barbara, Spors, Sascha, and Krüger, Frank Journal of Biomedical Semantics 2022 [Link] [DOI:10.1186/s13326-021-00257-x]
  5. Packaging research artefacts with RO-Crate Soiland-Reyes, Stian, Sefton, Peter, Crosas, Mercè, Castro, Leyla Jael, Coppens, Frederik, Fernández, José M., Garijo, Daniel, Grüning, Björn, La Rosa, Marco, Leo, Simone, and al., Data Science 2022 [Link] [DOI:10.3233/DS-210053]
  6. Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching co-located with the 20th International Semantic Web Conference (ISWC 2021), Virtual conference, October 27, 2021 Jiménez-Ruiz, Ernesto, Efthymiou, Vasilis, Chen, Jiaoyan, Cutrona, Vincenzo, Hassanzadeh, Oktie, Sequeda, Juan, Srinivas, Kavitha, Abdelmageed, Nora, Hulsebos, Madelon, Oliveira, Daniela, and Pesquita, Catia 2022 [Link]
  7. AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning Huang, Biwei, Feng, Fan, Lu, Chaochao, Magliacane, Sara, and Zhang, Kun In International Conference on Learning Representations 2022 [Link] [ :tada: Spotlight Presentation ]

2021

  1. The non-linear impact of data handling on network diffusion models Nevin, James, Lees, Michael, and Groth, Paul Patterns 2021 [Link] [DOI:10.1016/j.patter.2021.100397]
  2. GraphPOPE: Retaining Structural Graph Information Using Position-aware Node Embeddings Boef, Jeroen Den, Cornelisse, Joran, and Groth, Paul In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG 2021) 2021 [Link]
  3. Quality Assessment of Knowledge Graph Hierarchies using KG-BERT Szarkowska, Kinga, Moore, Veronique, Vandenbussche, Pierre-Yves, and Groth, Paul In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG 2021) 2021 [Link]
  4. Perspectives on automated composition of workflows in the life sciences Lamprecht, Anna-Lena, Palmblad, Magnus, Ison, Jon, Schwämmle, Veit, Manir, Mohammad Sadnan Al, Altintas, Ilkay, Baker, Christopher J. O., Amor, Ammar Ben Hadj, Capella-Gutierrez, Salvador, Charonyktakis, Paulos, Crusoe, Michael R., Gil, Yolanda, Goble, Carole, Griffin, Timothy J., Groth, Paul, Ienasescu, Hans, Jagtap, Pratik, Kalaš, Matúš, Kasalica, Vedran, Khanteymoori, Alireza, Kuhn, Tobias, Mei, Hailiang, Ménager, Hervé, Möller, Steffen, Richardson, Robin A., Robert, Vincent, Soiland-Reyes, Stian, Stevens, Robert, Szaniszlo, Szoke, Verberne, Suzan, Verhoeven, Aswin, and Wolstencroft, Katherine F1000Research 2021 [Link] [DOI:10.12688/f1000research.54159.1]
  5. SemEval-2021 Task 8: MeasEval – Extracting Counts and Measurements and their Related Contexts Harper, Corey, Cox, Jessica, Kohler, Curt, Scerri, Antony, Daniel Jr., Ron, and Groth, Paul In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021) 2021 [Abs] [Link] [DOI:10.18653/v1/2021.semeval-1.38] [ :tada: SemEval 2021 Best Task Paper ]
  6. Further with Knowledge Graphs: Proceedings of the 17th International Conference on Semantic Systems, 6–9 September 2021, Amsterdam, The Netherlands 2021 [Link] [DOI:10.3233/SSW53]
  7. Reinforcement Learning–Based Collective Entity Alignment with Adaptive Features Zeng, Weixin, Zhao, Xiang, Tang, Jiuyang, Lin, Xuemin, and Groth, Paul ACM Trans. Inf. Syst. 2021 [Abs] [Link] [DOI:10.1145/3446428]
  8. Learnings from a Retail Recommendation System on Billions of Interactions at bol.com Kersbergen, B., and Schelter, S. In 2021 IEEE 37th International Conference on Data Engineering (ICDE) 2021 [Link] [DOI:10.1109/ICDE51399.2021.00277]
  9. Inductive Entity Representations from Text via Link Prediction Daza, Daniel, Cochez, Michael, and Groth, Paul In Proceedings of The Web Conference 2021 [arXiv] [DOI:10.1145/3442381.3450141] [Code]
  10. Letter from the Special Issue Editor Schelter, Sebastian IEEE Data Engineering Bulletin (Special issue on Data validation for machine learning models and applications) 2021 [Link]
  11. Complex Query Answering with Neural Link Predictors Arakelyan, Erik, Daza, Daniel, Minervini, Pasquale, and Cochez, Michael In International Conference on Learning Representations (ICLR) 2021 [arXiv] [Link] [ :tada: Outstanding Paper Award ICLR 2021 ]
  12. Taming Technical Bias in Machine Learning Pipelines Schelter, Sebastian, and Stoyanovich, Julia IEEE Data Engineering Bulletin (Special Issue on Interdisciplinary Perspectives on Fairness and Artificial Intelligence Systems) 2021 [Link]
  13. Talking datasets – Understanding data sensemaking behaviours Koesten, Laura, Gregory, Kathleen, Groth, Paul, and Simperl, Elena International Journal of Human-Computer Studies 2021 [Abs] [arXiv] [Link] [DOI:10.1016/j.ijhcs.2020.102562]
  14. The Challenges of Cross-Document Coreference Resolution for Email Li, Xue, Magliacane, Sara, and Groth, Paul In Proceedings of the 11th on Knowledge Capture Conference 2021 [Abs] [Link] [DOI:10.1145/3460210.3493573]
  15. Supporting Ontology Maintenance with Contextual Word Embeddings and Maximum Mean Discrepancy Shroff, Natasha, Vandenbussche, Pierre-Yves, Moore, Véronique, and Groth, Paul In Joint Proceedings of the 2nd International Workshop on Deep Learning meets Ontologies and Natural Language Processing (DeepOntoNLP 2021) & 6th International Workshop on Explainable Sentiment Mining and Emotion Detection (X-SENTIMENT 2021) co-located with co-located with 18th Extended Semantic Web Conference 2021, Hersonissos, Greece, June 6th - 7th, 2021 (moved online) 2021 [Link]
  16. Proceedings of Machine Learning with Symbolic Methods and Knowledge Graphs co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2021), Virtual, September 17, 2021 Alam, Mehwish, Ali, Mehdi, Groth, Paul, Hitzler, Pascal, Lehmann, Jens, Paulheim, Heiko, Rettinger, Achim, Sack, Harald, Sadeghi, Afshin, and Tresp, Volker 2021 [Link]
  17. Verifiably Safe Exploration for End-to-End Reinforcement Learning Hunt, Nathan, Fulton, Nathan, Magliacane, Sara, Hoang, Trong Nghia, Das, Subhro, and Solar-Lezama, Armando In Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control 2021 [Abs] [Link] [DOI:10.1145/3447928.3456653] [ :tada: Best Paper Award ACM HSCC 2021 ]
  18. Summary of Tutorials at The Web Conference 2021 West, Robert, Bhagat, Smriti, Groth, Paul, Zitnik, Marinka, Couto, Francisco M., Lisena, Pasquale, Meroño-Peñuela, Albert, Zhao, Xiangyu, Fan, Wenqi, Yin, Dawei, Tang, Jiliang, Shou, Linjun, Gong, Ming, Pei, Jian, Geng, Xiubo, Zhou, Xingjie, Jiang, Daxin, Ricaud, Benjamin, Aspert, Nicolas, Miz, Volodymyr, Dy, Jennifer, Ioannidis, Stratis, Yıldız, undefinedlkay, Rezapour, Rezvaneh, Aref, Samin, Dinh, Ly, Diesner, Jana, Drutsa, Alexey, Ustalov, Dmitry, Popov, Nikita, Baidakova, Daria, Mishra, Shubhanshu, Gopalan, Arjun, Juan, Da-Cheng, Ilharco Magalhaes, Cesar, Ferng, Chun-Sung, Heydon, Allan, Lu, Chun-Ta, Pham, Philip, Yu, George, Fan, Yicheng, Wang, Yueqi, Laurent, Florian, Schraner, Yanick, Scheller, Christian, Mohanty, Sharada, Chen, Jiawei, Wang, Xiang, Feng, Fuli, He, Xiangnan, Teinemaa, Irene, Albert, Javier, Goldenberg, Dmitri, Vasile, Flavian, Rohde, David, Jeunen, Olivier, Benhalloum, Amine, Sakhi, Otmane, Rong, Yu, Huang, Wenbing, Xu, Tingyang, Bian, Yatao, Cheng, Hong, Sun, Fuchun, Huang, Junzhou, Fakhraei, Shobeir, Faloutsos, Christos, Çelebi, Onur, Müller, Martin, Schneider, Manuel, Altunina, Olesia, Wingerath, Wolfram, Wollmer, Benjamin, Gessert, Felix, Succo, Stephan, Ritter, Norbert, Courdier, Evann, Avram, Tudor Mihai, Cvetinovic, Dragan, Tsinadze, Levan, Jose, Johny, Howell, Rose, Koenig, Mario, Defferrard, Michaël, Kenthapadi, Krishnaram, Packer, Ben, Sameki, Mehrnoosh, and Sephus, Nashlie In Companion Proceedings of the Web Conference 2021 2021 [Abs] [Link] [DOI:10.1145/3442442.3453701]
  19. HedgeCut: Maintaining Randomised Trees for Low-Latency Machine Unlearning Schelter, Sebastian, Grafberger, Stefan, and Dunning, Ted In Proceedings of the 2021 International Conference on Management of Data 2021 [Abs] [Link] [DOI:10.1145/3448016.3457239]
  20. MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines Grafberger, Stefan, Guha, Shubha, Stoyanovich, Julia, and Schelter, Sebastian In Proceedings of the 2021 International Conference on Management of Data 2021 [Abs] [Link] [DOI:10.1145/3448016.3452759]
  21. Automating Data Quality Validation for Dynamic Data Ingestion Redyuk, Sergey, Kaoudi, Zoi, Markl, Volker, and Schelter, Sebastian In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021 2021 [Link] [DOI:10.5441/002/edbt.2021.07]
  22. JENGA - A Framework to Study the Impact of Data Errors on the Predictions of Machine Learning Models Schelter, Sebastian, Rukat, Tammo, and Biessmann, Felix In Proceedings of the 24th International Conference on Extending Database Technology, EDBT 2021, Nicosia, Cyprus, March 23 - 26, 2021 2021 [Link] [DOI:10.5441/002/edbt.2021.63]
  23. Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines Grafberger, Stefan, Stoyanovich, Julia, and Schelter, Sebastian In 11th Conference on Innovative Data Systems Research, CIDR 2021, Virtual Event, January 11-15, 2021, Online Proceedings 2021 [Link]

2020

  1. Towards Olfactory Information Extraction from Text: A Case Study on Detecting Smell Experiences in Novels Brate, Ryan, Groth, Paul, and Erp, Marieke In Proceedings of the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2020 [Abs] [Link]
  2. Dataset Reuse: Toward Translating Principles to Practice Koesten, Laura, Vougiouklis, Pavlos, Simperl, Elena, and Groth, Paul Patterns 2020 [Link] [DOI:10.1016/j.patter.2020.100136]
  3. Effective distributed representations for academic expert search Berger, Mark, Zavrel, Jakub, and Groth, Paul In Proceedings of the First Workshop on Scholarly Document Processing at EMNLP 2020 [Abs] [Link]
  4. Dataset search: a survey Chapman, Adriane, Simperl, Elena, Koesten, Laura, Konstantinidis, George, Ibáñez, Luis-Daniel, Kacprzak, Emilia, and Groth, Paul The VLDB Journal 2020 [arXiv] [Link] [DOI:10.1007/s00778-019-00564-x]
  5. Introduction – FAIR data, systems and analysis Groth, Paul, and Dumontier, Michel Data Science 2020 [Link] [DOI:10.3233/DS-200029]
  6. Fairness-Aware Instrumentation of Preprocessing Pipelines for Machine Learning Yang, Ke, Huang, Biao, Stoyanovich, Julia, and Schelter, Sebastian In Workshop on Human-In-the-Loop Data Analytics (HILDA’20) 2020 [Link] [DOI:10.1145/3398730.3399194]
  7. Towards Entity Spaces Erp, Marieke, and Groth, Paul In Proceedings of The 12th Language Resources and Evaluation Conference 2020 [Abs] [Link]
  8. Lost or Found? Discovering Data Needed for Research Gregory, Kathleen, Groth, Paul, Scharnhorst, Andrea, and Wyatt, Sally Harvard Data Science Review 2020 [Link] [DOI:10.1162/99608f92.e38165eb]
  9. PANDAcap: A Framework for Streamlining Collection of Full-System Traces Stamatogiannakis, Manolis, Bos, Herbert, and Groth, Paul In EuroSec 2020 [Link] [DOI:10.1145/3380786.3391396] [Code]
  10. Estimating the imageability of words by mining visual characteristics from crawled image data Kastner, Marc A., Ide, Ichiro, Nack, Frank, Kawanishi, Yasutomo, Hirayama, Takatsugu, Deguchi, Daisuke, and Murase, Hiroshi Multimedia Tools and Applications 2020 [Link] [DOI:10.1007/s11042-019-08571-4]
  11. FAIR Data Reuse – the Path through Data Citation Groth, Paul, Cousijn, Helena, Clark, Tim, and Goble, Carole Data Intelligence 2020 [Link] [DOI:10.1162/dint_a_00030]
  12. Message Passing Query Embedding Daza, Daniel, and Cochez, Michael In ICML Workshop - Graph Representation Learning and Beyond 2020 [arXiv] [Link]
  13. The state of altmetrics: a tenth anniversary celebration Altmetric Engineering, , Konkiel, Stacy, Priem, Jason, Adie, Euan, Derrick, Gemma, Didegah, Fereshteh, Groth, Paul, Neylon, Cameron, Shenmeng Xu, , Zahedi, Zohreh, Bowman, Timothy, Vanash M Patel, , Haunschild, Robin, Bornmann, Lutz, Taylor, Mike, Ross, Liesa, Theng, Yin-Leng, Hassan, Saeed-Ul, and Aljohani, Naif R. 2020 [Link] [DOI:10.6084/M9.FIGSHARE.13010000.V2]
  14. CSSA’20: Workshop on Combining Symbolic and Sub-Symbolic Methods and Their Applications Alam, Mehwish, Groth, Paul, Hitzler, Pascal, Paulheim, Heiko, Sack, Harald, and Tresp, Volker In Proceedings of the 29th ACM International Conference on Information & Knowledge Management 2020 [Abs] [Link] [DOI:10.1145/3340531.3414072]
  15. ICIDS2020 Panel: Building the Discipline of Interactive Digital Narratives Bernstein, Mark, Palosaari Eladhari, Mirjam, Koenitz, Hartmut, Louchart, Sandy, Nack, Frank, Martens, Chris, Rossi, Giulia Carla, Bosser, Anne-Gwenn, and Millard, David E. In Interactive Storytelling 2020 [Abs] [DOI:10.1007/978-3-030-62516-0_1]
  16. Technical Perspective: Query Optimization for Faster Deep CNN Explanations Schelter, Sebastian ACM SIGMOD Record 2020 [Link]
  17. Apache Mahout: Machine Learning on Distributed Dataflow Systems Anil, Robin, Capan, Gokhan, Drost-Fromm, Isabel, Dunning, Ted, Friedman, Ellen, Grant, Trevor, Quinn, Shannon, Ranjan, Paritosh, Schelter, Sebastian, and Yılmazel, Özgür Journal of Machine Learning Research 2020 [Link]
  18. Semantic Systems. In the Era of Knowledge Graphs - 16th International Conference on Semantic Systems, SEMANTiCS 2020, Amsterdam, The Netherlands, September 7-10, 2020, Proceedings Blomqvist, Eva, Groth, Paul, Boer, Victor, Pellegrini, Tassilo, Alam, Mehwish, Käfer, Tobias, Kieseberg, Peter, Kirrane, Sabrina, Meroño-Peñuela, Albert, and Pandit, Harshvardhan J. 2020 [Link] [DOI:10.1007/978-3-030-59833-4]
  19. A longitudinal analysis of university rankings Selten, Friso, Neylon, Cameron, Huang, Chun-Kai, and Groth, Paul Quantitative Science Studies 2020 [Link] [DOI:10.1162/qss_a_00052]

2019

  1. How Relevant Is Your Choice? Kolhoff, Lobke, and Nack, Frank In ICIDS 2019. Lecture Notes in Computer Science, vol 11869 2019 [Abs]
  2. Transfer Learning for Biomedical Named Entity Recognition with BioBERT Symeonidou, Anthi, Sazonau, Viachaslau, and Groth, Paul In Proceedings of the Posters and Demo Track of the 15th International Conference on Semantic Systems co-located with 15th International Conference on Semantic Systems (SEMANTiCS 2019), Karlsruhe, Germany, September 9th - to - 12th, 2019. 2019 [Link]
  3. Understanding data search as a socio-technical practice Gregory, Kathleen M, Cousijn, Helena, Groth, Paul, Scharnhorst, Andrea, and Wyatt, Sally Journal of Information Science 2019 [Abs] [Link] [DOI:10.1177/0165551519837182]
  4. Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines Gregory, Kathleen, Groth, Paul, Cousijn, Helena, Scharnhorst, Andrea, and Wyatt, Sally Journal of the Association for Information Science and Technology 2019 [Abs] [Link] [DOI:10.1002/asi.24165]
  5. End-to-End Learning for Answering Structured Queries Directly over Text Groth, Paul T., Scerri, Antony, Daniel, Ron, and Allen, Bradley P. In Proceedings of the Workshop on Deep Learning for Knowledge Graphs (DL4KG2019) Co-located with the 16th Extended Semantic Web Conference 2019 (ESWC 2019), Portoroz, Slovenia, June 2, 2019. 2019 [arXiv] [Link]

2018

  1. Open Information Extraction on Scientific Text: An Evaluation Groth, Paul T., Lauruhn, Michael, Scerri, Antony, and Daniel, Ron In Proceedings of the 27th International Conference on Computational Linguistics, COLING 2018, Santa Fe, New Mexico, USA, August 20-26, 2018 2018 [Link]
  2. Elsevier’s Healthcare Knowledge Graph and the Case for Enterprise Level Linked Data Standards DeJong, Alex, Bord, Radmila, Dowling, Will, Hoekstra, Rinke, Moquin, Ryan, O, Charlie, Samarasinghe, Mevan, Snyder, Paul, Stanley, Craig, Tordai, Anna, Trefry, Michael, and Groth, Paul T. In Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, USA, October 8th - to - 12th, 2018. 2018 [Link]
  3. Use of Internal Testing Data to Help Determine Compensation for Crowdsourcing Tasks Lauruhn, Michael, Groth, Paul T., Harper, Corey A., and Deus, Helena F. In Proceedings of the 2nd International Workshop on Augmenting Intelligence with Humans\--in-\-the-\-Loop co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, California, October 9th, 2018. 2018 [Link]

2017

  1. Indicators for the use of robotic labs in basic biomedical research: a literature analysis Groth, Paul, and Cox, Jessica PeerJ 2017 [Abs] [Link] [DOI:10.7717/peerj.3997]
  2. Storing, Tracking, and Querying Provenance in Linked Data Wylot, Marcin, Cudré-Mauroux, Philippe, Hauswirth, Manfred, and Groth, Paul T. IEEE Trans. Knowl. Data Eng. 2017 [Link] [DOI:10.1109/TKDE.2017.2690299]
  3. PROV2R: Practical Provenance Analysis of Unstructured Processes Stamatogiannakis, Manolis, Athanasopoulos, Elias, Bos, Herbert, and Groth, Paul T. ACM Trans. Internet Techn. 2017 [Link] [DOI:10.1145/3062176]
  4. Linked Data Management Hauswirth, Manfred, Wylot, Marcin, Grund, Martin, Groth, Paul T., and Cudré-Mauroux, Philippe 2017 [Link] [DOI:10.1007/978-3-319-49340-4_9]

2016

  1. Sources of Change for Modern Knowledge Organization Systems Lauruhn, Michael, and Groth, Paul KNOWLEDGE ORGANIZATION 2016 [arXiv] [DOI:10.5771/0943-7444-2016-8-622]
  2. Applying Universal Schemas for Domain Specific Ontology Expansion Groth, Paul T., Pal, Sujit, McBeath, Darin, Allen, Brad, and Daniel, Ron In Proceedings of the 5th Workshop on Automated Knowledge Base Construction, AKBC@NAACL-HLT 2016, San Diego, CA, USA, June 17, 2016 2016 [Link]
  3. The FAIR Guiding Principles for scientific data management and stewardship Wilkinson, Mark D., Dumontier, Michel, Aalbersberg, IJsbrand Jan, Appleton, Gabrielle, Axton, Myles, Baak, Arie, Blomberg, Niklas, Boiten, Jan Willem, da Silva Santos, Luiz Bonino, Bourne, Philip E., Bouwman, Jildau, Brookes, Anthony J., Clark, Tim, Crosas, Mercè, Dillo, Ingrid, Dumon, Olivier, Edmunds, Scott, Evelo, Chris T., Finkers, Richard, Gonzalez-Beltran, Alejandra, Gray, Alasdair J.G., Groth, Paul, Goble, Carole, Grethe, Jeffrey S., Heringa, Jaap, ’t Hoen, Peter A C, Hooft, Rob, Kuhn, Tobias, Kok, Ruben, Kok, Joost, Lusher, Scott J., Martone, Maryann E, Mons, Albert, Packer, Abel L., Persson, Bengt, Rocca-Serra, Philippe, Roos, Marco, van Schaik, Rene, Sansone, Susanna Assunta, Schultes, Erik, Sengstag, Thierry, Slater, Ted, Strawn, George, Swertz, Morris A., Thompson, Mark, Van Der Lei, Johan, Van Mulligen, Erik, Velterop, Jan, Waagmeester, Andra, Wittenburg, Peter, Wolstencroft, Katherine, Zhao, Jun, and Mons, Barend Scientific Data 2016 [Abs] [DOI:10.1038/sdata.2016.18]