Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS
CORDIS Web 30th anniversary CORDIS Web 30th anniversary

Piloting a Cooperative Open Web Search Infrastructure to Support Europe's Digital Sovereignty

CORDIS provides links to public deliverables and publications of HORIZON projects.

Links to deliverables and publications from FP7 projects, as well as links to some specific result types such as dataset and software, are dynamically retrieved from OpenAIRE .

Deliverables

Dissemination, Exploitation and Communication (DEC) Report V1 (opens in new window)

Dissemination, Exploitation and Communication (DEC) Report first version

ELSA-catalogue & code of conduct for open Web search (opens in new window)

ELSA-catalogue & code of conduct for open Web search initial version

Model governance for federating an open search infrastructure V1 (opens in new window)

Model governance for federating an open search infrastructure Version 1

Report on scientific cooperation, community building and stakeholder involvement V1 (opens in new window)

Report on scientific cooperation, community building and stakeholder involvement initial version

Report of privacy, transparency, and trust models for search applications V1 (opens in new window)

Report of privacy, transparency, and trust models for search applications in its first version

Launch of the Pilot infrastructure (opens in new window)
Crawler Coordination Software Stack & Demonstrator V1 (opens in new window)

Open Source Software Stack for coordinating multiple, distributed and usually independent crawlers.

The OpenWebSearch Hub and the Open Web Index V1 (opens in new window)

The OpenWebSearch Hub and the Open Web Index in a first version indexing common crawls and providing first specifications

Publications

Cross-Market Product-Related Question Answering (opens in new window)

Author(s): Ghasemi, Negin; Aliannejadi, Mohammad; Bonab, Hamed; Kanoulas, Evangelos; de Vries, Arjen P.; Allan, James; Hiemstra, Djoerd
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591658

A Longitudinal Study of Content Control Mechanisms (opens in new window)

Author(s): Michael Dinzinger, Michael Granitzer
Published in: Companion Proceedings of the ACM Web Conference 2024, 2024
Publisher: ACM
DOI: 10.1145/3589335.3651893

A User Study on the Acceptance of Native Advertising in Generative IR (opens in new window)

Author(s): Ines Zelch, Matthias Hagen and Martin Potthast
Published in: Proceedings of the 2024 Conference on Human Information Interaction and Retrieval (CHIIR '24), 2024, ISBN 979-8-4007-0434-5
Publisher: ACM
DOI: 10.1145/3627508.3638316

PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents (opens in new window)

Author(s): Saber Zerhoudi, Michael Granitzer
Published in: 2024
Publisher: CEUR-WS
DOI: 10.48550/arXiv.2407.09394

Challenges of Index Exchange for Search Engine Interoperability (opens in new window)

Author(s): Hiemstra, D., Hendriksen, G., Kamphuis, C., & de Vries, A. P.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10529619

Overview of Touché 2023: Argument and Causal Retrieval (opens in new window)

Author(s): Alexander Bondarenko, Maik Fröbe, Johannes Kiesel, Ferdinand Schlatt, Valentin Barriere, Brian Ravenet, Léo Hemamou, Simon Luck, Jan Heinrich Reimer, Benno Stein, Martin Potthast, and Matthias Hagen
Published in: Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2023. Lecture Notes in Computer Science, 2023, ISBN 978-3-031-42447-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-42448-9_31

On Stance Detection in Image Retrieval for Argumentation (opens in new window)

Author(s): Carnot, Miriam Louise; Schreieder, Tobias; Heinemann, Lorenz; Kiesel, Johannes; Braker, Jan; Fröbe, Maik; Potthast, Martin; Stein, Benno
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591917

Overview of PAN 2023: Authorship Verification, Multi-Author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection (opens in new window)

Author(s): Janek Bevendorff, Ian Borrego-Obrador, Mara Chinea-Ríos, Marc Franco-Salvador, Maik Fröbe, Annina Heini, Krzysztof Kredens, Maximilian Mayerl, Piotr Pęzik, Martin Potthast, Francisco Rangel, Paolo Rosso, Efstathios Stamatatos, Benno Stein, Matti Wiegmann,
Published in: Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2023. Lecture Notes in Computer Science, 2023, ISBN 978-3-031-42447-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-42448-9_29

Conceptual Design and Implementation of a Prototype Search Application using the Open Web Search Index (opens in new window)

Author(s): Nussbaumer, A., Kaushik, R., Hendriksen, G., Gürtl, S., & Gütl, C.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10636166

Generating Natural Language Queries for More Effective Systematic Review Screening Prioritisation (opens in new window)

Author(s): Shuai Wang; Harrisen Scells; Bevan Koopman; Martin Potthast; Guido Zuccon
Published in: SIGIR-AP '23: Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023
DOI: 10.48550/arxiv.2309.05238

Generative Agents Navigating Digital Libraries (opens in new window)

Author(s): Saber Zerhoudi, Michael Granitzer
Published in: Lecture Notes in Computer Science, Sustainability and Empowerment in the Context of Digital Libraries, 2024
Publisher: Springer Nature Singapore
DOI: 10.1007/978-981-96-0865-2_14

Federated Data Infrastructure for the Open Web Search (opens in new window)

Author(s): Fathima, N. A., Golasowski, M., Granitzer, M., Wagner, A., Ariyo, C., Hendriksen, G., Truckenbrodt, J., Mankinen, K., Dinzinger, M., Karlsson, M., Hayek, M., Moiras, S., Vojacek, L., Hachinger, S., & Martinovič, J.
Published in: 2024
Publisher: Zenodo
DOI: 10.5281/zenodo.13872163

Simulating Follow-up Questions in Conversational Search (opens in new window)

Author(s): Kiesel, J., Gohsen, M., Mirzakhmedova, N., Hagen, M., Stein, B.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14609, 2024, ISBN 978-3-031-56059-0
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56060-6_25

The Information Retrieval Experiment Platform (opens in new window)

Author(s): Fröbe, Maik; Deckers, Niklas; Stein, Benno; Reimer, Jan Heinrich; Reich, Simon; Hagen, Matthias; MacAvaney, Sean; Bevendorff, Janek; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.48550/arXiv.2305.18932

Designing an Integration Concept of the Provenance Verification Indicator into Open Web Search Engines (opens in new window)

Author(s): Nussbaumer, Alexander; Ebner, Sylvia M.; Gütl, Christian; Munnelly, Gary; Spillane, Brendan; Conlan, Owen; Plote, Christine; Frank, Anton
Published in: "Proceedings of the 5th International Open Search Symposium #ossym2022", 2022
Publisher: CERN
DOI: 10.5281/zenodo.8064758

Architecting the Opensearch Service at CERN For OpenWebSearch.EU (opens in new window)

Author(s): Fathima, N. A., Granitzer, M., Dinzinger, M., & Wagner, A.
Published in: 2024
Publisher: Zenodo
DOI: 10.5281/zenodo.13872517

Indicative Summarization of Long Discussions (opens in new window)

Author(s): Syed, Shahbaz; Schwabe, Dominik; Al-Khatib, Khalid; Potthast, Martin
Published in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Publisher: ACL
DOI: 10.48550/arxiv.2311.01882

Continuous Integration for Reproducible Shared Tasks with TIRA.io (opens in new window)

Author(s): Maik Fröbe, Matti Wiegmann, Nikolay Kolyada, Bastian Grahm, Theresa Elstner, Frank Loebe, Matthias Hagen, Benno Stein, and Martin Potthast
Published in: Advances in Information Retrieval. 45th European Conference on IR Research (ECIR 2023), 2023, ISBN 978-3-031-28240-9
Publisher: Springer
DOI: 10.1007/978-3-031-28241-6_20

SemEval-2023 Task 5: Clickbait Spoiling (opens in new window)

Author(s): Maik Fröbe, Tim Gollub, Benno Stein, Matthias Hagen, and Martin Potthast
Published in: Proceedings of 17th International Workshop on Semantic Evaluation (SemEval 2023), 2023
Publisher: ACL
DOI: 10.18653/v1/2023.semeval-1.315

An Empirical Comparison of Web Content Extraction Algorithms (opens in new window)

Author(s): Bevendorff, Janek; Kiesel, Johannes; Gupta, Sanket; Stein, Benno
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023, ISBN 978-1-4503-9408-6
Publisher: ACM
DOI: 10.1145/3539618.3591920

The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives (opens in new window)

Author(s): Reimer, Jan Heinrich; Gienapp, Lukas; Schmidt, Sebastian; Scells, Harrisen; Fröbe, Maik; Stein, Benno; Hagen, Matthias; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.48550/arXiv.2304.00413

MMEAD: MS MARCO Entity Annotations and Disambiguations (opens in new window)

Author(s): Kamphuis, Chris; Lin, Jimmy; Lin, Aileen; de Vries, Arjen P.; Yang, Siwen; Hasibi, Faegheh
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591887

Overview of Touché 2024: Argumentation Systems (opens in new window)

Author(s): Kiesel, J. et al.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14612, 2024, ISBN 978-3-031-56068-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56069-9_64

OWLer: A Distributed and Collaborative Open Web Crawler (opens in new window)

Author(s): Dinzinger, M., Granitzer, M., Mitrović, J., Zerhoudi, S.
Published in: 6th International Open Search Symposium (OSSYM2024), 2024
Publisher: Zenodo
DOI: 10.5281/zenodo.13863478

Weighted AUReC: Handling Skew in Shard Map Quality Estimation for Selective Search (opens in new window)

Author(s): Hendriksen, G., Hiemstra, D., de Vries, A.P.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14611, 2024, ISBN 978-3-031-56065-1
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56066-8_10

Citance-Contextualized Summarization of Scientific Papers (opens in new window)

Author(s): Syed, Shahbaz; Hakimi, Ahmad Dawar; Al-Khatib, Khalid; Potthast, Martin
Published in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Publisher: ACL
DOI: 10.48550/arxiv.2311.02408

Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models (opens in new window)

Author(s): Parry, A., Fröbe, M., MacAvaney, S., Potthast, M., Hagen, M.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14609, 2024, ISBN 978-3-031-56059-0
Publisher: Springer, Cham
DOI: 10.48550/arXiv.2403.07654

Enriching Science Search with the Open Search Framework MOSAIC (opens in new window)

Author(s): Nussbaumer, A., Gürtl, S., Honeder, J., Hecking, T., & Gütl, C.
Published in: 2024, ISBN 978-92-9083-669-8
Publisher: Zenodo
DOI: 10.5281/zenodo.13871624

Bootstrapped nDCG Estimation in the Presence of Unjudged Documents (opens in new window)

Author(s): Maik Fröbe, Lukas Gienapp, Martin Potthast, and Matthias Hagen
Published in: Advances in Information Retrieval. 45th European Conference on IR Research (ECIR 2023), 2023, ISBN 978-3-031-28243-0
Publisher: Springer
DOI: 10.1007/978-3-031-28244-7_20

OWler: Preliminary results for building a Collaborative Open Web Crawler (opens in new window)

Author(s): Dinzinger, M., Al-Maamari, M., Zerhoudi, S., Istaiti, M., Mitrović, J., & Granitzer, M.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10581841

Detecting Generated Native Ads in Conversational Search (opens in new window)

Author(s): Sebastian Schmidt, Ines Zelch, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast
Published in: Companion Proceedings of the ACM Web Conference 2024, 2024
Publisher: ACM
DOI: 10.1145/3589335.3651489

Understanding and Mitigating Cognitive Bias during Web Search (opens in new window)

Author(s): Hitzginger, S., Nussbaumer, A., Gütl, C., & Ruß-Baumann, C.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10607402

Trigger Warnings: Bootstrapping a Violence Detector for Fan Fiction (opens in new window)

Author(s): Magdalena Wolska, Matti Wiegmann, Christopher Schröder, Ole Borchardt, Benno Stein, and Martin Potthast
Published in: Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Publisher: ACL
DOI: 10.18653/v1/2023.findings-emnlp.41

Commercialized Generative AI: A Critical Study of the Feasibility and Ethics of Generating Native Advertising Using Large Language Models in Conversational Web Search (opens in new window)

Author(s): Zelch, I., Hagen, M., and Potthast, M.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.48550/arXiv.2310.04892

Beyond Benchmarks: Evaluating Embedding Model Similarity for Retrieval Augmented Generation Systems (opens in new window)

Author(s): Caspari, Laura; Dastidar, Kanishka Ghosh; Zerhoudi, Saber; Mitrovic, Jelena; Granitzer, Michael
Published in: 2024
Publisher: CEUR-WS
DOI: 10.48550/arXiv.2407.08275

Investigating the Effects of Sparse Attention on Cross-Encoders (opens in new window)

Author(s): Schlatt, F., Fröbe, M., Hagen, M.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14608, 2024, ISBN 978-3-031-56027-9
Publisher: Springer, Cham
DOI: 10.48550/arXiv.2312.17649

A Comprehensive Dataset for Webpage Classification (opens in new window)

Author(s): Al-Maamari, M., Istaiti, M., Zerhoudi, S., Dinzinger, M., Granitzer, M., & Mitrović, J.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10594210

Overview of PAN 2024: Multi-Author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification (opens in new window)

Author(s): Bevendorff et al.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14613, 2024, ISBN 978-3-031-56071-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56072-9_1

An Open Source Implementation of Web Clustering Algorithms for Selective Search (opens in new window)

Author(s): Hendriksen, G., Hiemstra, D., & de Vries, A.
Published in: 2024
Publisher: Zanodo
DOI: 10.5281/zenodo.13882966

Smooth Operators for Effective Systematic Review Queries (opens in new window)

Author(s): Scells, Harrisen; Schlatt, Ferdinand; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591768

Geoparsing at Web-scale - Challenges and Opportunities

Author(s): Farzana, Sheikh Mastura; Hecking, Tobias
Published in: GeoExT 2023: First International Workshop on Geographic Information Extraction from Texts at ECIR 2023 (CEUR Workshop Proceedings), Issue 3385, 2023, ISSN 1613-0073
Publisher: CEUR-WS

Product Spam On YouTube: a Case Study (opens in new window)

Author(s): Bevendorff, J., Wiegmann, M., Potthast, M., & Stein, B.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10498306

UNFair: Search Engine Manipulation, Undetectable by Amortized Inequity (opens in new window)

Author(s): De Jonge, Tim; Hiemstra, Djoerd
Published in: FAccT 2023 - Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023
Publisher: ACM
DOI: 10.1145/3593013.3594046

Pybool_ir: A Toolkit for Domain-Specific Search Experiments (opens in new window)

Author(s): Scells, Harrisen; Potthast, Martin
Published in: SIGIR 2023 - Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Publisher: ACM
DOI: 10.1145/3539618.3591819

Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search Engines (opens in new window)

Author(s): Bevendorff, J., Wiegmann, M., Potthast, M., Stein, B.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14610, 2024, ISBN 978-3-031-56062-0
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56063-7_4

Open Web Search at LongEval 2023: Reciprocal Rank Fusion on Automatically Generated Query Variants

Author(s): Maik Fröbe, Gijs Hendriksen, Arjen Paul de Vries, and Martin Potthast
Published in: 2023
Publisher: CEUR-WS.org

Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024 (opens in new window)

Author(s): Ionescu, B. et al.
Published in: Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, Issue 14613, 2024, ISBN 978-3-031-56071-2
Publisher: Springer, Cham
DOI: 10.1007/978-3-031-56072-9_6

Impact and development of an Open Web Index for Open Web Search (opens in new window)

Author(s): Granitzer Michael; Voigt Stefan; Noor Afshan Fathima; Golasowski Martin; Guetl Christian; Hecking Tobias; Gijs Hendriksen; Djoerd Hiemstra; Jan Martinovič; Jelena Mitrović; Izidor Mlakar; Stavros Moiras; Alexander Nussbaumer; Per Öster; Martin Potthast; Marjana Senčar Srdič; Sharikadze Megi; Kateřina Slaninová; Benno Stein; Arjen P. de Vries; Vít Vondrák; Andreas Wagner; Saber Zerhoudi
Published in: JASIST, 2023, ISSN 2330-1635
Publisher: Willey
DOI: 10.1002/asi.24818

Evaluating Generative Ad Hoc Information Retrieval (opens in new window)

Author(s): Gienapp, Lukas; Scells, Harrisen; Deckers, Niklas; Bevendorff, Janek; Wang, Shuai; Kiesel, Johannes; Syed, Shahbaz; Fröbe, Maik; Zuccon, Guido; Stein, Benno; Hagen, Matthias; Potthast, Martin
Published in: Computing Research Repository (CoRR) in arXiv, 2023
Publisher: ArXiv
DOI: 10.48550/arxiv.2311.04694

Prototyping Open Web Search Applications with TIRA: A Case Study in Research-oriented Teaching (opens in new window)

Author(s): Fröbe, M., Elstner, T., Scells, H., Stein, B., & Potthast, M.
Published in: Proceedings of 5th International Open Search Symposium (OSSYM2023), 2023, ISBN 978-92-9083-653-7
Publisher: CERN
DOI: 10.5281/zenodo.10557539

Searching for OpenAIRE data...

There was an error trying to search data from OpenAIRE

No results available