Session: Full Papers 1 - Core Retrieval Models, Representations & Evaluation
(Monday 10:30–12:30, Centrale)
| Paper | Authors |
|---|---|
| Sample-Free Almost-Exact Estimation of Plackett-Luce Propensities for Off-Policy Ranking | Norman Knyazev, Harrie Oosterhuis |
| Validating Search Query Simulations: A Taxonomy of Measures | Andreas Konstantin Kruff, Nolwenn Bernard, Philipp Schaer |
| Reducing Human Effort to Validate LLM Relevance Judgements via Stratified Sampling | Simone Merlo, Stefano Marchesin, Guglielmo Faggioli, Nicola Ferro |
| Revealing MonoT5's Learning Mechanisms via Prompt-Token Adaptation | Marco Braga, Sean MacAvaney, Craig Macdonald, Gabriella Pasi |
| When Reducing Representations Improves Performance | Andrea Pasin, Guglielmo Faggioli, Nicola Ferro, Raffaele Perego, Nicola Tonellotto |
| An Empirical Study of Model Casing in Learned Sparse Retrieval | Emmanouil Georgios Lionis, Jia-Huei Ju, Angelos Nalmpantis, Casper Thuis, Sean MacAvaney, Andrew Yates |
| Improving Instruction-Aware Retrieval with Query-Preserving Regularization | Hyewon Kim, Hyun-Je Song |
Session: Full Papers 2 - Applied Generation, Evaluation & Analysis with LLMs
(Monday 10:30–12:30, Chaos)
| Paper | Authors |
|---|---|
| Contradictions in Context: Challenges for Retrieval-Augmented Generation in Healthcare | Saeedeh Javadi, Sara Mirabi, Manan Gangar, Bahadorreza Ofoghi |
| Small Models, Big Picture! A Language Model Augmentation for Enhanced Reader-Aware Summarization | Raghvendra Kumar, A S Poornash, Sriparna Saha |
| From Comments to Conclusions: Adaptive Reader-Aware Summary Generation in Low-Resource Languages via Agent Debate | Raghvendra Kumar, Mohammed Salman S A, Jaya Verma, Sriparna Saha |
| Prompt Compression in the Wild: Measuring Latency, Rate Adherence, and Quality for Faster LLM Inference | Cornelius Kummer, Lena Jurkschat, Michael Färber, Sahar Vahdati |
| Towards Quantitative Summarization Evaluation: An Integrated Atomic-Based Evaluation Framework and Dataset for Text Summarization | Yan Lei, Suncong Zheng, Roberts Wang, Liang Pang, Lei He, Shuang Chen, Wang Yu, Huawei Shen, Xueqi Cheng, Yuanzhuo Wang |
| ExpertMix: Aspect and Severity Detection in Conversational Complaints | Sarmistha Das, Apoorva Singh, Rishu Kumar Singh, Navneet Shreya, Sriparna Saha |
| MemTool: Optimizing Short-Term Memory Management for Dynamic Tool Retrieval and Invocation in LLM Agent Multi-Turn Conversations | Elias Lumer, Anmol Gulati, Vamse Kumar Subbiah, Pradeep Honaganahalli Basavaraju, James A Burke |
Session: IR4Good 1 - IR-for-Good Paper Session I
(Monday 10:30–12:30, Chemie)
| Paper | Authors |
|---|---|
| From Engagement to Empowerment: A Capability-Theoretic Rethinking of Recommender Systems | Vittoria Vineis, Gabriele Tolomei |
| Bias in Book Recommendation: A Case Study on the Danish Public Libraries | Savvina Daniil, Søren Højlund Mollerup, Laura Hollink |
| How Do LLMs Cite? A Mechanistic Interpretation of Attribution in RAG | Ian van Dort, Maria Heuss |
| All That Matters: Revisiting Children's Concept of Relevance in Primary School Context | Diletta Micol Tobia, Hrishita Chakrabarti, Maria Soledad Pera, Monica Landoni |
| When Attention Becomes Exposure in Generative Search | Shayan Alipour, Mehdi Kargar, Morteza Zihayat |
| Counterfactual Understanding via Retrieval-aware Multimodal Modeling for Time-to-Event Survival Prediction | Ha-Anh Hoang Nguyen, Tri-Duc Phan Le, Duc-Hoang Pham, Huy-Son Nguyen, Cam-Van Thi Nguyen, Duc-Trong Le, Hoang-Quynh Le |
| Joint Modeling of Candidate and Recruiter Preferences for Fair Two-Sided Job Matching | Clara Rus, Masoud Mansoury, Andrew Yates, Maarten de Rijke |
Session: Full Papers 3 - Specialized Retrieval Domains & Architectures
(Monday 14:30–15:30, Centrale)
| Paper | Authors |
|---|---|
| Filtering Few-Level Segment Regions for Efficient Subsequence Search in 3D Human Motions | Andrej Černek, Jan Sedmidubsky |
| Starbucks: Improved Training for 2D Matryoshka Embeddings | Shengyao Zhuang, Shuai Wang, Fabio Zheng, Bevan Koopman, Guido Zuccon |
| Website Segmentation Beyond Structure: A Benchmark on Functional and Digital Maturity Classes | Jonathan Gerber, Jasmin Saxer, Andreas Weiler, Michael Grossniklaus |
Session: Reproducibility 1 - Reproducibility I: Recommender Systems
(Monday 14:30–15:30, Chaos)
| Paper | Authors |
|---|---|
| Are Multimodal Embeddings Truly Beneficial for Recommendation? A Deep Dive into Whole vs. Individual Modalities | Yu Ye, Junchen Fu, Yu Song, Kaiwen Zheng, Joemon Jose |
| RecRankerEval: A Reproducible Framework for Deploying and Evaluating LLM-based Top-k Recommenders | Zeyuan Meng, Zixuan Yi, Iadh Ounis |
| Efficient Optimization of Hierarchical Identifiers for Generative Recommendation | Federica Valeau, David Vos, Odysseas Boufalis, Polytimi Gkotsi, Joshua Rosenthal |
| A Reproducible and Fair Evaluation of Partition-aware Collaborative Filtering | Domenico de Gioia, Claudio Pomo, Ludovico Boratto, Tommaso Di Noia |
| A Systematic Reproducibility Study of BSARec for Sequential Recommendation | Jan Hutter, Hua Chang Bakker, Stan Fris, Angela Madelon Bernardy, Yuanna Liu |
Session: IR4Good 2 - IR-for-Good Paper Session II
(Monday 14:30–15:30, Chemie)
| Paper | Authors |
|---|---|
| Measuring Political Stance and Consistency in Large Language Models | Mucahid Kutlu, Saban Kardas, Salah Feras Alali, Mohammad Nashat Maasfeh |
| Judiciously Reducing Sub-group Comparisons for Learning Intersectional Fair Representations | Clara Rus, Andrew Yates, Maarten de Rijke |
| Modeling Behavioral Patterns in News Recommendations Using Fuzzy Neural Networks | Kevin Innerebner, Stephan Bartl, Markus Reiter-Haas, Elisabeth Lex |
| Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers | Saron Samuel, Benjamin Van Durme, Eugene Yang |
Session: Findings Lightning Talks
(Monday 16:00–17:00, Chemie)
| Paper | Authors |
|---|---|
| Measuring Individual User Fairness with User Similarity and Effectiveness Disparity | Theresia Veronika Rampisela, Maria Maistro, Tuukka Ruotsalo, Christina Lioma |
| Nested Named Entity Recognition in Plasma Physics Research Articles | Muhammad Haris, Hans Höft, Markus Becker, Markus Stocker |
| Exploring User Simulators in Conversational Search: A Comparison between LLMs and Humans | Lili Lu, Fabio Crestani |
| Query Harmfulness Prediction (QHP): a New Challenge for Safer Retrieval Systems | Xiana Carrera, Marcos Fernández-Pichel, David E. Losada |
| The Effect of Document Summarization on LLM-based Relevance Judgments | Samaneh Mohtadi, Kevin Roitero, Stefano Mizzaro, Gianluca Demartini |
| Query–Document Dense Vectors for LLM Relevance Judgment Bias Analysis | Samaneh Mohtadi, Gianluca Demartini |
| Stop Contrasting, Start Distilling: Cross-Encoder Listwise Distillation and Synthetic Data for Dense Retrieval | Manveer Singh Tamber, Suleman Kazi, Vivek Sourabh, Jimmy Lin |
| Let Me Explain - Knowledge-based Retrieval Augmented Generation for Agricultural Recommendation Explanations | Daan Di Scala, Maaike de Boer |
| From Quotes to Concepts: Axial Coding of Political Debates with Ensemble LMs | Angelina Parfenova, David Graus, Juergen Pfeffer |
| Breaking Flat: A Generalised Query Performance Prediction Evaluation Framework | Payel Santra, Partha Basuchowdhuri, Debasis Ganguly |
Session: IR4Good - Invited Talks & Panel
(Monday 16:00–17:00, Centrale)
| Time | Speaker / Talk |
|---|---|
| 16:00–16:15 | Invited Talk - Dana McKay: "Anything But Ordinary: How other disciplines can help move beyond the average searcher" |
| 16:15–16:30 | Invited Talk - Djoerd Hiemstra: "OpenWebSearch.eu: Towards a shared infrastructure for assembling web search engines" |
| 16:30–17:15 | Panel Discussion - Moderator: Maria Heuss; Panelists: Madeleine I. G. Daepp, Dana McKay, Djoerd Hiemstra, Sanne Vrijenhoek, Bhaskar Mitra |
Session: Reproducibility 2 - Reproducibility II: Retrieval
(Monday 16:00–17:00, Chaos)
| Paper | Authors |
|---|---|
| Fast, Compact, Dynamic Indexing for Learned Sparse Retrieval Systems | Billy Rule, Joel Mackenzie |
| Down with the Hierarchy: The 'H' in HNSW Stands for "Hubs" | Blaise Munyampirwa, Vihan Lakshman, Benjamin Coleman |
| Multi-vector Reranking in the Era of Strong First-Stage Retrievers | Silvio Martinico, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini |
| Temporal Fact Conflicts in LLMs: Reproducibility Insights from Unifying DYNAMICQA and MULAN | Ritajit Dey, Iadh Ounis, Graham McDonald, Yashar Moshfeghi |
Session: Full Papers 4 - LLMs as Rankers, Rerankers & Judges
(Tuesday 10:30–12:30, Centrale)
| Paper | Authors |
|---|---|
| Training-Induced Bias Toward LLM-Generated Content in Dense Retrieval | William Xion, Wolfgang Nejdl |
| OrLog: Resolving Complex Queries with LLMs and Probabilistic Reasoning | Mohanna Hoveyda, Jelle Piepenbrock, Arjen de Vries, Maarten De Rijke, Faegheh Hasibi |
| LLM-based Listwise Reranking under the Effect of Positional Bias | Jingfen Qiao, Jin Huang, Xinyu Ma, Shuaiqiang Wang, Dawei Yin, Evangelos Kanoulas, Andrew Yates |
| RerAnchor: Anchoring Important Context in Multi-Modal Document Reranking | Tz-Huan Hsu, Sian-Yao Huang, Kuanlun Liao, Che-Yu Lin, Cheng-Lin Yang |
| How role-play shapes relevance judgment in zero-shot LLM rankers | Yumeng Wang, Jirui Qi, Catherine Chen, Panagiotis Eustratiadis, Suzan Verberne |
| Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs | Yuxi Xia, Loris Schoenegger, Benjamin Roth |
| LANCER: LLM Reranking for Nugget Coverage | Jia-Huei Ju, François G. Landry, Eugene Yang, Suzan Verberne, Andrew Yates |
Session: Full Papers 5 - RAG: Retrieval Utility, Scaling & Infrastructure
(Tuesday 10:30–12:30, Chaos)
| Paper | Authors |
|---|---|
| Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias | Mahdi Dehghan, Graham McDonald |
| Utilizing Metadata for Better Retrieval-Augmented Generation | Raquib Bin Yousuf, Shengzhe Xu, Mandar Sharma, Andrew Neeser, Chris Latimer, Naren Ramakrishnan |
| Predicting Retrieval Utility and Answer Quality in Retrieval-Augmented Generation | Fangzheng Tian, Debasis Ganguly, Craig Macdonald |
| Open Web Indexes for Remote Querying | Gijs Hendriksen, Djoerd Hiemstra, Arjen de Vries |
| LURE-RAG: Lightweight Utility-driven Reranking for Efficient RAG | Manish Chandra, Debasis Ganguly, Iadh Ounis |
| Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets | Laura Dietz, Bryan Li, Eugene Yang, Dawn Lawrie, William Walden, James Mayfield |
| Less LLM, More Documents: Searching for Improved RAG | Jingjie Ning, Yibo Kong, Yunfan Long, Jamie Callan |
Session: IR4Good 3 - IR-for-Good Paper Session III
(Tuesday 10:30–12:30, Chemie)
| Paper | Authors |
|---|---|
| AgriIR: A Scalable Framework for Domain-Specific Knowledge Retrieval | Shuvam Banerji Seal, Aheli Poddar, Alok Mishra, Dwaipayan Roy |
| Extending Logic Tensor Networks to Implicit Feedback for Representation-Aware Music Recommendation | Hannah Eckert, Oleg Lesota, Markus Schedl |
| Cultural Analytics for Good: Building Inclusive Evaluation Frameworks for Historical IR | Suchana Datta, Dwaipayan Roy, Derek Greene, Gerardine Meaney, Karen Wade, Philipp Mayr |
| One LLM to Train Them All: A Multi-Task Learning Framework for Fact-Checking | Malin Astrid Larsson, Harald Fosen Grunnaleite, Vinay Setty |
| How Information Retrieval Systems Construct and Amplify Immigration Narratives | Zarif Masud, Abhijit Paul, Syed Ishtiaque Ahmed, Ebrahim Bagheri |
| Towards Reliable Machine Translation: Scaling LLMs for Critical Error Detection and Safety | Muskaan Chopra, Lorenz Sparrenberg, Rafet Sifa |
| Integrating AI and IR paradigms for sustainable and trustworthy accurate access to large scale Biomedical information | Federico Borazio, Danilo Croce, Roberto Basili, Francesco Labbate |
| Debiasing CLIP with Neural Interventions | Amelia Gómez Grabowska, Jordi Gonzalez, Lluis Gomez |
Session: Full Papers 6 - Multimodal Retrieval & Embeddings
(Tuesday 14:30–16:00, Centrale)
| Paper | Authors |
|---|---|
| Event-aware Video Corpus Moment Retrieval | Danyang Hou, Liang Pang, Yanyan Lan, Huawei Shen, Xueqi Cheng |
| Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings | Joanne Affolter, Benjamin Martin, Elena V. Epure, Gabriel Meseguer-Brocal, Frédéric Kaplan |
| Image Complexity-Aware Adaptive Retrieval for Efficient Vision-Language Models | Mikel Williams-Lekuona, Georgina Cosma |
| Cross-Sensory Brain Passage Retrieval: Scaling Beyond Visual to Audio | Niall McGuire, Yashar Moshfeghi |
| Learning Audio–Visual Embeddings with Inferred Latent Interaction Graphs | Donghuo Zeng, Hao Niu, Yanan Wang, Masato Taya |
Session: Full Papers 7 - Trustworthy and Responsible Retrieval-Augmented Systems
(Tuesday 14:30–16:00, Chaos)
| Paper | Authors |
|---|---|
| Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate | Charles Moslonka, Hicham Randrianarivo, Arthur Garnier, Emmanuel Malherbe |
| FACTUM: Mechanistic Detection of Citation Hallucination in Long-Form RAG | Maxime Dassen, Rebecca Kotula, Kenton Murray, Andrew Yates, Dawn Lawrie, Efsun Kayi, James Mayfield, Kevin Duh |
| SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs | Nitish Kumar, Sannu Kumar, S Akash, Manish Gupta, Ankith Karat, Sriparna Saha |
| Bribery-Resistant Ranking Systems: A Multipartite User-Agnostic Framework for AI Act Compliance | Martim Baltazar, Ludovico Boratto, Mirko Marras, Guilherme Ramos |
| RAC: Retrieval-Augmented Clarification for Faithful Conversational Search | Ahmed Rayane Kebir, Vincent Guigue, Lynda Said Lhadj, Laure Soulier |
Session: Resource 1 - Resource I: Interactive and Conversational Search
(Tuesday 14:30–16:00, Chemie)
| Paper | Authors |
|---|---|
| WildClaims: Conversational Information Access in the Wild(Chat) | Hideaki Joko, Shakiba Amirshahi, Charles L. A. Clarke, Faegheh Hasibi |
| LISP - A Rich Interaction Dataset and Loggable Interactive Search Platform | Jana Isabelle Friese, Andreas Konstantin Kruff, Philipp Schaer, Norbert Fuhr, Nicola Ferro |
| UserSimCRS v2: Simulation-Based Evaluation for Conversational Recommender Systems | Nolwenn Bernard, Krisztian Balog |
| Sim4IA-Bench: A User Simulation Benchmark Suite for Next Query and Utterance Prediction | Andreas Konstantin Kruff, Christin Katharina Kreutz, Timo Breuer, Philipp Schaer, Krisztian Balog |
| Beyond the Click: A Framework for Inferring Cognitive Traces in Search | Saber Zerhoudi, Michael Granitzer |
Session: IRRJ Papers
(Wednesday 10:30–12:30, Centrale)
| Paper | Authors |
|---|---|
| On the challenges of studying bias in Recommender Systems: The effect of data characteristics and algorithm configuration | Savvina Daniil, Manel Slokom, Mirjam Cuper, Cynthia Liem, Jacco van Ossenbruggen |
| Annotative Indexing (demonstration) | Charles Clarke |
| Effectiveness of In-Context Learning for Due Diligence: A Reproducibility Study | Madhukar Dwivedi, Jaap Kamps |
| Emancipatory Information Retrieval | Bhaskar Mitra |
| Evaluating Dense Model-based Approaches for Multimodal Medical Case Retrieval | Catarina Pires, Sérgio Nunes, Luís F. Teixeira |
| A Survey of Inclusive Information Access | Yue Zheng, Haiming Liu, Mike Wald |
| Graph Embeddings to Empower Entity Retrieval | Emma J. Gerritse, Faegheh Hasibi, Arjen P. de Vries |
| Discussion: The Future of Open Access and IRRJ | (panel) |
Session: CLEF 2026 Tracks Presentations
(Wednesday 10:30–12:30, Chaos)
- BioASQ: Large-scale biomedical semantic indexing and question answering
- HIPE: Person-place relation extraction from historical documents
- CheckThat!: Identifying and verifying claims
- Touché: Argumentation systems
- ELOQUENT: Evaluation of generative language model quality
- PAN: Stylometry and digital text forensics
- eRisk: Early risk detection on the internet
- EXIST: Sexism identification in social networks
- FinMMeval: Multilingual and multimodal evaluation of financial AI systems
- TalentCLEF: Skill and job title intelligence for human capital management
- ImageCLEF: Multimodal challenge in CLEF
- LifeCLEF: Biodiversity monitoring using AI-powered tools
- LongEval: Longitudinal evaluation of model performance
- JOKER: Humor Detection, Search and Translation
- SimpleText: Simplify scientific text
- qCLEF: QuantumCLEF
Session: Resource 2 - Resource II: Domain- and Language-specific Datasets
(Wednesday 10:30–12:30, Chemie)
| Paper | Authors |
|---|---|
| FaE: A Resource of Logs, Profiles, and Rankings for Academic Expert Finding | Marjan Azimi, Alistair Moffat, Justin Zobel |
| SciNUP: Natural Language User Interest Profiles for Scientific Literature Recommendation | Mariam Arustashvili, Krisztian Balog |
| FoodNexus: Massive Food Knowledge for Recommender Systems | Ludovico Boratto, Gianni Fenu, Mirko Marras, Giacomo Medda, Giovanni Zedda |
| pt-image-ir-dataset: An Image Retrieval Dataset in European Portuguese | Rodrigo Duarte, António Branco, Hugo Proença, Ricardo Campos |
| CitiLink-Minutes: A Multilayer Annotated Dataset of Municipal Meeting Minutes | Ricardo Campos, Ana Pacheco, Ana Fernandes, Inês Cantante, Rute Rebouças, Luís Filipe Cunha, José Isidro, José Pedro Evans, Miguel Marques, Rodrigo Batista, Evelin Amorim, Alípio Jorge, Nuno Guimarães, Sérgio Nunes, António Leal, Purificação Silvano |
| ClaimPT: A Portuguese Dataset of Annotated Claims in News Articles | Ricardo Campos, Raquel Sequeira, Sara Nerea, Inês Cantante, Diogo Folques, Luís Filipe Cunha, João Canavilhas, António Branco, Alípio Jorge, Sérgio Nunes, Nuno Guimarães, Purificação Silvano |
| BioGraphletQA: Knowledge-Anchored Generation of Complex Question Answering Datasets | Richard A. A. Jonker, Bárbara Maria Ribeiro de Abreu Martins, Sérgio Matos |
Session: Full Papers 8 - Recommendation Systems & LLMs
(Wednesday 14:30–16:00, Chemie)
| Paper | Authors |
|---|---|
| From What to Why: Thought-Space Recommendation with Small Language Models | Prosenjit Biswas, Pervez Shaik, Abhinav Thorat, Ravi Kolla, Niranjan Pedanekar |
| Post-Training Denoising of User Profiles with LLMs in Collaborative Filtering Recommendation | Ervin Dervishaj, Maria Maistro, Tuukka Ruotsalo, Christina Lioma |
| PromptHG: Prompt-Enhanced Heterogeneous Graph for Personalized News Recommendation | Dang Kieu, Delvin Ce Zhang, Minh-Duc Nguyen, Qiang Wu, Min Xu, Dung D. Le |
| Interplay: Training Independent Simulators for Reference-Free Conversational Recommendation | Jerome Ramos, Feng Xia, Xi Wang, Shubham Chatterjee, Xiao Fu, Hossein A. Rahmani, Aldo Lipani |
| Improving Conversational Recommendation with Contextual Adaptation of External Recommenders and LLM-based Reranking | Chuang Li, Yang Deng, Weida Liang, Hengchang Hu, See-Kiong Ng, Min-Yen Kan, Haizhou Li |
Session: Resource 3 - Resource III: Evaluation Tooling for Retrieval and RecSys
(Wednesday 14:30–16:00, Centrale)
| Paper | Authors |
|---|---|
| CoRECT: A Framework for Evaluating Embedding Compression Techniques at Scale | Laura Caspari, Michael Dinzinger, Kanishka Ghosh Dastidar, Christofer Fellicious, Jelena Mitrović, Michael Granitzer |
| GREAT: Group Recommender Evaluation and Analysis Tool | Ariel Smith, David Contreras, Maria Salamo, Ludovico Boratto |
| Evaluating the Efficiency and Effectiveness of Learned Sparse Retrieval with the lsr_benchmark | Maik Fröbe, Ferdinand Schlatt, Cosimo Rulli, Tim Hagen, Jan Heinrich Merker, Gijs Hendriksen, Carlos Lassance, Franco Maria Nardini, Rossano Venturini, Martin Potthast |
| An Open SERP Mining Infrastructure for the Archive Query Log | Jan Heinrich Merker, Simon Ruth, Harrisen Scells, Martin Potthast |
| RoutIR: Fast Serving of Retrieval Pipelines for Retrieval-Augmented Generation | Eugene Yang, Andrew Yates, Dawn Lawrie, James Mayfield, Trevor Adriaanse |
Poster Session 1 - Short Papers & Demos
(Monday 13:30–14:30)
Short Papers
- Multi-Step Semantic Reasoning in Generative Retrieval
- SSEmb: A Joint Structural and Semantic Embedding Framework for Mathematical Formula Retrieval
- On the Viability of Exploiting Large Language Models for Misinformation Annotation
- Incorporating Q&A Nuggets into Retrieval-Augmented Generation
- Evolving Mixture of Low-Rank Experts for Continual User Modeling
- Personalized Autocompletion of Interactions with LLM-based Chatbots
- Evaluating Large Language Models as Domain-Specific Retrieval Agents: A Study on Cybersecurity Challenge Benchmarks
- Large Language Models as Assessors: On the Impact of Relevance Scales
- Analyzing AI Evaluation Benchmarks Through Information Retrieval and Network Science
- Evaluating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries
- DARE: A Dialectical Framework for Adversarial and Evidence-Aware RAG
- Do We Still Need Text for Video Retrieval in the Era of Vision-Language Models?
- Query Performance Prediction using a Child-focused Definition of Relevance
- ReFormeR: Learning and Applying Explicit Query Reformulation Patterns
- One Word is Enough: Minimal Adversarial Perturbations for Neural Text Ranking
- Text vs. Speech? Detecting Audio Deepfakes on Instagram
- MiNER: A Two-Stage Pipeline for Metadata Extraction from Municipal Meeting Minutes
- Revisiting Human-vs-LLM judgments on the TREC Podcast Track
- Forward Index Compression for Learned Sparse Retrieval
Demos
- OmniRec: The All-In-One Solution for Reproducible and Interoperable Recommender Systems Experimentation
- GutBrainKB: Exploring the Gut–Brain Interaction through a Reliable Biomedical KB
- CancerRAGent: Evidence-Linked and Safety-Guided Oncology Question Answering
- Talmud-IR: A Talmud-Inspired Interface for Discussing RAG Response Quality
- Enhancing Job Search Effectiveness with LLM-Powered Context-Aware Query Reformulation
- Pipeline Inspection, Visualization, and Interoperability in PyTerrier
Poster Session 2 - Short Papers & Demos
(Tuesday 13:30–14:30)
Short Papers
- LLM-Assisted Pseudo-Relevance Feedback
- Adversarial Edge Perturbation Framework in Graph-based Retrieval
- Zero-Cost Multilingual Context Pruning for Retrieval-augmented Generation
- EmbMerge: A Transformer-based Method for Fusing CDR Lists
- Enhancing Attention-based Context Attribution via Token Selection and Think-Twice Mechanism
- Beyond Persuasiveness: A User-Centric Evaluation Framework of Explanations for Food Recommendation
- Pre-trained LLMs Meet Sequential Recommenders: Efficient User-Centric Knowledge Distillation
- Beyond Correlations: A Downstream Evaluation Framework for Query Performance Prediction
- Trust Me on This: A User Study of Explainability for AI-Generated Responses
- Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction
- Structure-aware Pre-Retrieval Performance Prediction on Query Affinity Graphs
- Controlling Gender Bias in Retrieval via a Backpack Architecture
- Knowledge-enhanced Multi-Agent for LLM-based Recommendation
- From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA
- Aligning Instruction-Tuned LLMs for Event Extraction with Multi-objective Reinforcement Learning
- Topological Metric for Unsupervised Embedding Quality Evaluation
- Generative Retrieval via Few-shot Indexing
- Correct but Incomplete: Why Chain-of-Thought Cannot Currently Support Auditable Reasoning
Demos
- ImageSeek: A Hybrid Text-to-Image Image Retrieval System for Domain-Specific Collections
- LectureChat: Hybrid RAG over Wikipedia and Multilingual Lectures
- MedNuggetizer: Confidence-Based Information Nugget Extraction from Medical Documents
- SuiteEval: Simplifying Retrieval Benchmarks
- CitiLink: Enhancing Municipal Transparency and Citizen Engagement through Searchable Meeting Minutes
- Context Engineering for Agentic Data Science
- Creating Specialized RAG-Based Search Engines Using the Open Web Index
Collab-a-thon Sessions
- Monday 16:00–17:00 Collab-a-thon Session 1 (LAB.115)
- Tuesday 14:30–16:00 Collab-a-thon Session 2 (LAB.115)
- Wednesday 14:30–16:00 Collab-a-thon Session 3 (LAB.115)
Poster Session 3 - CLEF Papers & FDIA Doctoral Consortium
(Wednesday 13:30–14:30)
CLEF Papers
- Overview of Touché 2026: Argumentation Systems
- CLEF HIPE-2026: Evaluating Accurate and Efficient Person–Place Relation Extraction from Multilingual Historical Texts
- Evaluating Information Retrieval Models Along Time: The LongEval Lab
- ImageCLEF 2026: Multimodal Challenges in Medicine, Science, Agritech, and Security
- The CLEF-2026 CheckThat! Lab: Advancing Multilingual Fact-Checking
- BioASQ at CLEF2026: The fourteenth edition of the large-scale biomedical semantic indexing and question answering challenge
- QuantumCLEF 2026 - The Third Edition of the Quantum Computing Lab at CLEF
- EXIST 2026: Human Sensor Data for Multimodal Sexism Characterization in Social Media
- LifeCLEF 2026 Teaser: AI Challenges for Biodiversity Understanding and Ecosystem Management
- The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems
- ELOQUENT CLEF Shared Tasks for Evaluation of Generative Language Model Quality, 2026 edition
- TalentCLEF at CLEF2026: Skill and Job Title Intelligence for Human Capital Management
- CLEF 2026 JOKER Track: Humor Detection, Search, and Translation
- eRisk 2026: Tasks on Symptoms Ranking, Contextual and Conversational Approaches for Early Mental Health Detection
- Overview of PAN 2026: Voight-Kampff Generative AI Detection, Text Watermarking, Multi-Author Writing Style Analysis, Generative Plagiarism Detection, and Reasoning Trajectory Detection
- CLEF 2026 SimpleText Track: Simplify Scientific Text (and Nothing More)
FDIA Doctoral Consortium
- Generation of Metadata to Improve Tabular Information Discovery in Data Spaces
- Information Extraction from Data Visualizations in Scientific Literature
- Relevance by Design: A Systematic Review on Methodologies in Computational Legal IR
- Retrieval Augmented Generation for Proactive Research
- Document Retrieval with Fine-grained Relevance Cues
- Slicing Digital Hermeneutics Into Chunks
- User-Centric Interactive Search in Mixed Reality Toward Adaptive, Context-Aware Retrieval