Program - Core Days

Session: Full Papers 1 - Core Retrieval Models, Representations & Evaluation

(Monday 10:30–12:30, Centrale)

Paper | Authors
Sample-Free Almost-Exact Estimation of Plackett-Luce Propensities for Off-Policy Ranking | Norman Knyazev, Harrie Oosterhuis
Validating Search Query Simulations: A Taxonomy of Measures | Andreas Konstantin Kruff, Nolwenn Bernard, Philipp Schaer
Reducing Human Effort to Validate LLM Relevance Judgements via Stratified Sampling | Simone Merlo, Stefano Marchesin, Guglielmo Faggioli, Nicola Ferro
Revealing MonoT5's Learning Mechanisms via Prompt-Token Adaptation | Marco Braga, Sean MacAvaney, Craig Macdonald, Gabriella Pasi
When Reducing Representations Improves Performance | Andrea Pasin, Guglielmo Faggioli, Nicola Ferro, Raffaele Perego, Nicola Tonellotto
An Empirical Study of Model Casing in Learned Sparse Retrieval | Emmanouil Georgios Lionis, Jia-Huei Ju, Angelos Nalmpantis, Casper Thuis, Sean MacAvaney, Andrew Yates
Improving Instruction-Aware Retrieval with Query-Preserving Regularization | Hyewon Kim, Hyun-Je Song

Session: Full Papers 2 - Applied Generation, Evaluation & Analysis with LLMs

(Monday 10:30–12:30, Chaos)

Paper | Authors
Contradictions in Context: Challenges for Retrieval-Augmented Generation in Healthcare | Saeedeh Javadi, Sara Mirabi, Manan Gangar, Bahadorreza Ofoghi
Small Models, Big Picture! A Language Model Augmentation for Enhanced Reader-Aware Summarization | Raghvendra Kumar, A S Poornash, Sriparna Saha
From Comments to Conclusions: Adaptive Reader-Aware Summary Generation in Low-Resource Languages via Agent Debate | Raghvendra Kumar, Mohammed Salman S A, Jaya Verma, Sriparna Saha
Prompt Compression in the Wild: Measuring Latency, Rate Adherence, and Quality for Faster LLM Inference | Cornelius Kummer, Lena Jurkschat, Michael Färber, Sahar Vahdati
Towards Quantitative Summarization Evaluation: An Integrated Atomic-Based Evaluation Framework and Dataset for Text Summarization | Yan Lei, Suncong Zheng, Roberts Wang, Liang Pang, Lei He, Shuang Chen, Wang Yu, Huawei Shen, Xueqi Cheng, Yuanzhuo Wang
ExpertMix: Aspect and Severity Detection in Conversational Complaints | Sarmistha Das, Apoorva Singh, Rishu Kumar Singh, Navneet Shreya, Sriparna Saha
MemTool: Optimizing Short-Term Memory Management for Dynamic Tool Retrieval and Invocation in LLM Agent Multi-Turn Conversations | Elias Lumer, Anmol Gulati, Vamse Kumar Subbiah, Pradeep Honaganahalli Basavaraju, James A Burke

Session: IR4Good 1 - IR-for-Good Paper Session I

(Monday 10:30–12:30, Chemie)

Paper | Authors
From Engagement to Empowerment: A Capability-Theoretic Rethinking of Recommender Systems | Vittoria Vineis, Gabriele Tolomei
Bias in Book Recommendation: A Case Study on the Danish Public Libraries | Savvina Daniil, Søren Højlund Mollerup, Laura Hollink
How Do LLMs Cite? A Mechanistic Interpretation of Attribution in RAG | Ian van Dort, Maria Heuss
All That Matters: Revisiting Children's Concept of Relevance in Primary School Context | Diletta Micol Tobia, Hrishita Chakrabarti, Maria Soledad Pera, Monica Landoni
When Attention Becomes Exposure in Generative Search | Shayan Alipour, Mehdi Kargar, Morteza Zihayat
Counterfactual Understanding via Retrieval-aware Multimodal Modeling for Time-to-Event Survival Prediction | Ha-Anh Hoang Nguyen, Tri-Duc Phan Le, Duc-Hoang Pham, Huy-Son Nguyen, Cam-Van Thi Nguyen, Duc-Trong Le, Hoang-Quynh Le
Joint Modeling of Candidate and Recruiter Preferences for Fair Two-Sided Job Matching | Clara Rus, Masoud Mansoury, Andrew Yates, Maarten de Rijke

Session: Full Papers 3 - Specialized Retrieval Domains & Architectures

(Monday 14:30–15:30, Centrale)

Paper | Authors
Filtering Few-Level Segment Regions for Efficient Subsequence Search in 3D Human Motions | Andrej Černek, Jan Sedmidubsky
Starbucks: Improved Training for 2D Matryoshka Embeddings | Shengyao Zhuang, Shuai Wang, Fabio Zheng, Bevan Koopman, Guido Zuccon
Website Segmentation Beyond Structure: A Benchmark on Functional and Digital Maturity Classes | Jonathan Gerber, Jasmin Saxer, Andreas Weiler, Michael Grossniklaus

Session: Reproducibility 1 - Reproducibility I: Recommender Systems

(Monday 14:30–15:30, Chaos)

Paper | Authors
Are Multimodal Embeddings Truly Beneficial for Recommendation? A Deep Dive into Whole vs. Individual Modalities | Yu Ye, Junchen Fu, Yu Song, Kaiwen Zheng, Joemon Jose
RecRankerEval: A Reproducible Framework for Deploying and Evaluating LLM-based Top-k Recommenders | Zeyuan Meng, Zixuan Yi, Iadh Ounis
Efficient Optimization of Hierarchical Identifiers for Generative Recommendation | Federica Valeau, David Vos, Odysseas Boufalis, Polytimi Gkotsi, Joshua Rosenthal
A Reproducible and Fair Evaluation of Partition-aware Collaborative Filtering | Domenico de Gioia, Claudio Pomo, Ludovico Boratto, Tommaso Di Noia
A Systematic Reproducibility Study of BSARec for Sequential Recommendation | Jan Hutter, Hua Chang Bakker, Stan Fris, Angela Madelon Bernardy, Yuanna Liu

Session: IR4Good 2 - IR-for-Good Paper Session II

(Monday 14:30–15:30, Chemie)

Paper | Authors
Measuring Political Stance and Consistency in Large Language Models | Mucahid Kutlu, Saban Kardas, Salah Feras Alali, Mohammad Nashat Maasfeh
Judiciously Reducing Sub-group Comparisons for Learning Intersectional Fair Representations | Clara Rus, Andrew Yates, Maarten de Rijke
Modeling Behavioral Patterns in News Recommendations Using Fuzzy Neural Networks | Kevin Innerebner, Stephan Bartl, Markus Reiter-Haas, Elisabeth Lex
Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers | Saron Samuel, Benjamin Van Durme, Eugene Yang

Session: Findings Lightning Talks

(Monday 16:00–17:00, Chemie)

Paper | Authors
Measuring Individual User Fairness with User Similarity and Effectiveness Disparity | Theresia Veronika Rampisela, Maria Maistro, Tuukka Ruotsalo, Christina Lioma
Nested Named Entity Recognition in Plasma Physics Research Articles | Muhammad Haris, Hans Höft, Markus Becker, Markus Stocker
Exploring User Simulators in Conversational Search: A Comparison between LLMs and Humans | Lili Lu, Fabio Crestani
Query Harmfulness Prediction (QHP): a New Challenge for Safer Retrieval Systems | Xiana Carrera, Marcos Fernández-Pichel, David E. Losada
The Effect of Document Summarization on LLM-based Relevance Judgments | Samaneh Mohtadi, Kevin Roitero, Stefano Mizzaro, Gianluca Demartini
Query–Document Dense Vectors for LLM Relevance Judgment Bias Analysis | Samaneh Mohtadi, Gianluca Demartini
Stop Contrasting, Start Distilling: Cross-Encoder Listwise Distillation and Synthetic Data for Dense Retrieval | Manveer Singh Tamber, Suleman Kazi, Vivek Sourabh, Jimmy Lin
Let Me Explain - Knowledge-based Retrieval Augmented Generation for Agricultural Recommendation Explanations | Daan Di Scala, Maaike de Boer
From Quotes to Concepts: Axial Coding of Political Debates with Ensemble LMs | Angelina Parfenova, David Graus, Juergen Pfeffer
Breaking Flat: A Generalised Query Performance Prediction Evaluation Framework | Payel Santra, Partha Basuchowdhuri, Debasis Ganguly

Session: IR4Good - Invited Talks & Panel

(Monday 16:00–17:00, Centrale)

Time | Speaker / Talk
16:00–16:15 | Invited Talk - Dana McKay: "Anything But Ordinary: How other disciplines can help move beyond the average searcher"
16:15–16:30 | Invited Talk - Djoerd Hiemstra: "OpenWebSearch.eu: Towards a shared infrastructure for assembling web search engines"
16:30–17:15 | Panel Discussion - Moderator: Maria Heuss; Panelists: Madeleine I. G. Daepp, Dana McKay, Djoerd Hiemstra, Sanne Vrijenhoek, Bhaskar Mitra

Session: Reproducibility 2 - Reproducibility II: Retrieval

(Monday 16:00–17:00, Chaos)

Paper | Authors
Fast, Compact, Dynamic Indexing for Learned Sparse Retrieval Systems | Billy Rule, Joel Mackenzie
Down with the Hierarchy: The 'H' in HNSW Stands for "Hubs" | Blaise Munyampirwa, Vihan Lakshman, Benjamin Coleman
Multi-vector Reranking in the Era of Strong First-Stage Retrievers | Silvio Martinico, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini
Temporal Fact Conflicts in LLMs: Reproducibility Insights from Unifying DYNAMICQA and MULAN | Ritajit Dey, Iadh Ounis, Graham McDonald, Yashar Moshfeghi

Session: Full Papers 4 - LLMs as Rankers, Rerankers & Judges

(Tuesday 10:30–12:30, Centrale)

Paper | Authors
Training-Induced Bias Toward LLM-Generated Content in Dense Retrieval | William Xion, Wolfgang Nejdl
OrLog: Resolving Complex Queries with LLMs and Probabilistic Reasoning | Mohanna Hoveyda, Jelle Piepenbrock, Arjen de Vries, Maarten de Rijke, Faegheh Hasibi
LLM-based Listwise Reranking under the Effect of Positional Bias | Jingfen Qiao, Jin Huang, Xinyu Ma, Shuaiqiang Wang, Dawei Yin, Evangelos Kanoulas, Andrew Yates
RerAnchor: Anchoring Important Context in Multi-Modal Document Reranking | Tz-Huan Hsu, Sian-Yao Huang, Kuanlun Liao, Che-Yu Lin, Cheng-Lin Yang
How role-play shapes relevance judgment in zero-shot LLM rankers | Yumeng Wang, Jirui Qi, Catherine Chen, Panagiotis Eustratiadis, Suzan Verberne
Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs | Yuxi Xia, Loris Schoenegger, Benjamin Roth
LANCER: LLM Reranking for Nugget Coverage | Jia-Huei Ju, François G. Landry, Eugene Yang, Suzan Verberne, Andrew Yates

Session: Full Papers 5 - RAG: Retrieval Utility, Scaling & Infrastructure

(Tuesday 10:30–12:30, Chaos)

Paper | Authors
Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias | Mahdi Dehghan, Graham McDonald
Utilizing Metadata for Better Retrieval-Augmented Generation | Raquib Bin Yousuf, Shengzhe Xu, Mandar Sharma, Andrew Neeser, Chris Latimer, Naren Ramakrishnan
Predicting Retrieval Utility and Answer Quality in Retrieval-Augmented Generation | Fangzheng Tian, Debasis Ganguly, Craig Macdonald
Open Web Indexes for Remote Querying | Gijs Hendriksen, Djoerd Hiemstra, Arjen de Vries
LURE-RAG: Lightweight Utility-driven Reranking for Efficient RAG | Manish Chandra, Debasis Ganguly, Iadh Ounis
Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets | Laura Dietz, Bryan Li, Eugene Yang, Dawn Lawrie, William Walden, James Mayfield
Less LLM, More Documents: Searching for Improved RAG | Jingjie Ning, Yibo Kong, Yunfan Long, Jamie Callan

Session: IR4Good 3 - IR-for-Good Paper Session III

(Tuesday 10:30–12:30, Chemie)

Paper | Authors
AgriIR: A Scalable Framework for Domain-Specific Knowledge Retrieval | Shuvam Banerji Seal, Aheli Poddar, Alok Mishra, Dwaipayan Roy
Extending Logic Tensor Networks to Implicit Feedback for Representation-Aware Music Recommendation | Hannah Eckert, Oleg Lesota, Markus Schedl
Cultural Analytics for Good: Building Inclusive Evaluation Frameworks for Historical IR | Suchana Datta, Dwaipayan Roy, Derek Greene, Gerardine Meaney, Karen Wade, Philipp Mayr
One LLM to Train Them All: A Multi-Task Learning Framework for Fact-Checking | Malin Astrid Larsson, Harald Fosen Grunnaleite, Vinay Setty
How Information Retrieval Systems Construct and Amplify Immigration Narratives | Zarif Masud, Abhijit Paul, Syed Ishtiaque Ahmed, Ebrahim Bagheri
Towards Reliable Machine Translation: Scaling LLMs for Critical Error Detection and Safety | Muskaan Chopra, Lorenz Sparrenberg, Rafet Sifa
Integrating AI and IR paradigms for sustainable and trustworthy accurate access to large scale Biomedical information | Federico Borazio, Danilo Croce, Roberto Basili, Francesco Labbate
Debiasing CLIP with Neural Interventions | Amelia Gómez Grabowska, Jordi Gonzalez, Lluis Gomez

Session: Full Papers 6 - Multimodal Retrieval & Embeddings

(Tuesday 14:30–16:00, Centrale)

Paper | Authors
Event-aware Video Corpus Moment Retrieval | Danyang Hou, Liang Pang, Yanyan Lan, Huawei Shen, Xueqi Cheng
Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings | Joanne Affolter, Benjamin Martin, Elena V. Epure, Gabriel Meseguer-Brocal, Frédéric Kaplan
Image Complexity-Aware Adaptive Retrieval for Efficient Vision-Language Models | Mikel Williams-Lekuona, Georgina Cosma
Cross-Sensory Brain Passage Retrieval: Scaling Beyond Visual to Audio | Niall McGuire, Yashar Moshfeghi
Learning Audio–Visual Embeddings with Inferred Latent Interaction Graphs | Donghuo Zeng, Hao Niu, Yanan Wang, Masato Taya

Session: Full Papers 7 - Trustworthy and Responsible Retrieval-Augmented Systems

(Tuesday 14:30–16:00, Chaos)

Paper | Authors
Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate | Charles Moslonka, Hicham Randrianarivo, Arthur Garnier, Emmanuel Malherbe
FACTUM: Mechanistic Detection of Citation Hallucination in Long-Form RAG | Maxime Dassen, Rebecca Kotula, Kenton Murray, Andrew Yates, Dawn Lawrie, Efsun Kayi, James Mayfield, Kevin Duh
SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs | Nitish Kumar, Sannu Kumar, S Akash, Manish Gupta, Ankith Karat, Sriparna Saha
Bribery-Resistant Ranking Systems: A Multipartite User-Agnostic Framework for AI Act Compliance | Martim Baltazar, Ludovico Boratto, Mirko Marras, Guilherme Ramos
RAC: Retrieval-Augmented Clarification for Faithful Conversational Search | Ahmed Rayane Kebir, Vincent Guigue, Lynda Said Lhadj, Laure Soulier

Session: Resource 1 - Resource I: Interactive and Conversational Search

(Tuesday 14:30–16:00, Chemie)

Paper | Authors
WildClaims: Conversational Information Access in the Wild(Chat) | Hideaki Joko, Shakiba Amirshahi, Charles L. A. Clarke, Faegheh Hasibi
LISP - A Rich Interaction Dataset and Loggable Interactive Search Platform | Jana Isabelle Friese, Andreas Konstantin Kruff, Philipp Schaer, Norbert Fuhr, Nicola Ferro
UserSimCRS v2: Simulation-Based Evaluation for Conversational Recommender Systems | Nolwenn Bernard, Krisztian Balog
Sim4IA-Bench: A User Simulation Benchmark Suite for Next Query and Utterance Prediction | Andreas Konstantin Kruff, Christin Katharina Kreutz, Timo Breuer, Philipp Schaer, Krisztian Balog
Beyond the Click: A Framework for Inferring Cognitive Traces in Search | Saber Zerhoudi, Michael Granitzer

Session: IRRJ Papers

(Wednesday 10:30–12:30, Centrale)

Paper | Authors
On the challenges of studying bias in Recommender Systems: The effect of data characteristics and algorithm configuration | Savvina Daniil, Manel Slokom, Mirjam Cuper, Cynthia Liem, Jacco van Ossenbruggen
Annotative Indexing (demonstration) | Charles Clarke
Effectiveness of In-Context Learning for Due Diligence: A Reproducibility Study | Madhukar Dwivedi, Jaap Kamps
Emancipatory Information Retrieval | Bhaskar Mitra
Evaluating Dense Model-based Approaches for Multimodal Medical Case Retrieval | Catarina Pires, Sérgio Nunes, Luís F. Teixeira
A Survey of Inclusive Information Access | Yue Zheng, Haiming Liu, Mike Wald
Graph Embeddings to Empower Entity Retrieval | Emma J. Gerritse, Faegheh Hasibi, Arjen P. de Vries
Discussion: The Future of Open Access and IRRJ | (panel)

Session: CLEF 2026 Tracks Presentations

(Wednesday 10:30–12:30, Chaos)

  • BioASQ: Large-scale biomedical semantic indexing and question answering
  • HIPE: Person-place relation extraction from historical documents
  • CheckThat!: Identifying and verifying claims
  • Touché: Argumentation systems
  • ELOQUENT: Evaluation of generative language model quality
  • PAN: Stylometry and digital text forensics
  • eRisk: Early risk detection on the internet
  • EXIST: Sexism identification in social networks
  • FinMMEval: Multilingual and multimodal evaluation of financial AI systems
  • TalentCLEF: Skill and job title intelligence for human capital management
  • ImageCLEF: Multimodal challenge in CLEF
  • LifeCLEF: Biodiversity monitoring using AI-powered tools
  • LongEval: Longitudinal evaluation of model performance
  • JOKER: Humor Detection, Search and Translation
  • SimpleText: Simplify scientific text
  • qCLEF: QuantumCLEF

Session: Resource 2 - Resource II: Domain- and Language-specific Datasets

(Wednesday 10:30–12:30, Chemie)

Paper | Authors
FaE: A Resource of Logs, Profiles, and Rankings for Academic Expert Finding | Marjan Azimi, Alistair Moffat, Justin Zobel
SciNUP: Natural Language User Interest Profiles for Scientific Literature Recommendation | Mariam Arustashvili, Krisztian Balog
FoodNexus: Massive Food Knowledge for Recommender Systems | Ludovico Boratto, Gianni Fenu, Mirko Marras, Giacomo Medda, Giovanni Zedda
pt-image-ir-dataset: An Image Retrieval Dataset in European Portuguese | Rodrigo Duarte, António Branco, Hugo Proença, Ricardo Campos
CitiLink-Minutes: A Multilayer Annotated Dataset of Municipal Meeting Minutes | Ricardo Campos, Ana Pacheco, Ana Fernandes, Inês Cantante, Rute Rebouças, Luís Filipe Cunha, José Isidro, José Pedro Evans, Miguel Marques, Rodrigo Batista, Evelin Amorim, Alípio Jorge, Nuno Guimarães, Sérgio Nunes, António Leal, Purificação Silvano
ClaimPT: A Portuguese Dataset of Annotated Claims in News Articles | Ricardo Campos, Raquel Sequeira, Sara Nerea, Inês Cantante, Diogo Folques, Luís Filipe Cunha, João Canavilhas, António Branco, Alípio Jorge, Sérgio Nunes, Nuno Guimarães, Purificação Silvano
BioGraphletQA: Knowledge-Anchored Generation of Complex Question Answering Datasets | Richard A. A. Jonker, Bárbara Maria Ribeiro de Abreu Martins, Sérgio Matos

Session: Full Papers 8 - Recommendation Systems & LLMs

(Wednesday 14:30–16:00, Chemie)

Paper | Authors
From What to Why: Thought-Space Recommendation with Small Language Models | Prosenjit Biswas, Pervez Shaik, Abhinav Thorat, Ravi Kolla, Niranjan Pedanekar
Post-Training Denoising of User Profiles with LLMs in Collaborative Filtering Recommendation | Ervin Dervishaj, Maria Maistro, Tuukka Ruotsalo, Christina Lioma
PromptHG: Prompt-Enhanced Heterogeneous Graph for Personalized News Recommendation | Dang Kieu, Delvin Ce Zhang, Minh-Duc Nguyen, Qiang Wu, Min Xu, Dung D. Le
Interplay: Training Independent Simulators for Reference-Free Conversational Recommendation | Jerome Ramos, Feng Xia, Xi Wang, Shubham Chatterjee, Xiao Fu, Hossein A. Rahmani, Aldo Lipani
Improving Conversational Recommendation with Contextual Adaptation of External Recommenders and LLM-based Reranking | Chuang Li, Yang Deng, Weida Liang, Hengchang Hu, See-Kiong Ng, Min-Yen Kan, Haizhou Li

Session: Resource 3 - Resource III: Evaluation Tooling for Retrieval and RecSys

(Wednesday 14:30–16:00, Centrale)

Paper | Authors
CoRECT: A Framework for Evaluating Embedding Compression Techniques at Scale | Laura Caspari, Michael Dinzinger, Kanishka Ghosh Dastidar, Christofer Fellicious, Jelena Mitrović, Michael Granitzer
GREAT: Group Recommender Evaluation and Analysis Tool | Ariel Smith, David Contreras, Maria Salamo, Ludovico Boratto
Evaluating the Efficiency and Effectiveness of Learned Sparse Retrieval with the lsr_benchmark | Maik Fröbe, Ferdinand Schlatt, Cosimo Rulli, Tim Hagen, Jan Heinrich Merker, Gijs Hendriksen, Carlos Lassance, Franco Maria Nardini, Rossano Venturini, Martin Potthast
An Open SERP Mining Infrastructure for the Archive Query Log | Jan Heinrich Merker, Simon Ruth, Harrisen Scells, Martin Potthast
RoutIR: Fast Serving of Retrieval Pipelines for Retrieval-Augmented Generation | Eugene Yang, Andrew Yates, Dawn Lawrie, James Mayfield, Trevor Adriaanse

Poster Session 1 - Short Papers & Demos

(Monday 13:30–14:30)

Short Papers

  • Multi-Step Semantic Reasoning in Generative Retrieval
  • SSEmb: A Joint Structural and Semantic Embedding Framework for Mathematical Formula Retrieval
  • On the Viability of Exploiting Large Language Models for Misinformation Annotation
  • Incorporating Q&A Nuggets into Retrieval-Augmented Generation
  • Evolving Mixture of Low-Rank Experts for Continual User Modeling
  • Personalized Autocompletion of Interactions with LLM-based Chatbots
  • Evaluating Large Language Models as Domain-Specific Retrieval Agents: A Study on Cybersecurity Challenge Benchmarks
  • Large Language Models as Assessors: On the Impact of Relevance Scales
  • Analyzing AI Evaluation Benchmarks Through Information Retrieval and Network Science
  • Evaluating Retrieval-Augmented Generation Systems on Unanswerable, Uncheatable, Realistic, Multi-hop Queries
  • DARE: A Dialectical Framework for Adversarial and Evidence-Aware RAG
  • Do We Still Need Text for Video Retrieval in the Era of Vision-Language Models?
  • Query Performance Prediction using a Child-focused Definition of Relevance
  • ReFormeR: Learning and Applying Explicit Query Reformulation Patterns
  • One Word is Enough: Minimal Adversarial Perturbations for Neural Text Ranking
  • Text vs. Speech? Detecting Audio Deepfakes on Instagram
  • MiNER: A Two-Stage Pipeline for Metadata Extraction from Municipal Meeting Minutes
  • Revisiting Human-vs-LLM judgments on the TREC Podcast Track
  • Forward Index Compression for Learned Sparse Retrieval

Demos

  • OmniRec: The All-In-One Solution for Reproducible and Interoperable Recommender Systems Experimentation
  • GutBrainKB: Exploring the Gut–Brain Interaction through a Reliable Biomedical KB
  • CancerRAGent: Evidence-Linked and Safety-Guided Oncology Question Answering
  • Talmud-IR: A Talmud-Inspired Interface for Discussing RAG Response Quality
  • Enhancing Job Search Effectiveness with LLM-Powered Context-Aware Query Reformulation
  • Pipeline Inspection, Visualization, and Interoperability in PyTerrier

Poster Session 2 - Short Papers & Demos

(Tuesday 13:30–14:30)

Short Papers

  • LLM-Assisted Pseudo-Relevance Feedback
  • Adversarial Edge Perturbation Framework in Graph-based Retrieval
  • Zero-Cost Multilingual Context Pruning for Retrieval-augmented Generation
  • EmbMerge: A Transformer-based Method for Fusing CDR Lists
  • Enhancing Attention-based Context Attribution via Token Selection and Think-Twice Mechanism
  • Beyond Persuasiveness: A User-Centric Evaluation Framework of Explanations for Food Recommendation
  • Pre-trained LLMs Meet Sequential Recommenders: Efficient User-Centric Knowledge Distillation
  • Beyond Correlations: A Downstream Evaluation Framework for Query Performance Prediction
  • Trust Me on This: A User Study of Explainability for AI-Generated Responses
  • Principled Context Engineering for RAG: Statistical Guarantees via Conformal Prediction
  • Structure-aware Pre-Retrieval Performance Prediction on Query Affinity Graphs
  • Controlling Gender Bias in Retrieval via a Backpack Architecture
  • Knowledge-enhanced Multi-Agent for LLM-based Recommendation
  • From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA
  • Aligning Instruction-Tuned LLMs for Event Extraction with Multi-objective Reinforcement Learning
  • Topological Metric for Unsupervised Embedding Quality Evaluation
  • Generative Retrieval via Few-shot Indexing
  • Correct but Incomplete: Why Chain-of-Thought Cannot Currently Support Auditable Reasoning

Demos

  • ImageSeek: A Hybrid Text-to-Image Image Retrieval System for Domain-Specific Collections
  • LectureChat: Hybrid RAG over Wikipedia and Multilingual Lectures
  • MedNuggetizer: Confidence-Based Information Nugget Extraction from Medical Documents
  • SuiteEval: Simplifying Retrieval Benchmarks
  • CitiLink: Enhancing Municipal Transparency and Citizen Engagement through Searchable Meeting Minutes
  • Context Engineering for Agentic Data Science
  • Creating Specialized RAG-Based Search Engines Using the Open Web Index

Collab-a-thon Sessions

  • Monday 16:00–17:00 Collab-a-thon Session 1 (LAB.115)
  • Tuesday 14:30–16:00 Collab-a-thon Session 2 (LAB.115)
  • Wednesday 14:30–16:00 Collab-a-thon Session 3 (LAB.115)

Poster Session 3 - CLEF Papers & FDIA Doctoral Consortium

(Wednesday 13:30–14:30)

CLEF Papers

  • Overview of Touché 2026: Argumentation Systems
  • CLEF HIPE-2026: Evaluating Accurate and Efficient Person–Place Relation Extraction from Multilingual Historical Texts
  • Evaluating Information Retrieval Models Along Time: The LongEval Lab
  • ImageCLEF 2026: Multimodal Challenges in Medicine, Science, Agritech, and Security
  • The CLEF-2026 CheckThat! Lab: Advancing Multilingual Fact-Checking
  • BioASQ at CLEF2026: The fourteenth edition of the large-scale biomedical semantic indexing and question answering challenge
  • QuantumCLEF 2026 - The Third Edition of the Quantum Computing Lab at CLEF
  • EXIST 2026: Human Sensor Data for Multimodal Sexism Characterization in Social Media
  • LifeCLEF 2026 Teaser: AI Challenges for Biodiversity Understanding and Ecosystem Management
  • The CLEF-2026 FinMMEval Lab: Multilingual and Multimodal Evaluation of Financial AI Systems
  • ELOQUENT CLEF Shared Tasks for Evaluation of Generative Language Model Quality, 2026 edition
  • TalentCLEF at CLEF2026: Skill and Job Title Intelligence for Human Capital Management
  • CLEF 2026 JOKER Track: Humor Detection, Search, and Translation
  • eRisk 2026: Tasks on Symptoms Ranking, Contextual and Conversational Approaches for Early Mental Health Detection
  • Overview of PAN 2026: Voight-Kampff Generative AI Detection, Text Watermarking, Multi-Author Writing Style Analysis, Generative Plagiarism Detection, and Reasoning Trajectory Detection
  • CLEF 2026 SimpleText Track: Simplify Scientific Text (and Nothing More)

FDIA Doctoral Consortium

  • Generation of Metadata to Improve Tabular Information Discovery in Data Spaces
  • Information Extraction from Data Visualizations in Scientific Literature
  • Relevance by Design: A Systematic Review on Methodologies in Computational Legal IR
  • Retrieval Augmented Generation for Proactive Research
  • Document Retrieval with Fine-grained Relevance Cues
  • Slicing Digital Hermeneutics Into Chunks
  • User-Centric Interactive Search in Mixed Reality Toward Adaptive, Context-Aware Retrieval