Stouhi, Yara, Berrizbeitia, Francisco ORCID: https://orcid.org/0000-0002-1542-8435, Fitzgibbons, Megan ORCID: https://orcid.org/0000-0003-0409-6321, Charbonneau, Olivier, Chalifour, Joshua ORCID: https://orcid.org/0000-0001-7663-0509 and Majerczyk, Aviva (2024) Gaby Says | Gaby Dit. In: Applied AI Showcase Digital Poster Session, 8 Nov 2024, Montreal. (Unpublished)
Preview |
Image (image/png)
496kBv3-poster-Applied-AI.png - Presentation Available under License Spectrum Terms of Access. |
Abstract
This poster presents a study on deploying and evaluating a conversational agent using Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs) in an academic setting. The objective was to implement a RAG-based system capable of answering reference questions and develop an evaluation protocol to measure the "usefulness" of the chatbot, comparing multiple models. The system follows a two-step approach: it first retrieves relevant documents from a curated knowledge base and then generates accurate, context-aware responses using LLMs. The evaluation protocol involved grading responses across five dimensions: accuracy, groundedness, elicitation, completeness, and further assistance. The results highlighted that while RAG significantly reduced hallucinations, challenges remained in preventing them completely. The evaluation rubric effectively differentiated between the performances of various models, despite the subjective nature of the grading process. This work emphasizes the integration of RAG and LLMs in academic reference services to provide responsible, reliable, and user-centered AI responses.
Divisions: | Concordia University > Library |
---|---|
Item Type: | Conference or Workshop Item (Poster) |
Refereed: | No |
Authors: | Stouhi, Yara and Berrizbeitia, Francisco and Fitzgibbons, Megan and Charbonneau, Olivier and Chalifour, Joshua and Majerczyk, Aviva |
Date: | 14 November 2024 |
Funders: |
|
ID Code: | 994800 |
Deposited By: | Yara Stouhi |
Deposited On: | 18 Nov 2024 15:03 |
Last Modified: | 18 Nov 2024 15:03 |
Repository Staff Only: item control page