Development timeline
Early Language Models
Early language models relied on counting word sequences (n-grams) and predicting the next word from the few words that preceded it. While effective for narrowly defined tasks, these models faced a fundamental limitation: they could only produce meaningful output for sequences they had explicitly seen during training. As a result, they generalized poorly to new or unseen inputs, which severely limited their usefulness in real-world applications where language is highly variable.
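To make that limitation concrete, here is a minimal sketch of a count-based bigram model. The toy corpus, function names, and tie-breaking behavior are illustrative assumptions, not a reconstruction of any particular historical system.

```python
# A minimal count-based (bigram) language model: predict the next word
# purely from how often each word followed the previous one in training data.
from collections import defaultdict, Counter

corpus = "the cat sat on the mat . the cat ran .".split()  # toy training data (assumed)

# Count how often each word follows each preceding word.
bigram_counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigram_counts[prev][nxt] += 1

def predict_next(word):
    """Return the most frequent follower of `word`, or None if the context is unseen."""
    followers = bigram_counts.get(word)
    if not followers:
        return None  # the model has nothing to say about contexts it never saw
    return followers.most_common(1)[0][0]

print(predict_next("the"))   # 'cat' -- seen twice after 'the' in the toy corpus
print(predict_next("lamp"))  # None -- no generalization to unseen words
```

The failure on `"lamp"` is exactly the brittleness described above: without having counted a sequence, the model cannot say anything useful about it.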
Breakthrough with Transformer Architecture
A major breakthrough came with the Transformer architecture, introduced in 2017 in the paper "Attention Is All You Need". Transformers allowed models to process entire sequences of text simultaneously rather than word by word, enabling better context understanding and faster training on large datasets. This innovation paved the way for the rise of large language models (LLMs) with billions of parameters.
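The "simultaneous" processing comes from self-attention, in which every token scores its relevance to every other token in one matrix operation. Below is a minimal sketch of scaled dot-product self-attention; the shapes, random inputs, and single-head setup are simplifying assumptions, and real Transformers add multiple heads, masking, and many stacked layers.

```python
# Scaled dot-product self-attention: all positions attend to all others at once,
# with no left-to-right recurrence.
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Compute context-aware representations for every token in X in parallel."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv                 # queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # similarity of every token pair
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over positions
    return weights @ V                               # weighted mix of all tokens

rng = np.random.default_rng(0)
seq_len, d_model = 5, 8                              # 5 tokens, 8-dimensional embeddings (assumed)
X = rng.standard_normal((seq_len, d_model))
Wq, Wk, Wv = (rng.standard_normal((d_model, d_model)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)           # (5, 8): each token sees the whole sequence
```

Because the attention weights are computed for the whole sequence in one matrix product, training parallelizes well on modern hardware, which is part of what made scaling to billions of parameters practical.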