LiSSA: Toward Generic Traceability Link Recovery through Retrieval-Augmented Generation

by Dominik Fuchß, Tobias Hey, Jan Keim, Haoyu Liu, Niklas Ewald, Tobias Thirolf, and Anne Koziolek

To be published at the 47th IEEE/ACM International Conference on Software Engineering (ICSE 2025), April 27 - May 03 2025.

Approach Overview

Abstract

A multitude of software artifacts must be handled during the development and maintenance of a software system. These artifacts interrelate in multiple, complex ways. Many software engineering tasks are therefore enabled, and even empowered, by a clear understanding of artifact interrelationships and by the continued advancement of techniques for automated artifact linking.

However, current approaches to automatic Traceability Link Recovery (TLR) mostly target the links between specific sets of artifacts, such as those between requirements and code. Fortunately, recent advancements in Large Language Models (LLMs) can enable TLR approaches to achieve broad applicability. Still, providing the LLMs with the specific information needed to perform TLR remains a nontrivial problem.

In this paper, we present LiSSA, a framework that harnesses LLMs and enhances their performance through Retrieval-Augmented Generation (RAG). We empirically evaluate LiSSA on three different TLR tasks: requirements to code, documentation to code, and architecture documentation to architecture models. We compare our approach to state-of-the-art approaches.

Our results show that the RAG-based approach can significantly outperform the state of the art on the code-related tasks. However, further research is required to improve the performance of RAG-based approaches before they are applicable in practice.
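To make the general idea of RAG-style link recovery from the abstract more concrete, the following is a minimal sketch and not LiSSA's actual implementation. It assumes a two-step pipeline: a retrieval step narrows the target artifacts down to a few candidates per source artifact, and a judging step decides whether each candidate pair is a trace link. All names here (`embed`, `retrieve`, `recover_links`, `keyword_judge`) are hypothetical. The bag-of-words similarity stands in for an embedding model, and `keyword_judge` stands in for the LLM call.

```python
import math
import re
from collections import Counter
from typing import Callable, Dict, List, Set, Tuple


def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding'; a real setup would use an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(source: str, targets: Dict[str, str], k: int) -> List[str]:
    """Return the ids of the k target artifacts most similar to the source text."""
    query = embed(source)
    ranked = sorted(targets, key=lambda tid: cosine(query, embed(targets[tid])), reverse=True)
    return ranked[:k]


def recover_links(
    sources: Dict[str, str],
    targets: Dict[str, str],
    judge: Callable[[str, str], bool],
    k: int = 2,
) -> Set[Tuple[str, str]]:
    """Retrieve k candidates per source artifact, then keep the pairs the judge accepts."""
    links: Set[Tuple[str, str]] = set()
    for sid, stext in sources.items():
        for tid in retrieve(stext, targets, k):
            if judge(stext, targets[tid]):
                links.add((sid, tid))
    return links


def keyword_judge(source: str, target: str) -> bool:
    """Placeholder for an LLM call: accepts a pair that shares two or more tokens."""
    return len(embed(source).keys() & embed(target).keys()) >= 2


# Hypothetical example artifacts (not taken from the paper's datasets).
requirements = {
    "R1": "The system shall send an email notification to the user",
    "R2": "The parser shall read the input file",
}
code = {
    "EmailNotifier.java": "class EmailNotifier sends email notification",
    "Parser.java": "class Parser reads input file",
}

links = recover_links(requirements, code, keyword_judge, k=2)
print(sorted(links))  # [('R1', 'EmailNotifier.java'), ('R2', 'Parser.java')]
```

Passing the judge in as a callable keeps the retrieval step independent of the model that makes the final decision, so the placeholder can be swapped for a real LLM client without touching the rest of the sketch.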

Cite this paper

  1. LiSSA: Toward Generic Traceability Link Recovery through Retrieval-Augmented Generation
    Dominik Fuchß, Tobias Hey, Jan Keim, Haoyu Liu, Niklas Ewald, Tobias Thirolf, and Anne Koziolek
    In Proceedings of the IEEE/ACM 47th International Conference on Software Engineering, Ottawa, Canada, 2025
    @inproceedings{fuchss_lissa_2025,
      author = {Fuchß, Dominik and Hey, Tobias and Keim, Jan and Liu, Haoyu and Ewald, Niklas and Thirolf, Tobias and Koziolek, Anne},
      year = {2025},
      title = {LiSSA: Toward Generic Traceability Link Recovery through Retrieval-Augmented Generation},
      booktitle = {Proceedings of the IEEE/ACM 47th International Conference on Software Engineering},
      publisher = {{Institute of Electrical and Electronics Engineers (IEEE)}},
      location = {Ottawa, Canada},
      series = {ICSE '25},
    }