Login | Register

Towards LLM-Driven Code Generation: The Impact of Process Models and Non-Functional Requirements on Software Development

Title:

Towards LLM-Driven Code Generation: The Impact of Process Models and Non-Functional Requirements on Software Development

Lin, Feng ORCID: https://orcid.org/0009-0009-3887-7071 (2025) Towards LLM-Driven Code Generation: The Impact of Process Models and Non-Functional Requirements on Software Development. Masters thesis, Concordia University.

[thumbnail of LIN_MA_S2025.pdf]
Preview
Text (application/pdf)
LIN_MA_S2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.
1MB

Abstract

Research on LLM-based code generation is growing but often overlooks the impact of Software Engineering (SE) knowledge, such as software process models and non-functional requirements (NFRs). This thesis explores how integrating SE practices with LLMs can help bridge this gap.
We first propose FlowGen to explore the impact of software process models on code generation. We assign LLM agents to different development roles and study three models: FlowGenWaterfall, FlowGenTDD, and FlowGenScrum. We evaluate FlowGen using GPT-3.5, comparing it against several baselines on four code benchmarks (i.e., HumanEval, HumanEval-ET, MBPP, and MBPP-ET). Our results show that FlowGenScrum outperforms the other process models, achieving a 15% improve- ment in Pass@1 over RawGPT on average. Integrating a state-of-the-art technique (i.e., CodeT) further boosts Pass@1 scores. Our findings also show that development activities impact code qual- ity differently, and FlowGen enhances result stability across LLM versions and temperature settings.
Secondly, we investigate how variations in developer behavior affect how LLMs address NFRs (e.g., expressing the same NFR using different wording). Robust LLMs should generate consistent code despite such variations. We propose RoboNFR to evaluate LLM robustness in NFR-aware code generation across four key dimensions—code design, performance, readability, and reliabil- ity—using three methodologies: prompt variation, regression testing, and diverse workflows. Our experiments show that RoboNFR effectively reveals robustness issues in tested LLMs. Overall, across the three methodologies, incorporating NFRs tends to reduce Pass@1 scores while improv- ing NFR-specific metrics, but also increases the standard deviation in both correctness and quality.
This thesis highlights the significant influence of software process models and NFRs, empha- sizing the need for future work to incorporate such SE knowledge in the era of LLMs.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:Lin, Feng
Institution:Concordia University
Degree Name:M.A. Sc.
Program:Software Engineering
Date:May 2025
Thesis Supervisor(s):Chen, Tse-Hsun (Peter)
ID Code:995585
Deposited By: Feng Lin
Deposited On:04 Nov 2025 18:25
Last Modified:04 Nov 2025 18:25
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top