Saha, Soumit Kanti (2026) Specification-Driven Code Translation Powered by Large Language Models: How Far Are We? Masters thesis, Concordia University.
Preview |
Text (application/pdf)
633kBSaha_MA_S2026.pdf - Accepted Version Available under License Spectrum Terms of Access. |
Abstract
Large Language Models (LLMs) are increasingly being applied across various domains, including code-related tasks such as code translation. Previous studies have explored using LLMs for translating code between different programming languages. Since LLMs are more effective with natural language, using natural language as an intermediate representation in code translation tasks presents a promising approach. In this work, we investigate using NL-specification as an intermediate representation for code translation. We evaluate our method using three datasets, five popular programming languages, and 29 language pair permutations. Our results show that using NL-specification alone does not lead to performance improvements. However, when combined with source code, it provides a slight improvement over the baseline in certain language pairs. Besides analyzing the performance of code translation, we also investigate the quality of the translated code and provide insights into the issues present in the translated code.
| Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering |
|---|---|
| Item Type: | Thesis (Masters) |
| Authors: | Saha, Soumit Kanti |
| Institution: | Concordia University |
| Degree Name: | M. Sc. |
| Program: | Computer Science |
| Date: | 19 January 2026 |
| Thesis Supervisor(s): | Yang, Jinqiu |
| ID Code: | 996715 |
| Deposited By: | Soumit Kanti Saha |
| Deposited On: | 29 Jun 2026 14:59 |
| Last Modified: | 29 Jun 2026 14:59 |
Repository Staff Only: item control page


Download Statistics
Download Statistics