Is Neural Machine Translation Approach Accurate Enough for Coding Assistance?
Yuka Akinobu,
Momoka Obara, Teruno Kajiura, Shiho Takano, Miyu Tamura, Mayu Tomioka, and
Kimio Kuramitsu
(Japan Women's University, Japan)
Coding assistance with deep learning is an emerging concern that has recently attracted much attention in the software development community. To integrate coding assistance with deep learning compactly, we focus on neural machine translation (NMT), which allows users to translate natural language descriptions into expressions in a programming language such as Python. A rising problem here is the limited availability of parallel corpora, which is essential to train better NMT models.
To overcome the problem, we propose a transcompiler-based back-translation, a data augmentation method that generates parallel corpora from numerous source code repositories. In this paper, we present our initial experimental results by comparing several NMT models that are built upon the existing corpora and our corpora. The resulting BLEU indicates that our proposed model is accurate enough to allow coding assistance in the future.
@InProceedings{BCNC21p23,
author = {Yuka Akinobu and Momoka Obara and Teruno Kajiura and Shiho Takano and Miyu Tamura and Mayu Tomioka and Kimio Kuramitsu},
title = {Is Neural Machine Translation Approach Accurate Enough for Coding Assistance?},
booktitle = {Proc.\ BCNC},
publisher = {ACM},
pages = {23--28},
doi = {10.1145/3486949.3486966},
year = {2021},
}
Publisher's Version