An Empirical Comparison of Code Generation Approaches for Ansible
Abstract
The rapid proliferation of LLM-based programming assistants has enabled fast and accurate automatic code generation for general-purpose programming languages. Domain-specific languages (DSLs) such as Ansible, a DSL for IT automation, remain poorly supported despite their importance to many fields, owing to the limited amount of publicly available code for training models and a lack of attention from tool developers. To address this gap, we collect a novel dataset of permissively licensed Ansible code and use it to create Warp, a code LLM fine-tuned to produce Ansible tasks from natural-language prompts. We evaluate state-of-the-art LLM-based code generation approaches, comparing multiple common strategies, including fine-tuning base models on Ansible code and retrieval-augmented generation over documentation, in order to understand the challenges facing existing methods and to identify future research directions toward better code generation for DSLs.