A Confederacy of Models: A Comprehensive Evaluation of LLMs on Creative Writing

Conference poster

Open access

Paul Williams and Carlos Gómez-Rodríguez

UniSC Research Conference, 2024 (Sippy Downs, Australia, 23-Sep-2024–27-Sep-2024)

University of the Sunshine Coast

2024

Files and links (1)

pdf

Paul Williams Poster435.31 kBDownload View

Poster Open Access

Creative and professional writing

Artificial intelligence

Expanding knowledge in creative arts and writing studies

creative writing research methodologies

Artificial Intelligence

creative writing assessment

We evaluate a range of recent LLMs on English creative writing, a

challenging and complex task that requires imagination, coherence, and

style. We use a difficult, open-ended scenario chosen to avoid training

data reuse: an epic narration of a single combat between Ignatius J.

Reilly, the protagonist of the Pulitzer Prize-winning novel A Confederacy of

Dunces (1980), and a pterodactyl, a prehistoric flying reptile. We ask several

LLMs and humans to write such a story and conduct a human evalution

involving various criteria such as fluency, coherence, originality, humor, and

style. Our results show that some state-of-the-art commercial LLMs match

or slightly outperform our writers in most dimensions; whereas open-source

LLMs lag behind. Humans retain an edge in creativity, while humor shows a

binary divide between LLMs that can handle it comparably to humans and

those that fail at it. We discuss the implications and limitations of our study

and suggest directions for future research.

Title: A Confederacy of Models: A Comprehensive Evaluation of LLMs on Creative Writing
Authors: Paul Williams (Author) - University of the Sunshine Coast, Queensland, School of Business and Creative Industries
Carlos Gómez-Rodríguez (Author) - Universidade da Coruña
Conference details: UniSC Research Conference, 2024 (Sippy Downs, Australia, 23-Sep-2024–27-Sep-2024)
Publisher: University of the Sunshine Coast
Date published: 2024
Organisation Unit: School of Business and Creative Industries; Healthy Ageing Research Cluster; Indigenous and Transcultural Research Centre; Sustainability Research Cluster
Language: English
Record Identifier: 991066295302621
Output Type: Conference poster

22 File views/ downloads

19 Record Views