Unsupervised Dialogue Spectrum Generation for Log Dialogue Ranking

Published in Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019

Online Digital Library

Download PDF

Recommended citation: Xinnuo Xu, Yizhe Zhang, Lars Liden, Sungjin Lee, "Unsupervised Dialogue Spectrum Generation for Log Dialogue Ranking," In Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, Stockholm, Sweden, 2019, pp. 143-154.

Abstract

Although the data-driven approaches of some recent bot building platforms make it possible for a wide range of users to easily create dialogue systems, those platforms don’t offer tools for quickly identifying which log dialogues contain problems. This is important since corrections to log dialogues provide a means to improve performance after deployment. A log dialogue ranker, which ranks problematic dialogues higher, is an essential tool due to the sheer volume of log dialogues that could be generated. However, training a ranker typically requires labelling a substantial amount of data, which is not feasible for most users. In this paper, we present a novel unsupervised approach for dialogue ranking using GANs and release a corpus of labelled dialogues for evaluation and comparison with supervised methods. The evaluation result shows that our method compares favorably to supervised methods without any labelled data.