Memorizing and utilizing speakers' personas is a common practice for response generation in long-term conversations. Yet, human-authored datasets often provide uninformative persona sentences that hinder response quality. This paper presents a novel framework that leverages commonsense-based persona expansion to address such issues in long-term conversation.
While prior work focuses on not producing personas that contradict others, we focus on transforming contradictory personas into sentences that contain rich speaker information, by refining them based on their contextual backgrounds with designed strategies. As the pioneer of persona expansion in multi-session settings, our framework facilitates better response generation via human-like persona refinement.
Click here to see our paper.
While baselines yield short personas and inconsistent or unconstructive responses (green underlines), CAFFEINE offers informative personas (color red) and leads to a response that provides constructive suggestion for Person A’s Spanish learning.
While baselines yield short personas, CAFFEINE offers informative personas (color red) and leads to a response that reflects Person B’s situation and what B is looking for in a car.
Example of persona resolution and disambiguation done by CAFFEINE. Color magenta highlights the contextual cues from the relevant dialogue contexts.
Example of persona resolution and disambiguation done by CAFFEINE. Color magenta highlights the contextual cues from the relevant dialogue contexts.
Examples of "preservation" addressing the sub-optimal performance of NLI models that solely rely on the persona sentences without contextual backgrounds. Color magenta highlights the contextual cues.
Examples of "preservation" addressing the sub-optimal performance of NLI models that solely rely on the persona sentences without contextual backgrounds. Color magenta highlights the contextual cues.