Performance in Response Generation.

table

Bold and underlined values mark the best and second-best results in each column. * indicates a significant difference (p-value < 0.05) from “No Memory” (only the current dialogue context).