Mimicry (BERT)

High-Level Intuition

This feature measure how much the current utterance “mimics” the previous utterance in a conversation.

Citation

N/A; This is a novel measure that builds on related measures of mimicry/accommodation (see “Related Features”).

Implementation Basics

Using BERT’s Sentence Transfomers model (https://sbert.net/), utterances are represented as multidimensional embeddings. Stepping through each message in a conversation, this feature computes the cosine similarity of the current embedding and previous embedding to determine their degree of mimicry.

Implementation Notes/Caveats

Note that the first utterance in a conversation cannot have a mimicry score, as there is no “previous utterance” to associate it with. In this case, we assign a value of 0 to this utterance.

Interpreting the Feature

This feature generates a score between 0-1 for each utterance in a conversation, with scores closer to 0 representing a more original thought compared with the previous chat (lacking mimicry), while scores near 1 represent a higher degree of mimicry/similarity with the previous chat.

It’s important to note that this score doesn’t measure the overall mimicry of the conversation. As an utterance-level feature, it computes the mimicry only between the selected chat and the previous. If a particular message is only similar to chats exchanged before it’s direct previous chat, therefore, it won’t have a high mimicry score (see below). In the same vein, high mimicry score for an individual chat does not signal that a conversation overall employed high mimicry.

Output File
message	speaker	mimicry_bert
Hi, my name is Shruti!	Speaker A	0
Hey, my name is Nathaniel, but I go by Nate.	Speaker B	0.89
What’s the plan for today?	Speaker A	0.12
My name is Emily.	Speaker C	0.09