Contrasting the Automatic Identification of Two Discourse Markers in Multiparty Dialogues

doi:https://doi.org/10.7892/boris.78683

Contrasting the Automatic Identification of Two Discourse Markers in Multiparty Dialogues

Sandrine Zufferey,Andréi Popescu-Belis-2016-04-26-Open Access CRIS of the University of Bern

4

TL;DRAbstract

The identification of occurrences of like and well that serve as discourse markers (DMs) is a classification problem which is studied here on a corpus of dialogue transcripts with more than 4,000 occurrences of each item. Decision trees using item-specific lexical, prosodic, positional and sociolinguistic features are trained using the C4.5 method. The results demonstrate improvement over past experiments, reaching the same range as inter-annotator agreement scores. DM identification appears to benefit from itemspecific classifiers, which perform better than general purpose ones, thanks to the differentiated use of lexical features.

Chat with Paper

AI Agents for this Paper

The identification of occurrences of like and well that serve as discourse markers (DMs) is a classification problem which is studied here on a corpus of dialogue transcripts with more than 4,000 occurrences of each item. Decision trees using item-specific lexical, prosodic, positional and sociolinguistic features are trained using the C4.5 method. The results demonstrate improvement over past experiments, reaching the same range as inter-annotator agreement scores. DM identification appears to benefit from itemspecific classifiers, which perform better than general purpose ones, thanks to the differentiated use of lexical features.

Keywords

Identification (biology)Computer scienceNatural language processingArtificial intelligenceRange (aeronautics)Discourse markerLinguisticsEngineering

Chat

Click to start Chat