SeCoDa: Sense Complexity Dataset

David Strohmaier; Sian Gooding; Shiva Taslimipoor; Ekaterina Kochmar

SeCoDa: Sense Complexity Dataset

David Strohmaier, Sian Gooding, Shiva Taslimipoor & Ekaterina Kochmar

Proceedings of the 12Th Language Resources and Evaluation Conference (2020) Copy BIBT_EX

Abstract

The Sense Complexity Dataset (SeCoDa) provides a corpus that is annotated jointly for complexity and word senses. It thus provides a valuable resource for both word sense disambiguation and the task of complex word identification. The intention is that this dataset will be used to identify complexity at the level of word senses rather than word tokens. For word sense annotation SeCoDa uses a hierarchical scheme that is based on information available in the Cambridge Advanced Learner’s Dictionary. This way we can offer more coarse-grained senses than directly available in WordNet.

View on PhilPapers

Author's Profile

David Strohmaier

Cambridge University

Archival history

Archival date: 2023-02-13
View all versions

Keywords

Add keywords

Reprint years

Analytics

Added to PP
2020-06-15

Downloads
252 (#87,427)

6 months
74 (#83,229)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

SeCoDa: Sense Complexity Dataset

Abstract

Author's Profile

Archival history

Categories

Keywords

Reprint years

Analytics