SwedishLSdataset

This dataset is the first Lexical Simplification Dataset developed for Swedish as a part of a Bachelor's thesis in Cognitive Science at Linköping University. It contains 150 quadruples of complex words sourced from the Swedish Kelly list, their corpus frequencies in the "BloggMix odat" corpus, replacements to the complex word sourced from SynLex and their corresponding word frequencies in the BloggMix corpus, and an example sentence from SALDO where the complex word is found. The human assessment of each quadruple is also included in the dataset (regarding quality, coverage, and complexity).

Links

For a more detailed description of the work, please follow this link: http://liu.diva-portal.org/smash/get/diva2:1767273/FULLTEXT01.pdf.

For links to other repositories related to this thesis, please see the following links:

Lexical Simplification System for Swedish: https://github.com/emilgraichen/SwedishLexicalSimplifier

Complex Word Identification Dataset: https://github.com/emilgraichen/SwedishCWI

Structure of the Dataset

Links to the resources used for this dataset:

BloggMix Odat: https://spraakbanken.gu.se/resurser/bloggmix

Kelly Swedish: https://spraakbanken.gu.se/resurser/kelly

SynLex: http://folkets-lexikon.csc.kth.se/synlex.html

SALDO: https://spraakbanken.gu.se/resurser/saldoe

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
images		images
LICENSE		LICENSE
README.md		README.md
swedishLSdatasetAnnotated.csv		swedishLSdatasetAnnotated.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SwedishLSdataset

Links

Structure of the Dataset

Links to the resources used for this dataset:

About

Uh oh!

Releases

Packages

License

emilgraichen/SwedishLSdataset

Folders and files

Latest commit

History

Repository files navigation

SwedishLSdataset

Links

Structure of the Dataset

Links to the resources used for this dataset:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages