Skip to main content
Info meny
Aktuellt
FAQ
About us
Contact us
Sök
Plattformar
Data
Analyses
Research
Staff
Menu
Breadcrumb
Home
Language resources
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1397)
Collections (32)
Corpora (1236)
Lexicons (84)
Training and evaluation data (27)
Models (50)
Title
Free search
Language
- Any -
Swedish
Albanian
Arabic
Belarusian
Blissymbols
Bosnian
Bulgarian
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finland Swedish
Finnish
French
German
Icelandic
Iranian Persian
Italian
Kele (Papua New Guinea)
Kurdish
Latin
Latvian
Lower Sorbian
Macedonian
Modern Greek (1453-)
Multiple languages
Norwegian
Norwegian Bokmål
Old English (ca. 450-1100)
Old High German (ca. 750-1050)
Old Norse
Old Saxon
Polish
Portuguese
Romanian
Russian
Serbian
Slavomolisano
Slovak
Slovenian
Somali
Spanish
Turkish
Turkmen
Ukrainian
Upper Sorbian
Xhosa
Resurs
Typ
Språk
Åtkomst
Older Swedish novels
A collection of more than 50 older Swedish novels by 14 different authors
Corpus
Swedish
Dataset:
romg.xml.bz2
2017-03-17 – 60.15 MB – CC-BY-4.0
Word statistics:
stats_ROMG.txt.zip
2025-04-22 – 1.75 MB – CC-BY-4.0
Explore in:
OpenEDGeS
The public license subset of the EDGeS Diachronic Bible Corpus, a diachronically and synchronically parallel corpus of Bible translations in Dutch,English, German and Swedish, with texts from the 14th century until today.
Corpus
Swedish, English, German, Dutch
Dataset:
OpenEDGeS_v1.01.zip
2024-01-25 – 121.17 MB – CC-BY-NC-SA-4.0
Dataset:
OpenEDGeS_v1.0.0.zip
2024-01-25 – 72.89 MB – For license details of the previous versions, see the 'Read me.txt' file in the download.
Oral Copus for Reference of Contemporary Spanish
Corpus with transcriptions from recorded audio tapes from 1991 to 1992. Part of SOL - Spanish Online
Corpus
Spanish
Dataset:
cor92.xml.bz2
2017-11-10 – 2.33 MB – CC-BY-4.0
Explore in:
ORDAT
Yearbook of Svenska Dagbladet 1923–1958
Corpus
Swedish
Dataset:
ordat.xml.bz2
2017-05-16 – 28.07 MB – CC-BY-4.0
Word statistics:
stats_ORDAT.txt.zip
2025-04-22 – 1.43 MB – CC-BY-4.0
Explore in:
PAROLE
A corpus annotated with morphological and syntactic information
Corpus
Swedish
Dataset:
parole.xml.bz2
2017-05-17 – 425.19 MB – CC-BY-4.0
Dataset:
parole.zip
2024-01-25 – 67.62 MB – CC-BY-4.0
Word statistics:
stats_PAROLE.txt.zip
2025-04-22 – 8.11 MB – CC-BY-4.0
Explore in:
Parole
The Swedish PAROLE Lexicon - A language technology resource with access to syntactic information
Lexicon
Swedish
Dataset:
PAROLE_usyn_descr.txt
2012-03-27 – 913.17 KB – CC-BY-4.0
Parole+
The Swedish PAROLE Lexicon - A language technology resource with access to syntactic information, partially linked to SALDO senses
Lexicon
Swedish
Dataset:
parolelexplus.xml
2017-09-19 – 13.93 MB – CC-BY-4.0
Explore in:
PGV-PII
A small collection of 10 pairs of parallel texts in Swedish and English annotated with personal information categories.
Corpus
Swedish, English
Dataset:
gv-pii.bz2
2026-02-27 – 49.75 KB – CC-BY-4.0
Podiet
Articles from the consert magazine Podiet
Corpus
Swedish
Dataset:
podiet.xml.bz2
2026-01-16 – 20.05 MB – CC-BY-4.0
Word statistics:
stats_podiet.csv.zip
2025-04-22 – 1.4 MB – CC-BY-4.0
Explore in:
Poeter.se
Poetry from Poeter.se
Corpus
Swedish
Dataset:
poeter.xml.bz2
2026-01-16 – 3.81 GB – CC-BY-4.0
Word statistics:
stats_poeter.csv.zip
2026-01-17 – 19.71 MB – CC-BY-4.0
Explore in:
POS-tagging model: Flair
Pretrained models for POS-tagging.
Model
Swedish
Dataset:
flair_eval.zip
2020-06-18 – 1.37 GB – CC-BY-4.0
Dataset:
flair_full.zip
2020-06-18 – 1.37 GB – CC-BY-4.0
POS-tagging model: Marmot
Pretrained models for POS-tagging.
Model
Swedish
Dataset:
marmot_eval.marmot
2020-06-29 – 108.59 MB – CC-BY-4.0
Dataset:
marmot_full.marmot
2020-06-29 – 113.41 MB – CC-BY-4.0
Dataset:
saldo_marmot.txt
2020-06-29 – 46.33 MB – CC-BY-4.0
POS-tagging model: Stanza
Pretrained models for POS-tagging.
Model
Swedish
Dataset:
morph_stanza_eval.zip
2020-12-09 – 19.94 MB – CC-BY-4.0
Dataset:
morph_stanza_full2.zip
2020-12-09 – 20.19 MB – CC-BY-4.0
Dataset:
stanza_pretrain.zip
2025-02-20 – 91.7 MB – CC-BY-4.0
Preparatory work 1734
Material från lagkommissionen till 1734 års lag
Corpus
Swedish
Dataset:
forarbeten1734.xml.bz2
2014-12-08 – 9.11 MB – CC-BY-4.0
Word statistics:
stats_FORARBETEN1734.txt.zip
2025-04-22 – 750.13 KB – CC-BY-4.0
Explore in:
Collection
Press
Swedish press
Corpus
Swedish
See 6 collected resources
Explore in:
Press 65
Swedish press 1965
Corpus
Swedish
Dataset:
press65.xml.bz2
2017-03-14 – 20.88 MB – CC-BY-4.0
Word statistics:
stats_PRESS65.txt.zip
2025-04-22 – 1.34 MB – CC-BY-4.0
Explore in:
Press 76
Swedish press 1976
Corpus
Swedish
Dataset:
press76.xml.bz2
2017-03-17 – 24.45 MB – CC-BY-4.0
Word statistics:
stats_PRESS76.txt.zip
2025-04-22 – 1.51 MB – CC-BY-4.0
Explore in:
Press 95
Swedish press 1995
Corpus
Swedish
Dataset:
press95.xml.bz2
2017-03-15 – 139.65 MB – CC-BY-4.0
Word statistics:
stats_PRESS95.txt.zip
2025-04-22 – 4.03 MB – CC-BY-4.0
Explore in:
Press 96
Swedish press 1996
Corpus
Swedish
Dataset:
press96.xml.bz2
2017-03-15 – 117.54 MB – CC-BY-4.0
Word statistics:
stats_PRESS96.txt.zip
2025-04-22 – 3.82 MB – CC-BY-4.0
Explore in:
Press 97
Swedish press 1997
Corpus
Swedish
Dataset:
press97.xml.bz2
2017-03-17 – 241.09 MB – CC-BY-4.0
Word statistics:
stats_PRESS97.txt.zip
2025-04-22 – 6.08 MB – CC-BY-4.0
Explore in:
Press 98
Swedish press 1998
Corpus
Swedish
Dataset:
press98.xml.bz2
2017-03-17 – 187.08 MB – CC-BY-4.0
Word statistics:
stats_PRESS98.txt.zip
2025-04-22 – 5.17 MB – CC-BY-4.0
Explore in:
Pretrained embeddings
A list of pretrained embeddings for Swedish
Model
Swedish
Psalm book (1937)
The Swedish psalm book from 1937
Corpus
Swedish
Dataset:
psalmboken.xml.bz2
2017-05-18 – 1.72 MB – CC-BY-4.0
Word statistics:
stats_PSALMBOKEN.txt.zip
2025-04-22 – 124.94 KB – CC-BY-4.0
Explore in:
Questions and answers about the Swedish language
Counselling mails of the Language Council of Sweden
Corpus
Swedish
Explore in:
Collection
Riksdag of the Estates
Collection of textual documents from the Swedish Riksdag of the Estates
Corpus
Swedish
See 7 collected resources
Explore in:
Riksdag of the Estates: Adelsståndet
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-adelsstandet.xml.bz2
2024-06-17 – 852.82 MB – CC-BY-4.0
Word statistics:
stats_standsriksdagen-adelsstandet.csv.zip
2025-04-22 – 11.88 MB – CC-BY-4.0
Explore in:
Riksdag of the Estates: Bihang m.m.
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-bihang.xml.bz2
2024-06-18 – 841.09 MB – CC-BY-4.0
Word statistics:
stats_standsriksdagen-bihang.csv.zip
2025-04-22 – 14.05 MB – CC-BY-4.0
Explore in:
Riksdag of the Estates: Bondeståndet
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-bondestandet.xml.bz2
2024-06-18 – 411.81 MB – CC-BY-4.0
Word statistics:
stats_standsriksdagen-bondestandet.csv.zip
2025-04-22 – 8.65 MB – CC-BY-4.0
Explore in:
Riksdag of the Estates: Borgarståndet
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-borgarstandet.xml.bz2
2024-06-18 – 477.72 MB – CC-BY-4.0
Word statistics:
stats_standsriksdagen-borgarstandet.csv.zip
2025-04-22 – 7.98 MB – CC-BY-4.0
Explore in:
Riksdag of the Estates: Prästeståndet
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-prastestandet.xml.bz2
2024-06-19 – 422.82 MB – CC-BY-4.0
Word statistics:
stats_standsriksdagen-prastestandet.csv.zip
2025-04-22 – 7.28 MB – CC-BY-4.0
Explore in:
Riksdag of the Estates: Riksdagsakter
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-riksdagsakter.xml.bz2
2024-06-17 – 44.69 MB – CC-BY-4.0
Word statistics:
stats_standsriksdagen-riksdagsakter.csv.zip
2025-04-22 – 2.08 MB – CC-BY-4.0
Explore in:
Riksdag of the Estates: Riksdagsbeslut
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-riksdagsbeslut.xml.bz2
2024-06-18 – 3.44 MB – CC-BY-4.0
Word statistics:
stats_standsriksdagen-riksdagsbeslut.csv.zip
2025-04-22 – 569.92 KB – CC-BY-4.0
Explore in:
Collection
Riksdagens öppna data
Data from the Swedish parliament collected from data.riksdagen.se
Corpus
Swedish
See 21 collected resources
Explore in:
Riksdagens öppna data: Betänkande
Utskottens betänkanden och utlåtanden, inklusive rksdagens beslut, en sammanfattning av voteringsresultaten och Beslut i korthet
Corpus
Swedish
Dataset:
rd-bet.xml.bz2
2022-10-11 – 3.84 GB – CC-BY-4.0
Word statistics:
stats_rd-bet.csv.zip
2025-04-22 – 62.92 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Departementsserien
Utredningar från regeringens departement
Corpus
Swedish
Dataset:
rd-ds.xml.bz2
2022-09-06 – 928.31 MB – CC-BY-4.0
Word statistics:
stats_rd-ds.csv.zip
2025-04-22 – 10.94 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: EUN
Dokument från EU-nämnden, bland annat möteskallelser, föredragningslistor, protokoll och skriftliga samråd med regeringen
Corpus
Swedish
Dataset:
rd-eun.xml.bz2
2023-02-03 – 8.72 MB – CC-BY-4.0
Word statistics:
stats_rd-eun.csv.zip
2025-04-22 – 213.5 KB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Faktapromemoria
Regeringens faktapromemorior om EU-kommissionens förslag
Corpus
Swedish
Dataset:
rd-fpm.xml.bz2
2024-01-08 – 68.67 MB – CC-BY-4.0
Word statistics:
stats_rd-fpm.csv.zip
2025-04-22 – 1.27 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Framställning/redogörelse
Framställningar och redogörelser från organ som utsetts av riksdagen
Corpus
Swedish
Dataset:
rd-frsrdg.xml.bz2
2022-09-06 – 350.59 MB – CC-BY-4.0
Word statistics:
stats_rd-frsrdg.csv.zip
2025-04-22 – 4.78 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Föredragningslista
Föredragningslistor för kammarens sammanträden
Corpus
Swedish
Dataset:
rd-flista.xml.bz2
2023-02-03 – 11.77 MB – CC-BY-4.0
Word statistics:
stats_rd-flista.csv.zip
2025-04-22 – 368.55 KB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Interpellation
Interpellationer från ledamöterna till regeringen
Corpus
Swedish
Dataset:
rd-ip.xml.bz2
2022-09-06 – 521.92 MB – CC-BY-4.0
Word statistics:
stats_rd-ip.csv.zip
2025-04-22 – 5.27 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Kammaraktiviteter
Corpus
Swedish
Dataset:
rd-kammakt.xml.bz2
2023-02-06 – 129.07 MB – CC-BY-4.0
Word statistics:
stats_rd-kammakt.csv.zip
2025-04-22 – 1.74 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: KOM
EU-kommissionens förslag och redogörelser, så kallade KOM-dokument
Corpus
Swedish
Dataset:
rd-kom.xml.bz2
2024-01-08 – 621 MB – CC-BY-4.0
Word statistics:
stats_rd-kom.csv.zip
2025-04-22 – 6.85 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Motion
Motioner från riksdagens ledamöter
Corpus
Swedish
Dataset:
rd-mot.xml.bz2
2022-10-11 – 3.4 GB – CC-BY-4.0
Word statistics:
stats_rd-mot.csv.zip
2025-04-22 – 36.81 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Proposition
Propositioner och skrivelser från regeringen
Corpus
Swedish
Dataset:
rd-prop.xml.bz2
2022-10-12 – 6.98 GB – CC-BY-4.0
Word statistics:
stats_rd-prop.csv.zip
2025-04-22 – 94.58 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Protokoll
Protokoll från kammarens sammanträden
Corpus
Swedish
Dataset:
rd-prot.xml.bz2
2024-01-11 – 4.69 GB – CC-BY-4.0
Word statistics:
stats_rd-prot.csv.zip
2025-04-22 – 52.12 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Riksdagsskrivelse
Skrivelser från riksdagen till regeringen
Corpus
Swedish
Dataset:
rd-rskr.xml.bz2
2022-09-09 – 2.55 MB – CC-BY-4.0
Word statistics:
stats_rd-rskr.csv.zip
2025-04-22 – 127.7 KB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Sammanträden
Corpus
Swedish
Dataset:
rd-samtr.xml.bz2
2022-09-09 – 1.61 MB – CC-BY-4.0
Word statistics:
stats_rd-samtr.csv.zip
2025-04-22 – 140.89 KB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Skriftliga frågor
Skriftliga frågor från ledamöterna till regeringen och svaren på dessa
Corpus
Swedish
Dataset:
rd-skfr.xml.bz2
2022-09-09 – 320.97 MB – CC-BY-4.0
Word statistics:
stats_rd-skfr.csv.zip
2025-04-22 – 4.11 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Statens offentliga utredningar
Olika utredningars förslag till regeringen
Corpus
Swedish
Dataset:
rd-sou.xml.bz2
2024-01-11 – 5.09 GB – CC-BY-4.0
Word statistics:
stats_rd-sou.csv.zip
2025-04-22 – 28.16 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Talarlista
Talarlistor för kammarens sammanträden
Corpus
Swedish
Dataset:
rd-tlista.xml.bz2
2022-09-09 – 3.63 MB – CC-BY-4.0
Word statistics:
stats_rd-tlista.csv.zip
2025-04-22 – 177.68 KB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Utredningar
Kommittédirektiv och kommittéberättelser för utredningar som regeringen tillsätter
Corpus
Swedish
Dataset:
rd-utr.xml.bz2
2022-09-09 – 25.19 MB – CC-BY-4.0
Word statistics:
stats_rd-utr.csv.zip
2025-04-22 – 1.04 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Utskottsdokument
Dokument från utskotten, bland annat KU-anmälningar, protokoll, verksamhetsberättelser och den gamla dokumentserien Utredningar från riksdagen
Corpus
Swedish
Dataset:
rd-utsk.xml.bz2
2022-09-09 – 80.14 MB – CC-BY-4.0
Word statistics:
stats_rd-utsk.csv.zip
2025-04-22 – 1.37 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Yttrande
Utskottens yttranden
Corpus
Swedish
Dataset:
rd-yttr.xml.bz2
2024-01-10 – 190.75 MB – CC-BY-4.0
Word statistics:
stats_rd-yttr.csv.zip
2025-04-22 – 3.48 MB – CC-BY-4.0
Explore in:
Riksdagens öppna data: Övrigt
Dokumentserierna Riksrevisionens granskningsrapporter, Utredningar från Riksdagsförvaltningen och Rapporter från riksdagen samt planeringsdokument, bilagor till dokument och uttag ur riksdagens databaser och de gamla dokumentserierna Utredningar från riksdag
Corpus
Swedish
Dataset:
rd-ovr.xml.bz2
2022-09-08 – 417.6 MB – CC-BY-4.0
Word statistics:
stats_rd-ovr.csv.zip
2025-04-22 – 6.66 MB – CC-BY-4.0
Explore in:
Russian Constructicon
A Russian Constructicon
Lexicon
Russian, English
Dataset:
konstruktikon-rus.xml
2021-11-09 – 2.72 MB – CC-BY-4.0
Rösträtt för kvinnor
Annual volumes 1912–1918 of the journal Rösträtt för kvinnor
Corpus
Swedish
Dataset:
runeberg-rost.xml.bz2
2014-12-08 – 23.62 MB – CC-BY-4.0
Word statistics:
stats_RUNEBERG-ROST.txt.zip
2025-04-22 – 1.49 MB – CC-BY-4.0
Explore in:
SALDO
SALDO is an extensive lexicon resource for modern Swedish written language.
Lexicon
Swedish
Dataset:
saldo.xml
2017-09-19 – 70.98 MB – CC-BY-4.0
Explore in:
SALDO: examples
Example sentences for senses in SALDO
Lexicon
Swedish
Dataset:
saldoe.xml
2017-09-19 – 1.09 MB – CC-BY-4.0
Explore in:
SALDO's morphology
Semantic and morphological lexicon for language technology
Lexicon
Swedish
Dataset:
saldom.xml
2017-09-19 – 242.34 MB – CC-BY-4.0
Explore in:
SALT – Swedish-Dutch
Dutch-Swedish parallel corpus of 20th century fictional and nonfictional texts.
Corpus
Swedish, Dutch
Dataset:
saltnld-sv.xml.bz2
2016-05-03 – 18.23 MB – CC-BY-4.0
Dataset:
saltnld-nl.xml.bz2
2016-05-03 – 9.81 MB – CC-BY-4.0
Word statistics:
stats_SALTNLD-SV.txt.zip
2025-04-22 – 1.07 MB – CC-BY-4.0
Word statistics:
stats_SALTNLD-NL.txt.zip
2025-04-22 – 476.52 KB – CC-BY-4.0
Explore in:
SAOB1950
Scanned books from 1950 to 2007 that are used as source material for updating SAOB, with a selection that reflects the Swedish vocabulary during the 20th century.
Corpus
Swedish
Dataset:
saob-bocker.xml.bz2
2023-11-30 – 1006.14 MB – CC-BY-4.0
Word statistics:
stats_saob-bocker.csv.zip
2025-04-22 – 38.24 MB – CC-BY-4.0
Explore in:
SAOL 1 (1874) - faksimil
Lexicon
Swedish
Dataset:
saol1-faksimil_facsimiles.zip
2025-12-11 – 102.72 MB – CC-BY-4.0
Dataset:
saol1-faksimil.jsonl
2025-12-11 – 8.99 MB – CC-BY-4.0
Explore in:
SAOL 10 (1973) - faksimil
Lexicon
Swedish
Dataset:
saol10-faksimil_facsimiles.zip
2025-12-11 – 138.75 MB – CC-BY-4.0
Dataset:
saol10-faksimil.jsonl
2025-12-11 – 37.15 MB – CC-BY-4.0
Explore in:
SAOL 11 (1986) - faksimil
Lexicon
Swedish
Dataset:
saol11-faksimil_facsimiles.zip
2025-12-11 – 109.4 MB – CC-BY-4.0
Dataset:
saol11-faksimil.jsonl
2025-12-11 – 33.17 MB – CC-BY-4.0
Explore in:
SAOL 12 (1998) - faksimil
Lexicon
Swedish
Dataset:
saol12-faksimil_facsimiles.zip
2025-12-11 – 33.66 MB – CC-BY-4.0
Dataset:
saol12-faksimil.jsonl
2025-12-11 – 32.62 MB – CC-BY-4.0
Explore in:
SAOL 13 (2006) - faksimil
Lexicon
Swedish
Dataset:
saol13-faksimil_facsimiles.zip
2025-12-11 – 317.49 MB – CC-BY-4.0
Dataset:
saol13-faksimil.jsonl
2025-12-11 – 33.63 MB – CC-BY-4.0
Explore in:
SAOL 14 (2015) - faksimil
Lexicon
Swedish
Dataset:
saol14-faksimil_facsimiles.zip
2025-12-11 – 55.4 MB – CC-BY-4.0
Dataset:
saol14-faksimil.jsonl
2025-12-11 – 39.68 MB – CC-BY-4.0
Explore in:
SAOL 6 (1889) - faksimil
Lexicon
Swedish
Dataset:
saol6-faksimil_facsimiles.zip
2025-12-11 – 75.66 MB – CC-BY-4.0
Dataset:
saol6-faksimil.jsonl
2025-12-11 – 11.02 MB – CC-BY-4.0
Explore in:
SAOL 7 (1900) - faksimil
Lexicon
Swedish
Dataset:
saol7-faksimil_facsimiles.zip
2025-12-11 – 71.46 MB – CC-BY-4.0
Dataset:
saol7-faksimil.jsonl
2025-12-11 – 19.12 MB – CC-BY-4.0
Explore in:
SAOL 8 (1923) - faksimil
Lexicon
Swedish
Dataset:
saol8-faksimil_facsimiles.zip
2025-12-11 – 81.03 MB – CC-BY-4.0
Dataset:
saol8-faksimil.jsonl
2025-12-11 – 20.74 MB – CC-BY-4.0
Explore in:
SAOL 9 (1950) - faksimil
Lexicon
Swedish
Dataset:
saol9-faksimil_facsimiles.zip
2025-12-11 – 135.53 MB – CC-BY-4.0
Dataset:
saol9-faksimil.jsonl
2025-12-11 – 39.78 MB – CC-BY-4.0
Explore in:
SAOLhist Plus (v. 1.0)
SAOLhist Plus is a lexical resource containing Swedish lemmatized diachronical vocabulary data for the period 1874–2015.
Lexicon
Swedish
Dataset:
shplus-v01_0.tbz2
2025-06-03 – 252.11 MB – CC-BY-4.0
sbx/KB-bert-base-swedish-cased_PI-detection-basic
En modell baserad på KB/bert-base-swedish-cased tränad med syfte att upptäcka personliga uppgifter, särskilt i studentuppsatser.
Model
Swedish
Dataset:
KB-bert-base-swedish-cased_PI-detection-basic
113.22 KB – GPL-3.0
sbx/KB-bert-base-swedish-cased_PI-detection-basic-iob
En modell baserad på KB/bert-base-swedish-cased tränad med syfte att upptäcka personliga uppgifter, särskilt i studentuppsatser.
Model
Swedish
Dataset:
KB-bert-base-swedish-cased_PI-detection-basic-iob
113.4 KB – GPL-3.0
sbx/KB-bert-base-swedish-cased_PI-detection-detailed
En modell baserad på KB/bert-base-swedish-cased tränad med syfte att upptäcka personliga uppgifter, särskilt i studentuppsatser.
Model
Swedish
Dataset:
KB-bert-base-swedish-cased_PI-detection-detailed
113.68 KB – GPL-3.0
sbx/KB-bert-base-swedish-cased_PI-detection-detailed-iob
En modell baserad på KB/bert-base-swedish-cased tränad med syfte att upptäcka personliga uppgifter, särskilt i studentuppsatser.
Model
Swedish
Dataset:
KB-bert-base-swedish-cased_PI-detection-detailed-iob
113.92 KB – GPL-3.0
sbx/KB-bert-base-swedish-cased_PI-detection-general
En modell baserad på KB/bert-base-swedish-cased tränad med syfte att upptäcka personliga uppgifter, särskilt i studentuppsatser.
Model
Swedish
Dataset:
KB-bert-base-swedish-cased_PI-detection-general
113.4 KB – GPL-3.0
sbx/KB-bert-base-swedish-cased_PI-detection-general-iob
En modell baserad på KB/bert-base-swedish-cased tränad med syfte att upptäcka personliga uppgifter, särskilt i studentuppsatser.
Model
Swedish
Dataset:
KB-bert-base-swedish-cased_PI-detection-general-iob
113.73 KB – GPL-3.0
ScandiSent
Sentiment Corpus for Swedish, Norwegian, Danish, Finnish and English crawled from trustpilot.
Corpus
Swedish, Norwegian Bokmål, Danish, English, Finnish
Dataset:
ScandiSent.zip
2024-01-25 – 5.16 MB – CC-BY-4.0
Dataset:
ScandiSent-mt.zip
2024-01-25 – 3.62 MB – CC-BY-4.0
Schlyter
Dictionary of Old Swedish
Lexicon
Swedish
Dataset:
schlyter.xml
2017-09-19 – 6.87 MB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Budgets
Corpus
Swedish
Dataset:
segreg-gbg-budgetar.xml.bz2
2025-11-05 – 36.29 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-budgetar.csv.zip
2025-11-05 – 930.46 KB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Committees
Corpus
Swedish
Dataset:
segreg-gbg-namnder.xml.bz2
2025-11-05 – 7.54 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-namnder.csv.zip
2025-11-05 – 371.55 KB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Interpellations
Corpus
Swedish
Dataset:
segreg-gbg-interpellationer.xml.bz2
2025-11-05 – 1.99 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-interpellationer.csv.zip
2025-11-05 – 151.72 KB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Motions
Corpus
Swedish
Dataset:
segreg-gbg-motioner.xml.bz2
2025-11-05 – 3.36 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-motioner.csv.zip
2025-11-05 – 216.6 KB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Offices/Administrations
Corpus
Swedish
Dataset:
segreg-gbg-kontor.xml.bz2
2025-11-05 – 15.56 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-kontor.csv.zip
2025-11-05 – 556.41 KB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Opinions
Corpus
Swedish
Dataset:
segreg-gbg-yttranden.xml.bz2
2025-11-05 – 4.35 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-yttranden.csv.zip
2025-11-05 – 247.56 KB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Petitions
Corpus
Swedish
Dataset:
segreg-gbg-yrkanden.xml.bz2
2025-11-05 – 4.55 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-yrkanden.csv.zip
2025-11-05 – 290.74 KB – CC-BY-4.0
Explore in:
Segregation texts: Gothenburg city: Reports
Corpus
Swedish
Dataset:
segreg-gbg-rapporter.xml.bz2
2025-11-05 – 13.75 MB – CC-BY-4.0
Word statistics:
stats_segreg-gbg-rapporter.csv.zip
2025-11-05 – 565.02 KB – CC-BY-4.0
Explore in:
Segregation texts: Media: Municipal newsletter
Corpus
Swedish
Dataset:
segreg-media-vartgoteborg.xml.bz2
2025-11-06 – 1.2 MB – CC-BY-4.0
Word statistics:
stats_segreg-media-vartgoteborg.csv.zip
2025-11-06 – 120.69 KB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Activities in the Chamber
Corpus
Swedish
Dataset:
segreg-rd-kammakt.xml.bz2
2025-03-07 – 39.94 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-kammakt.csv.zip
2025-03-06 – 833.54 KB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Committee on EU Affairs
Documents from the Committee on EU Affairs
Corpus
Swedish
Dataset:
segreg-rd-eun.xml.bz2
2025-03-07 – 58.53 KB – CC-BY-4.0
Word statistics:
stats_segreg-rd-eun.csv.zip
2025-03-06 – 12.52 KB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Committee reports and statements
Utskottens betänkanden och utlåtanden, inklusive riksdagens beslut, en sammanfattning av voteringsresultaten och Beslut i korthet
Corpus
Swedish
Dataset:
segreg-rd-bet.xml.bz2
2025-03-07 – 504.76 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-bet.csv.zip
2025-03-06 – 3.7 MB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Documents from Committees
Dokument från utskotten, bland annat KU-anmälningar, protokoll, verksamhetsberättelser och den gamla dokumentserien Utredningar från riksdagen
Corpus
Swedish
Dataset:
segreg-rd-utsk.xml.bz2
2025-03-07 – 1.11 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-utsk.csv.zip
2025-03-06 – 117.28 KB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: EU initiatives
EU initiatives are documents from the European Commission, “COM documents”.
Corpus
Swedish
Dataset:
segreg-rd-kom.xml.bz2
2025-03-07 – 13.62 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-kom.csv.zip
2025-03-06 – 579.99 KB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Explanatory memorandums on EU proposals
Regeringens faktapromemorior om EU-kommissionens förslag
Corpus
Swedish
Dataset:
segreg-rd-fpm.xml.bz2
2025-03-07 – 388.78 KB – CC-BY-4.0
Word statistics:
stats_segreg-rd-fpm.csv.zip
2025-03-06 – 54.98 KB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Government bills
Propositioner och skrivelser från regeringen
Corpus
Swedish
Dataset:
segreg-rd-prop.xml.bz2
2025-03-07 – 641.52 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-prop.csv.zip
2025-03-06 – 6.69 MB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Interpellations
Interpellations from members of the Riksdag to the government
Corpus
Swedish
Dataset:
segreg-rd-ip.xml.bz2
2025-03-07 – 18.87 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-ip.csv.zip
2025-03-06 – 526.53 KB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Ministry Publications Series
Utredningar från regeringens departement
Corpus
Swedish
Dataset:
segreg-rd-ds.xml.bz2
2025-03-07 – 114.52 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-ds.csv.zip
2025-03-06 – 2.71 MB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Motions
Motions from the members of the Riksdag
Corpus
Swedish
Dataset:
segreg-rd-mot.xml.bz2
2025-03-07 – 343.2 MB – CC-BY-4.0
Word statistics:
stats_segreg-rd-mot.csv.zip
2025-03-06 – 3.17 MB – CC-BY-4.0
Explore in:
Segregation texts: Riksdag's open data: Order papers
Föredragningslistor för kammarens sammanträden
Corpus
Swedish
Dataset:
segreg-rd-flista.xml.bz2
2025-03-07 – 71.12 KB – CC-BY-4.0
Word statistics:
stats_segreg-rd-flista.csv.zip
2025-03-06 – 15.54 KB – CC-BY-4.0
Explore in:
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
Page
10
Page
11
Page
12
Page
13
Page
14
Next page
Next ›
Last page
Last »
Plattformar
Hur vi arbetar
Data
Analyses
Research
Publications
Doktorandutbildning
For PhD students and supervisors
Research meetings
Staff
Aktuellt
Calendar
Conferences and workshops
Autumn Workshop
Höstworkshop 2025
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
FAQ
About us
Organisation
Språkbanken 50 years
Celebration
A brief history
How to cite
Cookies
Internal
Contact us
Help desk
Sök