Skip to main content
Info meny
Aktuellt
FAQ
About us
Contact us
Sök
Plattformar
Data
Analyses
Research
Staff
Menu
Breadcrumb
Home
Language resources
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1397)
Collections (32)
Corpora (1236)
Lexicons (84)
Training and evaluation data (27)
Models (50)
Title
Free search
Language
- Any -
Swedish
Albanian
Arabic
Belarusian
Blissymbols
Bosnian
Bulgarian
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finland Swedish
Finnish
French
German
Icelandic
Iranian Persian
Italian
Kele (Papua New Guinea)
Kurdish
Latin
Latvian
Lower Sorbian
Macedonian
Modern Greek (1453-)
Multiple languages
Norwegian
Norwegian Bokmål
Old English (ca. 450-1100)
Old High German (ca. 750-1050)
Old Norse
Old Saxon
Polish
Portuguese
Romanian
Russian
Serbian
Slavomolisano
Slovak
Slovenian
Somali
Spanish
Turkish
Turkmen
Ukrainian
Upper Sorbian
Xhosa
Resurs
Typ
Språk
Åtkomst
8 Sidor
News articles from 8 SIDOR.
Corpus
Swedish
Dataset:
attasidor.xml.bz2
2025-12-05 – 170.95 MB – CC-BY-4.0
Word statistics:
stats_attasidor.csv.zip
2025-12-05 – 1.03 MB – CC-BY-4.0
Explore in:
Academic texts: Humanities
A corpus with academic texts
Corpus
Swedish
Dataset:
sweachum.xml.bz2
2017-05-19 – 208.67 MB – CC-BY-4.0
Word statistics:
stats_SWEACHUM.txt.zip
2025-04-22 – 5.43 MB – CC-BY-4.0
Explore in:
Academic texts: Social science
A corpus with academic texts
Corpus
Swedish
Dataset:
sweacsam.xml.bz2
2017-06-07 – 157.41 MB – CC-BY-4.0
Word statistics:
stats_SWEACSAM.txt.zip
2025-04-22 – 3.9 MB – CC-BY-4.0
Explore in:
Academic wordlist
Academic wordlist
Lexicon
Swedish
Dataset:
ao.xml
2017-09-13 – 265.72 KB – CC-BY-4.0
Explore in:
Af Soomaali 1993-94
Corpus
Somali
Dataset:
somali-1993-94.xml.bz2
2024-01-04 – 19.45 KB – CC-BY-4.0
Explore in:
Af-Soomaali 2016 Somaliland
Corpus
Somali
Dataset:
somali-as-2016.xml.bz2
2024-01-04 – 109.54 KB – CC-BY-4.0
Explore in:
Aftonbladet 1830's
Part of the collection Kubhist2
Corpus
Swedish
Dataset:
kubhist2-aftonbladet-1830.xml.bz2
2024-01-14 – 1.02 GB – CC-BY-4.0
Word statistics:
stats_kubhist2-aftonbladet-1830.csv.zip
2025-04-22 – 15.94 MB – CC-BY-4.0
Explore in:
Agriculture
Agricultural manuals: "Engelska Åker-Mannen" and "En Grundelig Kundskap Om Swenska Åkerbruket"
Corpus
Swedish
Dataset:
akerbruk.xml.bz2
2015-05-19 – 898.54 KB – CC-BY-4.0
Word statistics:
stats_AKERBRUK.txt.zip
2025-04-22 – 160.14 KB – CC-BY-4.0
Explore in:
Akademiliv
Corpus
Swedish
Dataset:
akademiliv.xml.bz2
2025-09-09 – 46.39 MB – CC-BY-4.0
Word statistics:
stats_akademiliv.csv.zip
2025-09-09 – 1.4 MB – CC-BY-4.0
Explore in:
Akademiliv (English)
Akademiliv is the Sahlgrenska Academy’s staff magazine online. This corpus contains the English versions of the articles.
Corpus
English
Dataset:
akademiliv-eng.xml.bz2
2025-09-11 – 8.59 MB – CC-BY-4.0
Word statistics:
stats_akademiliv-eng.csv.zip
2025-09-11 – 412 KB – CC-BY-4.0
Explore in:
Argumentation sentences 1.0
A translated corpus for classifying sentence stance in relation to a topic.
Corpus
Swedish
Dataset:
argumentation-sentences.zip
2023-03-30 – 827.04 KB – CC-BY-4.0
Collection
ASPAC
The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Belarusian, Bulgarian, Czech, German, Lower Sorbian, Modern Greek (1453-), English, Spanish, French, Croatian, Upper Sorbian, Latin, Macedonian, Dutch, Polish, Portuguese, Romanian, Russian, Kele (Papua New Guinea), Slovak, Slovenian, Serbian, Slavomolisano, Turkmen, Ukrainian
See 27 collected resources
Explore in:
ASPAC: Swedish
The Swedish part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish
Dataset:
aspacsv.xml.bz2
2021-07-08 – 14.28 MB – CC-BY-4.0
Word statistics:
stats_aspacsv.csv.zip
2025-04-22 – 744.87 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Belarussian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Belarusian
Dataset:
aspacsvbe-sv.xml.bz2
2016-11-03 – 2.33 MB – CC-BY-4.0
Dataset:
aspacsvbe-be.xml.bz2
2016-11-03 – 772.78 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVBE-SV.txt.zip
2025-04-22 – 208.66 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVBE-BE.txt.zip
2025-04-22 – 159.59 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Bulgarian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Bulgarian
Dataset:
aspacsvbg-sv.xml.bz2
2016-11-02 – 4.08 MB – CC-BY-4.0
Dataset:
aspacsvbg-bg.xml.bz2
2016-11-02 – 1.83 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVBG-SV.txt.zip
2025-04-22 – 333.21 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVBG-BG.txt.zip
2025-04-22 – 223.75 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Croatian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Croatian
Dataset:
aspacsvhr-sv.xml.bz2
2016-11-02 – 6.08 MB – CC-BY-4.0
Dataset:
aspacsvhr-hr.xml.bz2
2016-11-03 – 1.88 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVHR-SV.txt.zip
2025-04-22 – 487.15 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVHR-HR.txt.zip
2025-04-22 – 263.42 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Czech
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Czech
Dataset:
aspacsvcs-sv.xml.bz2
2016-11-03 – 9.03 MB – CC-BY-4.0
Dataset:
aspacsvcs-cs.xml.bz2
2016-11-03 – 2.68 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVCS-SV.txt.zip
2025-04-22 – 600.87 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVCS-CS.txt.zip
2025-04-22 – 386.15 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Dutch
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Dutch
Dataset:
aspacsvnl-sv.xml.bz2
2016-11-02 – 9.03 MB – CC-BY-4.0
Dataset:
aspacsvnl-nl.xml.bz2
2016-11-03 – 4.02 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVNL-SV.txt.zip
2025-04-22 – 601.03 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVNL-NL.txt.zip
2025-04-22 – 244.56 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-English
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, English
Dataset:
aspacsven-sv.xml.bz2
2016-11-25 – 9.1 MB – CC-BY-4.0
Dataset:
aspacsven-en.xml.bz2
2016-11-25 – 3.87 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVEN-SV.txt.zip
2025-04-22 – 600.92 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVEN-EN.txt.zip
2025-04-22 – 217.15 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-French
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, French
Dataset:
aspacsvfr-sv.xml.bz2
2016-11-25 – 1.95 MB – CC-BY-4.0
Dataset:
aspacsvfr-fr.xml.bz2
2016-11-25 – 1008.92 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVFR-SV.txt.zip
2025-04-22 – 212.9 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVFR-FR.txt.zip
2025-04-22 – 98.46 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-German
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, German
Dataset:
aspacsvde-sv.xml.bz2
2016-10-31 – 9.07 MB – CC-BY-4.0
Dataset:
aspacsvde-de.xml.bz2
2016-10-31 – 4.64 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVDE-SV.txt.zip
2025-04-22 – 600.6 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVDE-DE.txt.zip
2025-04-22 – 417.33 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Greek
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Modern Greek (1453-), Swedish
Dataset:
aspacsvel-sv.xml.bz2
2016-11-02 – 1.94 MB – CC-BY-4.0
Dataset:
aspacsvel-el.xml.bz2
2016-11-03 – 570.94 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVEL-SV.txt.zip
2025-04-22 – 212.82 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVEL-EL.txt.zip
2025-04-22 – 103.08 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Italian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Italian
Dataset:
aspacsvit-sv.xml.bz2
2016-11-25 – 519.56 KB – CC-BY-4.0
Dataset:
aspacsvit-it.xml.bz2
2016-11-25 – 249.64 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVIT-SV.txt.zip
2025-04-22 – 71.95 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVIT-IT.txt.zip
2025-04-22 – 38.03 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Latin
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Latin
Dataset:
aspacsvla-sv.xml.bz2
2016-11-03 – 792.29 KB – CC-BY-4.0
Dataset:
aspacsvla-la.xml.bz2
2016-11-03 – 372.16 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVLA-SV.txt.zip
2025-04-22 – 91.89 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVLA-LA.txt.zip
2025-04-22 – 70.25 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Lower Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Lower Sorbian
Dataset:
aspacsvdsb-sv.xml.bz2
2016-11-03 – 195.53 KB – CC-BY-4.0
Dataset:
aspacsvdsb-dsb.xml.bz2
2016-11-03 – 72.76 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVDSB-SV.txt.zip
2025-04-22 – 37.25 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVDSB-DSB.txt.zip
2025-04-22 – 19.67 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Macedonian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Macedonian
Dataset:
aspacsvmk-sv.xml.bz2
2016-11-02 – 3.76 MB – CC-BY-4.0
Dataset:
aspacsvmk-mk.xml.bz2
2016-11-03 – 1.06 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVMK-SV.txt.zip
2025-04-22 – 320.57 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVMK-MK.txt.zip
2025-04-22 – 145.92 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Molise Slavik
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Slavomolisano, Swedish
Dataset:
aspacsvsvm-sv.xml.bz2
2016-11-03 – 194.99 KB – CC-BY-4.0
Dataset:
aspacsvsvm-svm.xml.bz2
2016-11-03 – 63.89 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSVM-SV.txt.zip
2025-04-22 – 37.34 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSVM-SVM.txt.zip
2025-04-22 – 13.57 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Polish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Polish
Dataset:
aspacsvpl-sv.xml.bz2
2016-11-02 – 9.04 MB – CC-BY-4.0
Dataset:
aspacsvpl-pl.xml.bz2
2016-11-02 – 4.44 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVPL-SV.txt.zip
2025-04-22 – 601.08 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVPL-PL.txt.zip
2025-04-22 – 622.81 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Portuguese
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Portuguese
Dataset:
aspacsvpt-sv.xml.bz2
2016-11-25 – 1.55 MB – CC-BY-4.0
Dataset:
aspacsvpt-pt.xml.bz2
2016-11-03 – 770.36 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVPT-SV.txt.zip
2025-04-22 – 162.99 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVPT-PT.txt.zip
2025-04-22 – 78.35 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Romanian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Romanian
Dataset:
aspacsvro-sv.xml.bz2
2016-11-03 – 517.08 KB – CC-BY-4.0
Dataset:
aspacsvro-ro.xml.bz2
2016-11-02 – 276.74 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVRO-SV.txt.zip
2025-04-22 – 72.07 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVRO-RO.txt.zip
2025-04-22 – 48.05 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Russian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Russian
Dataset:
aspacsvru-sv.xml.bz2
2016-11-28 – 9.08 MB – CC-BY-4.0
Dataset:
aspacsvru-ru.xml.bz2
2016-11-28 – 4.41 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVRU-SV.txt.zip
2025-04-22 – 600.94 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVRU-RU.txt.zip
2025-04-22 – 606.22 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Serbian (cyrillic)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Serbian, Swedish
Dataset:
aspacsvsbc-sv.xml.bz2
2016-11-03 – 3.47 MB – CC-BY-4.0
Dataset:
aspacsvsbc-sbc.xml.bz2
2016-11-03 – 1006.26 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSBC-SV.txt.zip
2025-04-22 – 261.57 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSBC-SBC.txt.zip
2025-04-22 – 158.03 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Serbian (latin)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Serbian
Dataset:
aspacsvsr-sv.xml.bz2
2016-11-03 – 3.11 MB – CC-BY-4.0
Dataset:
aspacsvsr-sr.xml.bz2
2016-11-03 – 956.03 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSR-SV.txt.zip
2025-04-22 – 290.19 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSR-SR.txt.zip
2025-04-22 – 157.76 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Slovak
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Slovak
Dataset:
aspacsvsk-sv.xml.bz2
2016-11-02 – 3.41 MB – CC-BY-4.0
Dataset:
aspacsvsk-sk.xml.bz2
2016-11-03 – 1.56 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVSK-SV.txt.zip
2025-04-22 – 303.42 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSK-SK.txt.zip
2025-04-22 – 224.29 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Slovene
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Slovenian
Dataset:
aspacsvsl-sv.xml.bz2
2016-11-03 – 3.44 MB – CC-BY-4.0
Dataset:
aspacsvsl-sl.xml.bz2
2016-11-02 – 1.69 MB – CC-BY-4.0
Word statistics:
stats_ASPACSVSL-SV.txt.zip
2025-04-22 – 303.42 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVSL-SL.txt.zip
2025-04-22 – 221.03 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Spanish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Spanish
Dataset:
aspacsves-sv.xml.bz2
2016-11-03 – 325.61 KB – CC-BY-4.0
Dataset:
aspacsves-es.xml.bz2
2016-11-03 – 145.87 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVES-SV.txt.zip
2025-04-22 – 48.3 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVES-ES.txt.zip
2025-04-22 – 22.14 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Turkmen
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Turkmen
Dataset:
aspacsvtk-sv.xml.bz2
2016-11-02 – 196.79 KB – CC-BY-4.0
Dataset:
aspacsvtk-tk.xml.bz2
2016-11-03 – 61.13 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVTK-SV.txt.zip
2025-04-22 – 37.29 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVTK-TK.txt.zip
2025-04-22 – 22.49 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Ukrainian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Ukrainian
Dataset:
aspacsvuk-sv.xml.bz2
2016-11-02 – 2.67 MB – CC-BY-4.0
Dataset:
aspacsvuk-uk.xml.bz2
2016-11-03 – 869.41 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVUK-SV.txt.zip
2025-04-22 – 227.62 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVUK-UK.txt.zip
2025-04-22 – 187.91 KB – CC-BY-4.0
Explore in:
ASPAC: Swedish-Upper Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Upper Sorbian
Dataset:
aspacsvhsb-sv.xml.bz2
2016-11-03 – 476.11 KB – CC-BY-4.0
Dataset:
aspacsvhsb-hsb.xml.bz2
2016-11-03 – 162.79 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVHSB-SV.txt.zip
2025-04-22 – 63.76 KB – CC-BY-4.0
Word statistics:
stats_ASPACSVHSB-HSB.txt.zip
2025-04-22 – 35.7 KB – CC-BY-4.0
Explore in:
ASU
Structural development of the second language
Corpus
Swedish
Word statistics:
stats_asu.csv.zip
2025-04-22 – 411.64 KB – CC-BY-4.0
Explore in:
August Strindberg's letters
Part of the collected works of August Strindberg
Corpus
Swedish
Dataset:
strindbergbrev.xml.bz2
2017-04-26 – 20.39 MB – CC-BY-4.0
Word statistics:
stats_STRINDBERGBREV.txt.zip
2025-04-22 – 1.12 MB – CC-BY-4.0
Explore in:
August Strindberg's novels
Part of the collected works of August Strindberg
Corpus
Swedish
Dataset:
strindbergromaner.xml.bz2
2017-06-20 – 63.43 MB – CC-BY-4.0
Word statistics:
stats_STRINDBERGROMANER.txt.zip
2025-04-22 – 2.26 MB – CC-BY-4.0
Explore in:
Aventinus
Drug related terminology
Lexicon
Swedish
Dataset:
aventinus.zip
2023-06-12 – 339.93 KB – CC-BY-4.0
Bellman
Collected works of C.M. Bellman
Corpus
Swedish
Dataset:
bellman.xml.bz2
2015-11-09 – 4.83 MB – CC-BY-4.0
Word statistics:
stats_BELLMAN.txt.zip
2025-04-22 – 489.31 KB – CC-BY-4.0
Explore in:
Betänkande ang. läroböcker (1882)
A report from 1882, digitized by the Gothenburg University Library
Corpus
Swedish
Dataset:
betankande.xml.bz2
2015-12-11 – 403.44 KB – CC-BY-4.0
Word statistics:
stats_BETANKANDE.txt.zip
2025-04-22 – 71.87 KB – CC-BY-4.0
Explore in:
Biblioteksbladet
The earliest volumes of "Biblioteksbladet: Organ för Sveriges allmänna biblioteksförening" from 1916–1940, digitized by Project Runeberg
Corpus
Swedish
Dataset:
runeberg-biblblad.xml.bz2
2015-05-19 – 52.49 MB – CC-BY-4.0
Word statistics:
stats_RUNEBERG-BIBLBLAD.txt.zip
2025-04-22 – 2.79 MB – CC-BY-4.0
Explore in:
Collection
Bicameral Riksdag
Collection of textual documents from the Swedish bicameral parliament data
Corpus
Swedish
See 10 collected resources
Explore in:
Bicameral riksdag: Government official investigations
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-utredningar-kombet-sou.xml.bz2
2023-12-12 – 986.58 MB – CC-BY-4.0
Word statistics:
stats_tkr-utredningar-kombet-sou.csv.zip
2025-04-22 – 24.94 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Letters of the Riksdag
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-rskr.xml.bz2
2023-12-11 – 476.4 MB – CC-BY-4.0
Word statistics:
stats_tkr-rskr.csv.zip
2025-04-22 – 12.45 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Motions
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-motioner.xml.bz2
2023-12-11 – 1.42 GB – CC-BY-4.0
Word statistics:
stats_tkr-motioner.csv.zip
2025-04-22 – 34.01 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Narratives and accounts
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-berattelser-redogorelser-frsrdg.xml.bz2
2023-12-12 – 1 GB – CC-BY-4.0
Word statistics:
stats_tkr-berattelser-redogorelser-frsrdg.csv.zip
2025-04-22 – 29.67 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Propositions and letters
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-propositioner-skrivelser.xml.bz2
2023-12-12 – 5.94 GB – CC-BY-4.0
Word statistics:
stats_tkr-propositioner-skrivelser.csv.zip
2025-04-22 – 77.09 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Protocols
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-protokoll.xml.bz2
2023-12-12 – 6.08 GB – CC-BY-4.0
Word statistics:
stats_tkr-protokoll.csv.zip
2025-04-22 – 67.83 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Register
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-register.xml.bz2
2023-12-11 – 285.18 MB – CC-BY-4.0
Word statistics:
stats_tkr-register.csv.zip
2025-04-22 – 10 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Regulations
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-reglementen-sfs.xml.bz2
2023-12-11 – 43.12 MB – CC-BY-4.0
Word statistics:
stats_tkr-reglementen-sfs.csv.zip
2025-04-22 – 2.08 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: Reports, memorandums and opinions
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-bet-mem-utl.xml.bz2
2023-12-11 – 3.59 GB – CC-BY-4.0
Word statistics:
stats_tkr-bet-mem-utl.csv.zip
2025-04-22 – 55.09 MB – CC-BY-4.0
Explore in:
Bicameral riksdag: The constitution of the Riksdag
Part of the data set "Bicameral Riksdag"
Corpus
Swedish
Dataset:
tkr-riksdagens-forfattningssamling-rfs.xml.bz2
2023-12-11 – 1.56 MB – CC-BY-4.0
Word statistics:
stats_tkr-riksdagens-forfattningssamling-rfs.csv.zip
2025-04-22 – 133.15 KB – CC-BY-4.0
Explore in:
Blingbring
Blingbring, an enhanced and modernized version of Bring's thesaurus (1930)
Lexicon
Swedish
Dataset:
blingbring.txt
2017-09-20 – 7.52 MB – CC-BY-4.0
Dataset:
blingbring.xml
2021-11-09 – 42.68 MB – CC-BY-4.0
Explore in:
Bliss
Blissymbolics is a constructed symbol language which is mainly used by people with severe communicative and phsyical disabilities. It consists of around 5000 graphical symbols.
Lexicon
Blissymbols
Dataset:
bliss.xml
2017-09-13 – 2.73 MB – CC-BY-4.0
Collection
Blog mix
Material from a selection of Swedish blogs. Regularly updated.
Corpus
Swedish
See 21 collected resources
Explore in:
Blog mix 1998
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix1998.xml.bz2
2017-02-14 – 453.05 KB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX1998.txt.zip
2025-04-22 – 83.09 KB – CC-BY-4.0
Explore in:
Blog mix 1999
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix1999.xml.bz2
2017-02-14 – 9.27 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX1999.txt.zip
2025-04-22 – 569.59 KB – CC-BY-4.0
Explore in:
Blog mix 2000
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2000.xml.bz2
2017-02-22 – 2.69 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2000.txt.zip
2025-04-22 – 263.39 KB – CC-BY-4.0
Explore in:
Blog mix 2001
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2001.xml.bz2
2017-02-14 – 4.7 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2001.txt.zip
2025-04-22 – 424.02 KB – CC-BY-4.0
Explore in:
Blog mix 2002
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2002.xml.bz2
2017-02-14 – 3.4 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2002.txt.zip
2025-04-22 – 319.44 KB – CC-BY-4.0
Explore in:
Blog mix 2003
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2003.xml.bz2
2017-02-14 – 3.76 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2003.txt.zip
2025-04-22 – 377.79 KB – CC-BY-4.0
Explore in:
Blog mix 2004
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2004.xml.bz2
2017-02-14 – 9.03 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2004.txt.zip
2025-04-22 – 593.74 KB – CC-BY-4.0
Explore in:
Blog mix 2005
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2005.xml.bz2
2017-02-14 – 70.01 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2005.txt.zip
2025-04-22 – 2.3 MB – CC-BY-4.0
Explore in:
Blog mix 2006
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2006.xml.bz2
2017-02-15 – 123.62 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2006.txt.zip
2025-04-22 – 3.5 MB – CC-BY-4.0
Explore in:
Blog mix 2007
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2007.xml.bz2
2017-02-15 – 288.92 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2007.txt.zip
2025-04-22 – 5.78 MB – CC-BY-4.0
Explore in:
Blog mix 2008
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2008.xml.bz2
2017-02-16 – 656.67 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2008.txt.zip
2025-04-22 – 9.46 MB – CC-BY-4.0
Explore in:
Blog mix 2009
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2009.xml.bz2
2017-02-17 – 1.1 GB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2009.txt.zip
2025-04-22 – 12.95 MB – CC-BY-4.0
Explore in:
Blog mix 2010
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2010.xml.bz2
2017-02-23 – 1.44 GB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2010.txt.zip
2025-04-22 – 15.57 MB – CC-BY-4.0
Explore in:
Blog mix 2011
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2011.xml.bz2
2017-02-24 – 1.48 GB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2011.txt.zip
2025-04-22 – 15.45 MB – CC-BY-4.0
Explore in:
Blog mix 2012
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2012.xml.bz2
2017-02-23 – 1.17 GB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2012.txt.zip
2025-04-22 – 12.92 MB – CC-BY-4.0
Explore in:
Blog mix 2013
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2013.xml.bz2
2017-02-24 – 930.12 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2013.txt.zip
2025-04-22 – 10.71 MB – CC-BY-4.0
Explore in:
Blog mix 2014
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2014.xml.bz2
2017-02-23 – 596.24 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2014.txt.zip
2025-04-22 – 8 MB – CC-BY-4.0
Explore in:
Blog mix unknown date
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmixodat.xml.bz2
2017-02-23 – 511.42 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIXODAT.txt.zip
2025-04-22 – 7.77 MB – CC-BY-4.0
Explore in:
Bloggmix 2015
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2015.xml.bz2
2017-05-10 – 434.91 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2015.txt.zip
2025-04-22 – 6.49 MB – CC-BY-4.0
Explore in:
Bloggmix 2016
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2016.xml.bz2
2017-02-22 – 262.98 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2016.txt.zip
2025-04-22 – 4.97 MB – CC-BY-4.0
Explore in:
Bloggmix 2017
Material from a selection of Swedish blogs. Is updated regularly.
Corpus
Swedish
Dataset:
bloggmix2017.xml.bz2
2017-02-22 – 23.48 MB – CC-BY-4.0
Word statistics:
stats_BLOGGMIX2017.txt.zip
2025-04-22 – 1.21 MB – CC-BY-4.0
Explore in:
Bonnier novels I (1976–77)
A corpus of 69 Bonnier novels from 1976–77
Corpus
Swedish
Dataset:
romi.xml.bz2
2017-10-04 – 135.42 MB – CC-BY-4.0
Word statistics:
stats_ROMI.txt.zip
2025-04-22 – 2.52 MB – CC-BY-4.0
Explore in:
Bonniers novels II (1980–81)
A corpus of 60 Bonnier novels from 1980–81
Corpus
Swedish
Dataset:
romii.xml.bz2
2017-03-17 – 62.87 MB – CC-BY-4.0
Word statistics:
stats_ROMII.txt.zip
2025-04-22 – 2.21 MB – CC-BY-4.0
Explore in:
Bring
A digital version of Bring's thesaurus (1930)
Lexicon
Swedish
Dataset:
bring.txt
2017-09-11 – 6.69 MB – CC-BY-4.0
Caafimaad 1983
Corpus
Somali
Dataset:
somali-caafimaad-1983.xml.bz2
2024-01-15 – 4.48 KB – CC-BY-4.0
Explore in:
COCTAILL
Corpus of coursebooks used for teaching L2 Swedish. Annotated manually for text structure and pedagogical/didactical categories; automatically linguistically annotated. See more here https://spraakbanken.gu.se/forskning/teman/icall/icall-l2-projects/l2-data
Corpus
Swedish
Dataset:
coctaill.xml.bz2
2017-10-30 – 16.57 MB – CC-BY-4.0
Word statistics:
stats_COCTAILL.txt.zip
2025-04-22 – 621.39 KB – CC-BY-4.0
Explore in:
COCTAILL activities & examples
Corpus of coursebooks used for teaching L2 Swedish. Annotated manually for text structure and pedagogical/didactical categories; automatically linguistically annotated.
Corpus
Swedish
Word statistics:
stats_COCTAILL-AE.txt.zip
2025-04-22 – 352 KB – CC-BY-4.0
Explore in:
COCTAILL lesson text
Corpus of coursebooks used for teaching L2 Swedish. Annotated manually for text structure and pedagogical/didactical categories; automatically linguistically annotated.
Corpus
Swedish
Word statistics:
stats_COCTAILL-LT.txt.zip
2025-04-22 – 379.61 KB – CC-BY-4.0
Explore in:
CoDeRooMor, v.01
Morphological dataset (word-building morphology), Swedish L2 profiles project
Lexicon
Swedish
Dataset:
CodeRoomor_v01_lemgramView.csv
2021-04-13 – 1.96 MB – CC-BY-4.0
Dataset:
CodeRoomor_v01_morphemeView.csv
2021-04-13 – 856.29 KB – CC-BY-4.0
Dataset:
CodeRoomor_v01_lemgramView.xlsx
2021-04-13 – 1.72 MB – CC-BY-4.0
Dataset:
CodeRoomor_v01_morphemeView.xlsx
2021-04-13 – 699.46 KB – CC-BY-4.0
Explore in:
Constructicon
Swedish Constructicon
Lexicon
Swedish
Dataset:
konstruktikon.xml
2021-11-09 – 2.03 MB – CC-BY-4.0
Explore in:
Corpus of spoken isiXhosa
A corpus of transcribed and annotated recordings of spoken Xhosa.
Corpus
Xhosa
Dataset:
xhosa.xml.bz2
2026-03-09 – 295.86 KB – CC-BY-4.0
Explore in:
Corpus word statistics
Accumulated word statistics from many of our modern Swedish corpora
Corpus
Word statistics:
stats_all.txt.zip
2025-04-22 – 763.87 MB – CC-BY-4.0
Dagens Arena
News texts from dagensarena.se
Corpus
Swedish
Dataset:
da.xml.bz2
2026-01-16 – 246.98 MB – CC-BY-4.0
Word statistics:
stats_da.csv.zip
2025-04-22 – 21.95 MB – CC-BY-4.0
Explore in:
DaLAJ-GED-SuperLim 2.0
Dataset for Linguistic Acceptability Judgments (and more), v.2.0
Corpus
Swedish
Dataset:
dalaj-ged-superlim.zip
2023-04-03 – 1.41 MB – CC-BY-4.0
Dataset:
dalaj-ged-tsv.zip
2023-05-20 – 1.15 MB – CC-BY-4.0
Dataset:
liuep197-11.pdf
2024-01-25 – 463.74 KB – CC-BY-4.0
Dalin Dictionary
Dalin's Dictionary of 19th century Swedish
Lexicon
Swedish
Dataset:
dalin.xml
2017-09-13 – 32.26 MB – CC-BY-4.0
Explore in:
Dalin Dictionary - Base Material
Dalin's Dictionary of 19th century Swedish - base material
Lexicon
Swedish
Dataset:
dalin-base.xml
2017-09-13 – 25.76 MB – CC-BY-4.0
Explore in:
Dalin: Then Swänska Argus 1732-1734
Manual transcription of Then Swänska Argus by Olof von Dalin, Stockholm, 1732–1734. For OCR analysis.
Corpus
Swedish
Dataset:
dalin-then-swaanska-argus-1732-1734.tar.gz
2020-06-12 – 80.21 MB – CC-BY-4.0
Dalin's morphology
A morphology from Dalin's Dictionary of 19th century Swedish that is derived from Dalin's base material.
Lexicon
Swedish
Dataset:
dalinm.xml
2017-09-13 – 133.24 MB – CC-BY-4.0
Explore in:
Dalpilen 1860's
Part of the collection Kubhist2
Corpus
Swedish
Dataset:
kubhist2-dalpilen-1860.xml.bz2
2024-01-09 – 273.1 MB – CC-BY-4.0
Word statistics:
stats_kubhist2-dalpilen-1860.csv.zip
2025-04-22 – 4.94 MB – CC-BY-4.0
Explore in:
Databank of 1977 Spanish Press
Text from two Spanish newspapers from 1977. Part of SOL - Spanish Online.
Corpus
Spanish
Dataset:
pe77.xml.bz2
2017-11-10 – 7.7 MB – CC-BY-4.0
Explore in:
Pagination
Page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
Page
10
Page
11
Page
12
Page
13
Page
14
Next page
Next ›
Last page
Last »
Plattformar
Hur vi arbetar
Data
Analyses
Research
Publications
Doktorandutbildning
For PhD students and supervisors
Research meetings
Staff
Aktuellt
Calendar
Conferences and workshops
Autumn Workshop
Höstworkshop 2025
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
FAQ
About us
Organisation
Språkbanken 50 years
Celebration
A brief history
How to cite
Cookies
Internal
Contact us
Help desk
Sök