Skip to main content
Info meny
Aktuellt
FAQ
About us
Contact us
Sök
Plattformar
Data
Analyses
Research
Staff
Menu
Breadcrumb
Home
Language resources
Language resources
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1397)
Collections (32)
Corpora (1236)
Lexicons (84)
Training and evaluation data (27)
Models (50)
Title
Free search
Language
- Any -
Swedish
Albanian
Arabic
Belarusian
Blissymbols
Bosnian
Bulgarian
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finland Swedish
Finnish
French
German
Icelandic
Iranian Persian
Italian
Kele (Papua New Guinea)
Kurdish
Latin
Latvian
Lower Sorbian
Macedonian
Modern Greek (1453-)
Multiple languages
Norwegian
Norwegian Bokmål
Old English (ca. 450-1100)
Old High German (ca. 750-1050)
Old Norse
Old Saxon
Polish
Portuguese
Romanian
Russian
Serbian
Slavomolisano
Slovak
Slovenian
Somali
Spanish
Turkish
Turkmen
Ukrainian
Upper Sorbian
Xhosa
Resurs
Antal tokens
Språk
Åtkomst
Collection
Web News
News from Swedish newspapers' websites
Swedish
See 13 collected resources
Explore in:
Web News 2001
News from Swedish newspapers' websites
614,151
Swedish
Dataset:
webbnyheter2001.xml.bz2
2024-01-04 – 17.13 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2001.csv.zip
2025-04-22 – 929.87 KB – CC-BY-4.0
Explore in:
Web News 2002
News from Swedish newspapers' websites
17,426,173
Swedish
Dataset:
webbnyheter2002.xml.bz2
2022-11-30 – 506.49 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2002.csv.zip
2025-04-22 – 8.19 MB – CC-BY-4.0
Explore in:
Web News 2003
News from Swedish newspapers' websites
12,217,288
Swedish
Dataset:
webbnyheter2003.xml.bz2
2022-11-30 – 357.9 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2003.csv.zip
2025-04-22 – 6.44 MB – CC-BY-4.0
Explore in:
Web News 2004
News from Swedish newspapers' websites
13,806,323
Swedish
Dataset:
webbnyheter2004.xml.bz2
2022-11-30 – 403.31 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2004.csv.zip
2025-04-22 – 6.68 MB – CC-BY-4.0
Explore in:
Web News 2005
News from Swedish newspapers' websites
29,503,647
Swedish
Dataset:
webbnyheter2005.xml.bz2
2024-01-05 – 849.81 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2005.csv.zip
2025-04-22 – 7.6 MB – CC-BY-4.0
Explore in:
Web News 2006
News from Swedish newspapers' websites
22,563,792
Swedish
Dataset:
webbnyheter2006.xml.bz2
2022-12-01 – 654.61 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2006.csv.zip
2025-04-22 – 9.77 MB – CC-BY-4.0
Explore in:
Web News 2007
News from Swedish newspapers' websites
24,630,443
Swedish
Dataset:
webbnyheter2007.xml.bz2
2022-12-01 – 715.52 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2007.csv.zip
2025-04-22 – 10.25 MB – CC-BY-4.0
Explore in:
Web News 2008
News from Swedish newspapers' websites
27,561,804
Swedish
Dataset:
webbnyheter2008.xml.bz2
2022-12-01 – 796.9 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2008.csv.zip
2025-04-22 – 11.17 MB – CC-BY-4.0
Explore in:
Web News 2009
News from Swedish newspapers' websites
25,888,779
Swedish
Dataset:
webbnyheter2009.xml.bz2
2024-01-05 – 747.74 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2009.csv.zip
2025-04-22 – 7.33 MB – CC-BY-4.0
Explore in:
Web News 2010
News from Swedish newspapers' websites
23,803,577
Swedish
Dataset:
webbnyheter2010.xml.bz2
2022-12-02 – 691.09 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2010.csv.zip
2025-04-22 – 9.17 MB – CC-BY-4.0
Explore in:
Web News 2011
News from Swedish newspapers' websites
26,268,603
Swedish
Dataset:
webbnyheter2011.xml.bz2
2022-12-02 – 764.57 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2011.csv.zip
2025-04-22 – 10.12 MB – CC-BY-4.0
Explore in:
Web News 2012
News from Swedish newspapers' websites
25,132,041
Swedish
Dataset:
webbnyheter2012.xml.bz2
2022-12-02 – 729.32 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2012.csv.zip
2025-04-22 – 9.89 MB – CC-BY-4.0
Explore in:
Web News 2013
News from Swedish newspapers' websites
22,648,638
Swedish
Dataset:
webbnyheter2013.xml.bz2
2024-01-05 – 652.09 MB – CC-BY-4.0
Word statistics:
stats_webbnyheter2013.csv.zip
2025-04-22 – 6.57 MB – CC-BY-4.0
Explore in:
Wexjöbladet 1820's
Part of the collection Kubhist2
1,338,559
Swedish
Dataset:
kubhist2-wexjobladet-1820.xml.bz2
2024-01-16 – 36.76 MB – CC-BY-4.0
Word statistics:
stats_kubhist2-wexjobladet-1820.csv.zip
2025-04-22 – 1.65 MB – CC-BY-4.0
Explore in:
WordReference
A large corpus of native and non-native written speech in four languages.
170,000,000
English, Spanish, French, Italian
Dataset:
wordreference.zip
2020-11-10 – 365.51 MB – CC-BY-4.0
Written production in learner French
This corpus contains student texts written by Swedish learners of French
8,920
French
Östgötalagen
52,062
Swedish
Word statistics:
stats_OGL.txt.zip
2025-04-22 – 21.13 KB – CC-BY-4.0
Explore in:
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
Page
10
Page
11
Page
12
Page
13
Plattformar
Hur vi arbetar
Data
Analyses
Research
Publications
Doktorandutbildning
For PhD students and supervisors
Research meetings
Staff
Aktuellt
Calendar
Conferences and workshops
Autumn Workshop
Höstworkshop 2025
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
FAQ
About us
Organisation
Språkbanken 50 years
Celebration
A brief history
How to cite
Cookies
Internal
Contact us
Help desk
Sök