Language Public Datasets

Webhose’s news and blog articles are available to researchers in 12 different languages. Download them to conduct financial or sentiment analysis, market research, AI or machine learning and media and web monitoring for your research project, startup or large organization.

Description
Category
#Documents
File Size
Crawled Date
Found:12 Datasets

Swedish news articles

News articles in Swedish from the leading Swedish news outlets

Language
234,196
338 MB
Oct, 2016

Spanish news articles

Spanish news articles from the top 10,000 (based on the ranking provided by Alexa) news sites

Language
341,695
601 MB
Oct, 2016

Russian news articles

Russian news articles from the top 10,000 (based on the ranking provided by Alexa) news sites

Language
291,584
538 MB
Oct, 2016

Portuguese news articles

News articles in Portuguese extracted from the top Portuguese news sites

Language
103,020
160 MB
Oct, 2015

Japanese news articles

Articles in Japanese from the top Japanese news sites

Language
29,970
39.1 MB
Oct, 2015

Italian news articles

News articles in Italian from the leading Italian news sites

Language
159,226
244 MB
Oct, 2015

German news articles

News articles in the German language from the top german news sites

Language
398,840
654 MB
Oct, 2015

French news articles

French news articles from the top 10,000 (based on the ranking provided by Alexa) news sites

Language
245,308
388 MB
Oct, 2016

English news articles

English news articles originated in the US from the top 1,000 (based on the ranking provided by Alexa) news sites

Language
499,610
923 MB
Nov, 2016

Dutch news articles

News articles in Dutch extracted from the top Dutch news sites

Language
116,193
132 MB
Oct, 2015

Chinese news articles

Chinese news articles from the top 10,000 (based on the ranking provided by Alexa) news sites

Language
316,003
571 MB
Oct, 2016

Arabic news articles

Arabic news articles from the top 10,000 (based on the ranking provided by Alexa) news sites

Language
236,383
375 MB
Oct, 2016
Customize Your Own Dataset
Create a Webhose account and create a dataset on your own. Access over 100TB of historical content
from multiple sources in one place.