Quick Guide to News APIs

Posted on October 10, 2017 by Ran Geva

Monitoring mass media has come a long way since the days of the press-cutting agency. The bulk of today’s news is published online, while modern technology lets us store, index and query massive amounts of textual data in milliseconds. Digitization presents clear advantages for consumers, who can now read or watch the news from the...

Continue reading

Posted in API

Can Data Science Deliver a Fake News Detector?

Posted on April 4, 2017 by ohadf

Regardless of your political opinion, fake news has dominated the conversation since the 2016 US presidential election. The crux of the problem is that the very definition of what qualifies as fake news is in dispute. Still, most of us would like to know if the news story we’re reading reflects actual events – or...

Continue reading

Posted in Machine Learning

Top 10 Big Data Stories Leading the Conversation

Posted on September 26, 2016 by ohadf

In the right hands, crawled web data can tell an amazing story. We were interested in the top 10 news stories – sorted by social shares on Facebook and LinkedIn. So we set up a simple news API request. We were looking for the stories published over the past 30 days returned by an exact match query for the term “big data”.  Here...

Continue reading

Posted in Big Data

100% coverage of the Web

Posted on March 9, 2016 by Webhose

Well that’s the holy grail. To be able to tap into World Wide Web as a whole is something that anyone dealing with data would like to have, but is far FAR from achieving (except maybe for the NSA, we don’t know). The idea behind Webhose.io is that when you need data from the web,...

Continue reading

Posted in API

Five Reasons a News Crawler Is Essential to Your Business

Posted on January 5, 2016 by Webhose

“Originality is the art of remembering something but forgetting where you heard it.” Case in point, I don’t remember where I heard that. Nonetheless, it’s absolutely true, especially when it comes to running an online business. Why? Because in today’s online marketplace, sales, brand management, and genuine engagement are all practices that shouldn’t begin with...

Continue reading

Posted in API

30-Days of Historical Data Access for Webhose.io Now Available

Posted on September 10, 2015 by Ran Geva

I’m very happy to let you know about the launch of our extended access to 30-days of historical data from Webhose.io, which is available to our paying customers immediately. No waiting list. With the 30 days data access, Webhose.io customers don’t have to worry about missing posts in the realtime stream since they can now...

Continue reading

Posted in News

Webhose.io Tip: Search for top performing (viral) posts

Posted on April 30, 2015 by Ran Geva

Here at Webhose, our crawlers download millions of posts a day from millions of sources. When searching for web data among these many sources, you may want to limit your results to news or blog posts that had some kind of social impact. To provide you with this capability, we are introducing a new score...

Continue reading

Posted in API