March 2019

Avoid Biased Data Analysis with Clean and Structured Data

Posted on March 10, 2019 by

I want to share with you an unfortunate truth: All data is biased. Here at Webhose, we’ve written about this at length in our posts that explained how surveys are biased and the danger of fake reviews.  News headlines throughout 2018 were full of examples of disinformation, fake news and the questioning of its impact

Continue reading

Posted in API | Comments Off on Avoid Biased Data Analysis with Clean and Structured Data

Article’s publication date extractor – an overview

Posted on December 13, 2015 by

A few days ago I’ve released an open source Python module that provides you with a simple way to extract and normalize the publication date of any online blog or news post. There are some commercial solutions out there, but why not just use this module for free?   The logic behind the code Here

Continue reading

Posted in API | Comments Off on Article’s publication date extractor – an overview

Webhose.io Tip: Search for top performing (viral) posts

Posted on April 30, 2015 by

Our crawlers download millions of posts a day from millions of sources. Sometimes you may want to only sift through news or blog posts that had some kind of social impact. To provide you with this capability, we are introducing a new score we call the “Performance Score”.  

Continue reading

Posted in API | Comments Off on Webhose.io Tip: Search for top performing (viral) posts