Survey Results: What Matters to Web Data Collection Buyers

Posted on June 28, 2018 by webhose

While structured web data presents exciting possibilities in many fields of endeavor – including finance, cyber-security, artificial intelligence and more – the market for data extraction platforms is still fairly young. Only a handful of companies are providing online data at scale, and unlike other technologies which are covered extensively by analysts and professional publications,...

Continue reading

Posted in Technology

The Hackathon Award for Best API Mashup Goes to…

Posted on March 26, 2017 by ohadf

Competitive programming competitions, commonly referred to as Hackathons, offer a great opportunity for new talent to show what they can do. Much like professional sports, industry leaders send recruiters to scout out the top performers. With high stakes on the line and limited resources, getting noticed as a hackathon winner not only looks good on...

Continue reading

Posted in API

Webhose.io API Featured in New Guide to Web Development with Django

Posted on March 12, 2017 by ohadf

Last February, co-authors Leiff Azopardi and James Maxwell completed the latest edition of their book Tango with Django. It presents an excellent step-by-step approach to learning Python on the popular Django framework v1.9 (also compatible with v1.10). Although the book is designed as a beginner’s guide to web development, the material is packed with tips even...

Continue reading

Posted in API

How to use rated reviews for sentiment classification

Posted on February 9, 2017 by Omer Turner

Sentiment classification is a fascinating use case for machine learning. Regardless of complexity – you need two core components to deliver meaningful results; a machine learning engine and a significant volume of structured data to train that engine. Last month, we added the new “rating” field for rated review sites covered in the Webhose.io threaded...

Continue reading

Posted in API

Should you buy crawled web data or build your own solution?

Posted on October 10, 2016 by ohadf

In a technologically driven environment, the temptation to develop a proprietary web crawling solution is virtually irresistible. Our latest report examines the true cost of computing and software development resources required to deliver a data crawling and structuring solution at scale: Development & Maintenance Development could mean coding a proprietary solution from scratch, or modifying an existing crawling...

Continue reading

Posted in Technology

The Race to Achieve 100% Coverage of the Web

Posted on September 19, 2016 by ohadf

In our new report, we deconstruct the all-too-familiar race to achieve 100% coverage of the web. Data acquisition efforts usually rely on one of three approaches – build an internal web crawling capability, rely on data providers, or implement a combination of both. The goal is to tap into as much structured web data as...

Continue reading

Posted in Big Data

How to Keep Your Restaurant Sentiment Analysis Well-Fed

Posted on April 6, 2016 by webhose

When the team from London-based data analysis service GetSentiment developed a bleeding-edge system to measure the emotional baggage found in free text, they were missing just one thing: relevant data. “We were looking for a data provider that would be able to give access to sufficiently large amounts of frequently updated mentions of brands,” recalls...

Continue reading

Posted in News

Social Media Analytics: Insights from Structured versus Unstructured Data

Posted on December 1, 2015 by webhose

Let’s be honest … social media is a challenge. Not only is staying current, active, and “topped off” a chore, but crafting full-scale campaigns that contribute to your business’ and brand’s actual goals can be bewildering. At the same time, the market for social-media continues to grow. According to recent data from eMarketer, “Social Network...

Continue reading

Posted in Big Data