Category: Technology

Survey Results: What Matters to Web Data Collection Buyers

Posted on June 28, 2018 by

While structured web data presents exciting possibilities in many fields of endeavor – including finance, cybersecurity, artificial intelligence and more – the market for data extraction platforms is still fairly young. Only a handful of companies provide online data at scale, and unlike other technologies that are covered extensively by analysts and professional publications,

Continue reading

Posted in Technology | Leave a comment

Should you buy crawled web data or build your own solution?

Posted on October 10, 2016 by

In a technologically driven environment, the temptation to develop a proprietary web crawling solution is virtually irresistible. Our latest report examines the true cost of computing and software development resources required to deliver a data crawling and structuring solution at scale:

Development & Maintenance

Development could mean coding a proprietary solution from scratch, or modifying an existing crawling

Continue reading

5 Ways to Measure the Impact of Crawled Web Data on Your Business

Posted on July 27, 2016 by

The analysis you provide is only as good as the raw data you start with. Although data from the open web is often perceived as a commodity, not all crawled data is created equal. Whether you’re relying on a proprietary crawling technology, tapping into a vendor’s firehose, or implementing a combination of both strategies –

Continue reading

Calling all (almost) Kimono Labs developers to migrate to Webhose.io

Posted on February 16, 2016 by

Kimono Labs announced today that it has been acquired by Palantir. Unfortunately, Kimono Labs users will have only two weeks to migrate to a different service, because the team will shut down the Kimono service on February 29, 2016. The good news is that if you are a Kimono Labs user who used

Continue reading

How we quadrupled the performance of Elasticsearch

Posted on July 19, 2015 by

Well, that’s a misleading title. We actually quadrupled the performance of our brand monitoring alert system that uses Elasticsearch’s Percolator, but that would have been a much longer title.

Some background

Buzzilla has two main products. The first is Webhose.io, which provides businesses worldwide access to structured data from the open web, and the second

Continue reading

Crawling Horrors – Computer Vision Crawlers

Posted on November 26, 2014 by

So if RSS crawlers are bad and browser scraping isn’t efficient, what about computer vision web-page analyzers? This technology uses machine learning and computer vision to extract information from web pages by interpreting them visually, as a human being might.

Continue reading
