Gooooolll! Who Will Win The World Cup?

Gooooolll! Who Will Win The World Cup?

It’s been a month since the World Cup began and as usual, there were quite a few surprises in these matches. Seriously – did anyone see Germany getting bumped in the first […]

How Alternative Data is Reshaping Finance

How Alternative Data is Reshaping Finance

According to a report recently featured on the Financial Times (PDF), hedge funds are expected to spend upwards of $600m on digital datasets this year, and up to $1bn by 2020. What’s […]

What is the Omgili Bot, and why is it Crawling Your Website?

What is the Omgili Bot, and why is it Crawling Your Website?

Hi there. If you’re reading this, it’s probably because you’ve run into Omgilibot – perhaps in your web analytics or server logs (user agent: omgili/0.5 +https://omgili.com) – and turned to Google to […]

3 Steps to Turn Webpages into Machine-Readable Data

3 Steps to Turn Webpages into Machine-Readable Data

The vast majority of us use the web every single day – for news, shopping, socializing and really any type of activity you can imagine. But when it comes to acquiring data […]

What is DaaS, BDaaS, DBaaS? And Why Should You Care?

What is DaaS, BDaaS, DBaaS? And Why Should You Care?

The proliferation of data services has created a wide range of confusing buzzwords and acronyms – but at its core, DaaS is still a meaningful concept. We are living in the age […]

Crawling the Dark Web to Detect the Next Market

Crawling the Dark Web to Detect the Next Market

Over the past few days, the internet has been abuzz with talk of the recent blows dealt by law enforcement to two major dark web “marketplaces”, AlphaBay and Hansa market; and the […]

Can Data Science Deliver a Fake News Detector?

Can Data Science Deliver a Fake News Detector?

Regardless of your political opinion, fake news has dominated the conversation since the 2016 US presidential election. The crux of the problem is that the very definition of what qualifies as fake […]

The Hackathon Award for Best API Mashup Goes to…

The Hackathon Award for Best API Mashup Goes to…

Competitive programming competitions, commonly referred to as Hackathons, offer a great opportunity for new talent to show what they can do. Much like professional sports, industry leaders send recruiters to scout out […]

Webz.io API Featured in New Guide to Web Development with Django

Webz.io API Featured in New Guide to Web Development with Django

Last February, co-authors Leiff Azopardi and James Maxwell completed the latest edition of their book Tango with Django. It presents an excellent step-by-step approach to learning Python on the popular Django framework […]

How to use rated reviews for sentiment classification

How to use rated reviews for sentiment classification

Sentiment classification is a fascinating use case for machine learning. Regardless of complexity – you need two core components to deliver meaningful results; a machine learning engine and a significant volume of […]

Can Crawled Web Data Tell the Future?

Can Crawled Web Data Tell the Future?

Robert Tercek’s book Vaporized: Solid Strategies for Success in a Dematerialized World recently recently won GetAbastract’s 2016 International Book of the Year award at the Frankfurt Book Fair. Based in Hollywood, Robert has […]

Should you buy crawled web data or build your own solution?

Should you buy crawled web data or build your own solution?

In a technologically driven environment, the temptation to develop a proprietary web crawling solution is virtually irresistible. Our latest report examines the true cost of computing and software development resources required to deliver a data […]