Many factors can affect streaming data relevancy. When the data you consume is ordered not by relevancy but by the time it was crawled, filtering for the relevant posts is essential. I would like to share with you a few tips you can use to significantly increase the relevancy of the data you consume via the Webhose.io API:
- Add more keywords: If the keyword you are using is too generic, say “Apple”, and you are searching for Apple products, you can add multiple keywords describing those products, like (iPhone OR iPad OR MacBook). This will bring back posts mentioning both “Apple” and one or more of the listed products.
- Search in specific sections: A great way of retrieving relevant posts, without having to create exhausting multi-keyword queries, is to search for posts published in certain sections of a site. For example, searching for posts about restaurants is as easy as:
This limits the results to sections whose name contains the keyword “restaurants”. You can also use Boolean queries in the “thread.section_title” field, and combine it with other filters.
- Search in the post’s title: Although very restrictive, this method will bring back the most relevant results, because the keywords you are searching for will appear in the title of the article/thread/blog post.
- Other filters: On top of the techniques mentioned above, you can also apply simple filters like source type (news/blogs/forums) and language, and even limit the search to specific sites.
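To make the tips above concrete, here is a minimal Python sketch that assembles query strings along those lines. It only builds and URL-encodes the query text; it does not call the API. The helper names `any_of` and `build_query` are made up for this illustration. Only “thread.section_title” comes from this post; the other field names (`title`, `site_type`, `language`), the explicit `AND` operator and the `q` parameter name are assumptions, so please check them against the Webhose.io documentation before relying on them.

```python
from urllib.parse import urlencode


def any_of(terms):
    """Join terms into a parenthesized OR group, quoting multi-word terms."""
    quoted = [f'"{t}"' if " " in t else t for t in terms]
    return "(" + " OR ".join(quoted) + ")"


def build_query(keyword, any_terms=(), filters=None):
    """Combine a main keyword, an optional OR group, and field:value filters.

    Parts are joined with an explicit AND (assumed to be supported).
    """
    parts = [keyword]
    if any_terms:
        parts.append(any_of(any_terms))
    for field, value in (filters or {}).items():
        parts.append(f"{field}:{value}")
    return " AND ".join(parts)


# Tip 1: narrow a generic keyword with more specific ones.
apple_query = build_query("Apple", ["iPhone", "iPad", "MacBook"])

# Tip 2: restrict results to sections whose name contains "restaurants".
section_query = build_query("pizza", filters={"thread.section_title": "restaurants"})

# Tips 3 and 4: title search plus simple filters (field names are assumptions).
strict_query = build_query(
    "title:iPhone",
    filters={"site_type": "news", "language": "english"},
)

# URL-encode the query for use as a request parameter ("q" is an assumption).
encoded = urlencode({"q": apple_query})

print(apple_query)
print(section_query)
print(strict_query)
print(encoded)
```

Keeping the query assembly in one small function makes it easy to start broad, then add one filter at a time and see how each one changes the results you get back.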
If you have another tip/trick, let me know in the comments and I will share it with our readers.