We’re pleased to announce Isoxya 1.4: Web Crawler & Data Processing System. It’s been 6 months since our last major update to the core crawling engine, and in that time Isoxya has moved to our custom-designed, highly available infrastructure and received a plethora of performance improvements and bug fixes. Expanding our web: We continue to […]

Since releasing open-source Isoxya plugin: Elasticsearch 1.1 in September 2019, we’ve improved our design to better support multiple organisations and large amounts of crawling data. We’re pleased to announce version 1.2, which changes our index structure to make better use of Elasticsearch, upgrades to the latest Elasticsearch 7.6, and updates various libraries.

It’s been 5 months since we released open-source Isoxya plugin: Crawler HTML 1.0, and we’re pleased to announce version 1.1, which merges in link-checking from Isoxya plugin: Link Checker and now extracts headers by default. We’ve crawled millions of pages with the previous version as part of our private beta programme, and this has given us insights […]

By now you’ve probably heard of Isoxya, the web-crawling system with data-processing and data-streaming plugins, including an open-source spellchecker. With recently released full support for streaming data directly into the Elasticsearch database, Isoxya’s spellchecker plugin places the full power of Elasticsearch and related tools like Kibana at your disposal! Why spelling matters […]

We’re pleased to announce the release of Isoxya 1.3—the high-performance web-crawling system with data-processing and data-streaming plugins. With the introduction of external URL validation, public plugin configurations, and full support for streaming data directly into Elasticsearch, we feel this release is a solid and creative foundation for analysing data and powering other products. Release Notes […]

We’re pleased to announce open-source Isoxya plugin: Elasticsearch 1.1, a plugin which streams data from the Isoxya crawler to the Elasticsearch database. This release adds various metadata from Isoxya data-processing plugins, extends the metadata through Isoxya data-streaming plugins, and uses the Elasticsearch Bulk API to optimise data-streaming from pages generating large numbers of documents (e.g. […]

We’re really rather excited to announce the very first release of open-source Isoxya plugin: Elasticsearch 1.0—a plugin which streams data from the Isoxya crawler to the Elasticsearch database. Now any type of data from Isoxya—whether link-checking, spellchecking, or something else entirely—can be streamed directly to Elasticsearch in seconds, placing the full power of that and […]

We’re pleased to announce the very first release of open-source Isoxya plugin: Crawler HTML 1.0, which provides static HTML crawling for SEO and other internet data-processing activities. The plugin discovers the site graph as quickly as possible, leaving data-extraction or more costly operations to other plugins working in parallel. Using this in combination with the proprietary Isoxya engine, […]

We’re pleased to announce the very first release of open-source Isoxya plugin: Spellchecker 1.0—providing spellchecking to SEO and other internet-related data-processing activities. Using this in combination with the proprietary Isoxya engine, it’s possible to spellcheck entire websites, even if they have millions of pages. Docker images are available, and similar to Isoxya plugin: Link Checker, […]

We’re pleased to announce open-source Isoxya plugin: Link Checker 1.0—one of many possible plugins for the flexible Isoxya internet data processor and web crawler. This helps with link checking, useful in SEO for validating large lists of URLs. The very first release, this marks a new phase within the history of Isoxya, as we’ve decided […]