An Open-Source Crawler for Autonomy IDOL
HP Autonomy users, take control over your web crawling. Norconex recently released an HP Autonomy IDOL Committer module for its open-source web crawler, Norconex HTTP Collector. You can now enjoy the...
View Article
Google Search Appliance (GSA): A journey into an accessible responsive web...
As a Search Expert at Norconex, I am often assigned the task of integrating web accessibility standards within a search user interface for our customers in the government sector; these customers look...
View Article
Facets with Lucene
During the development of our latest product, Norconex Content Analytics, we decided to add facets to the search interface. They allow for exploring the indexed content easily. Solr and Elasticsearch...
View Article
Monitor your crawler’s progress with JEF Monitor
In large environments, it’s common to have many crawlers running at once, or at scheduled intervals, in order to keep your collected content up-to-date. For example, this is a typical requirement of...
View Article
How to crawl Facebook
Despite all the “noise” on social media sites, we can’t deny how valuable information found on social media networks can be for some organizations. Somewhat less obvious is how to harvest that...
View Article
Create a website broken links checker
This tutorial will show you how to extend Norconex HTTP Collector using Java to create a link checker to ensure all URLs in your web pages are valid. The link checker will crawl your target site(s) and...
View Article
What’s new in Solr 5
I am very excited about the new Solr 5. I had the opportunity to download and install the latest release, and I have to say that I am impressed with the work that has been done to make Solr easy and...
View Article
How to Run Solr5 as a Service on Windows
In this tutorial, I will show you how to run Solr 5.0.0 as a Microsoft Windows service. Up to version 5.0.0, it was possible to run Solr inside the Java web application container of your choice....
View Article
Data Mining with Solr 5 – How to Slice and Dice Your Data With Facet Pivot...
Introduction You already know that Solr is a great search application, but did you know that Solr 5 could be used as a platform to slice and dice your data? With Pivot Facet working hand in hand with...
View Article
Use Solr 5 with Docker
Docker is all the rage at the moment! It was recently selected as a Gartner Cool Vendor in DevOps. As you may already know, Docker is a platform to build and deploy applications as self-contained units....
View Article
Google Search Appliance is Being Phased Out… Now What?
Google Search Appliance (GSA) was introduced in 2002, and since then, thousands of organizations have acquired Google’s “search in a box” to meet their search needs. Earlier this year, Google announced...
View Article