Quantcast
Channel: Latest Articles – Norconex Inc
Browsing all 41 articles
Browse latest View live

An Open-Source Crawler for Autonomy IDOL

HP Autonomy users, take control over your web crawling. Norconex recently released an HP Autonomy IDOL Committer module for its open-source web crawler, Norconex HTTP Collector. You can now enjoy the...

View Article


Image may be NSFW.
Clik here to view.

Google Search Appliance (GSA): A journey into an accessible responsive web...

As a Search Expert at Norconex, I am often assigned the task of integrating web accessibility standards within a search user interface for our customers in the government sector; these customers look...

View Article


Facets with Lucene

During the development of our latest product, Norconex Content Analytics, we decided to add facets to the search interface. They allow for exploring the indexed content easily. Solr and Elasticsearch...

View Article

Image may be NSFW.
Clik here to view.

Monitor your crawler’s progress with JEF Monitor

On large environments, it’s common to have many crawlers running at once, or at scheduled intervals, in order to keep your collected content up-to-date. For example, this is a typical requirement of...

View Article

Image may be NSFW.
Clik here to view.

How to crawl Facebook

Despite all the “noise” on social media sites, we can’t deny how valuable information found on social media networks can be for some organizations. Somewhat less obvious is how to harvest that...

View Article


Image may be NSFW.
Clik here to view.

Create a website broken links checker

This tutorial will show you how to extend Norconex HTTP Collector using Java to create a link checker to ensure all URLs in your web pages are valid. The link checker will crawl your target site(s) and...

View Article

Image may be NSFW.
Clik here to view.

What’s new in Solr 5

I am very excited about the new Solr 5. I had the opportunity to download and install the latest release, and I have to say that I am impressed with the work that has been done to make Solr easy and...

View Article

Image may be NSFW.
Clik here to view.

How to Run Solr5 as a Service on Windows

In this tutorial, I will show you how to run Solr 5.0.0 as a Microsoft Windows service. Up to version 5.0.0, it was possible to run Solr inside the Java web application container of your choice....

View Article


Image may be NSFW.
Clik here to view.

Data Mining with Solr 5 – How to Slice and Dice Your Data With Facet Pivot...

Introduction You already know that Solr is a great search application, but did you know that Solr 5 could be used as a platform to slice and dice your data?  With Pivot Facet working hand in hand with...

View Article


Image may be NSFW.
Clik here to view.

Use Solr 5 with Docker

Docker is all the rage at the moment! It was recently selected as Gartner Cool Vendor in DevOps. As you may already know, Docker is a platform to build and deploy applications as self-contained units....

View Article

Image may be NSFW.
Clik here to view.

Google Search Appliance is Being Phased Out… Now What?

Google Search Appliance (GSA) was introduced in 2002, and since then, thousands of organizations have acquired Google “search in a box” to meet their search needs. Earlier this year, Google announced...

View Article

Image may be NSFW.
Clik here to view.

Google Search Appliance (GSA): A journey into an accessible responsive web...

As a Search Expert at Norconex, I am often assigned the task of integrating web accessibility standards within a search user interface for our customers in the government sector; these customers look...

View Article

Facets with Lucene

During the development of our latest product, Norconex Content Analytics, we decided to add facets to the search interface. They allow for exploring the indexed content easily. Solr and Elasticsearch...

View Article


Image may be NSFW.
Clik here to view.

Monitor your crawler’s progress with JEF Monitor

On large environments, it’s common to have many crawlers running at once, or at scheduled intervals, in order to keep your collected content up-to-date. For example, this is a typical requirement of...

View Article

Image may be NSFW.
Clik here to view.

How to crawl Facebook

Despite all the “noise” on social media sites, we can’t deny how valuable information found on social media networks can be for some organizations. Somewhat less obvious is how to harvest that...

View Article


Image may be NSFW.
Clik here to view.

Create a website broken links checker

This tutorial will show you how to extend Norconex HTTP Collector using Java to create a link checker to ensure all URLs in your web pages are valid. The link checker will crawl your target site(s) and...

View Article

Image may be NSFW.
Clik here to view.

What’s new in Solr 5

I am very excited about the new Solr 5. I had the opportunity to download and install the latest release, and I have to say that I am impressed with the work that has been done to make Solr easy and...

View Article


Image may be NSFW.
Clik here to view.

How to Run Solr5 as a Service on Windows

In this tutorial, I will show you how to run Solr 5.0.0 as a Microsoft Windows service. Up to version 5.0.0, it was possible to run Solr inside the Java web application container of your choice....

View Article

Image may be NSFW.
Clik here to view.

Data Mining with Solr 5 – How to Slice and Dice Your Data With Facet Pivot...

Introduction You already know that Solr is a great search application, but did you know that Solr 5 could be used as a platform to slice and dice your data?  With Pivot Facet working hand in hand with...

View Article

Image may be NSFW.
Clik here to view.

Use Solr 5 with Docker

Docker is all the rage at the moment! It was recently selected as Gartner Cool Vendor in DevOps. As you may already know, Docker is a platform to build and deploy applications as self-contained units....

View Article
Browsing all 41 articles
Browse latest View live