Has all the content been crawled? How do I check? (Doc ID 1039444.1)

Last updated on SEPTEMBER 05, 2017

Applies to:

Oracle Knowledge - Version 8.0.x and later
Information in this document applies to any platform.
Information in this document applies to any platform.

Symptoms

What things should be looked at to check a crawl to see if the content is being crawled correctly?  

Changes

When there are job failures there are some times that you can restart from error when a job fails.
 - a connection error, or timeout error.
 - where the job was stopped from the system manager, these jobs you can restart.

If a job has a lot of warnings during content update and then fails in indexing, you would not want to restart this job from failure.  You would want to do a content reset on that collection and re crawl it completely.
If a collection has a lot of warnings in content update, content processing, indexing and then the job succeeds you could end up with virtually no documents for that collection.
if a job fails in collection maintenance then that can mean that some data (a collection) is completely missing.
If any task is failing because of a data corruption or a dictionary corruption, then you do not want to proceed with that job, that could be a job that needs to be opened to support in an SR.

Proceeding with a job that has data corruption can result in a runtime that does not work.  This would result in an outage for your system.  In this case you would have to restore the system from back up or follow this KM to go back to the index before if there were not multiple synchs of a bad index.

How to manually revert back to a previous good index if no search results are being returned because a bad index was synched out to the runtimes (Doc ID 1363108.1)  In some cases this is not possible, and so a regular backup of a runtime directory should be taken so that it can be restored in the case of an emergency.

Here is some information about understanding the status of a collection. Understanding Collection processing information in System Manager (Doc ID 1040286.1)
 

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms