Crawls Failing with Cert Errors for IM Resources can be Overriden with Search Config Change

(Doc ID 1041133.1)

Last updated on NOVEMBER 16, 2017

Applies to:

Oracle Knowledge - Version 8.1.2.1 and later
Information in this document applies to any platform.
Information in this document applies to any platform.

Symptoms

If the crawls fail with the SSL certificate error:

processAccessBaseURLReplacement: called for https://mycompany.com/resources/sites/<repo>/content/meta/FAQS/
[3716618 Downloader-0(179845)] Event(Code=DOCUMENT_ACCESS_FAILED, id=#228B49I179845P179844) occurred at 11/14/17 10:14 AM: Unable to access to document im:<repo>/FAQS/live/88ae92a8e2b442d9b7b5d12ead4ab9fb in collection FAQS. Cause: javax.net.ssl.SSLHandshakeException: sun.security.validator.ValidatorException: PKIX path building failed: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target
sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target
at sun.security.provider.certpath.SunCertPathBuilder.build(SunCertPathBuilder.java:145)
at sun.security.provider.certpath.SunCertPathBuilder.engineBuild(SunCertPathBuilder.java:131)
at java.security.cert.CertPathBuilder.build(CertPathBuilder.java:280)
at sun.security.validator.PKIXValidator.doBuild(PKIXValidator.java:382)
at sun.security.validator.PKIXValidator.engineValidate(PKIXValidator.java:292)
at sun.security.validator.Validator.validate(Validator.java:260)
at sun.security.ssl.X509TrustManagerImpl.validate(X509TrustManagerImpl.java:324)

The reason this affects the crawls is because while the collection configurations for the IM channels point to a single instance running on http, the RESOURCE_HOST_URL in config.properties for the IM instance(s) is configured to point to a load-balancer address running on https, and absent any configured override the indexer will use this https URL when attempting to crawl the xml files.

The URLs from the config of the IM instance being crawled are explained here - How are IM attachments and the content xml created for IM articles in IM console and used by Infocenter and the Crawler? (Doc ID 1598600.1)

Changes

To update the certificates see this KM - Crawl or Runtime failing with sun.security exceptions because of invalid java cert (Doc ID 1381202.1)

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms