My Oracle Support Banner

WebCenter Portal Document Crawl Fails With: java.net.SocketTimeoutException: Read timed out (Doc ID 2811454.1)

Last updated on DECEMBER 19, 2022

Applies to:

Oracle WebCenter Portal - Version 12.2.1.4.201126 and later
Information in this document applies to any platform.

Symptoms

The document crawler fails to run most of the times.  Just a few times does it completes.

When the crawler fails, testing the document crawl source settings shows the following error:


ERROR

URL is secured. Ensure credentials are valid 

The WC_Portal-diagnistic.log shows the following error:

[<TIMESTAMP>] [WC_Portal] [NOTIFICATION] [] [oracle.webcenter.doclib.crawl.rss.RSSCrawlerManager] [tid: [ACTIVE].ExecuteThread: '42' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] Crawler configuration file downloaded and parsed successfully. Configuration URL = http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONFIG&source=default

[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.http.HttpUtils] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] Error while sending HTTP request to url: http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default, Error message : Read timed out

[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.http.HttpUtils] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] [[
java.net.SocketTimeoutException: Read timed out
     at java.net.SocketInputStream.socketRead0(Native Method)
     at java.net.SocketInputStream.socketRead(SocketInputStream.java:116)
     at java.net.SocketInputStream.read(SocketInputStream.java:171)
     at java.net.SocketInputStream.read(SocketInputStream.java:141)
...

[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.http.HttpUtils] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] Error occured while processing HTTP Response, URL : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default, error message : Error occured while processing HTTP Response, URL : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default, error message : HTTP connect attempt failed after 2 attempts. Aborting connection attempt to url : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default

[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.http.HttpUtils] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] [[
oracle.webcenter.doclib.crawl.exception.URIHandlerException: Error occured while processing HTTP Response, URL : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default, error message : HTTP connect attempt failed after 2 attempts. Aborting connection attempt to url : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default
     at oracle.webcenter.doclib.crawl.http.HttpUtils.getHTTPResponse(HttpUtils.java:308)
     at oracle.webcenter.doclib.crawl.http.HttpUtils.getHTTPInputStream(HttpUtils.java:185)
     at oracle.webcenter.doclib.crawl.rss.RSSControlFeedFetcher.getControlFeed(RSSControlFeedFetcher.java:152)
     at oracle.webcenter.doclib.crawl.rss.RSSCrawlerManager.initCrawlerControlFeed(RSSCrawlerManager.java:612)
...

Caused by: oracle.webcenter.doclib.crawl.exception.URIHandlerException: HTTP connect attempt failed after 2 attempts. Aborting connection attempt to url : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default
     at oracle.webcenter.doclib.crawl.http.HttpUtils.postToURL(HttpUtils.java:436)
     at oracle.webcenter.doclib.crawl.http.HttpUtils.getHTTPResponse(HttpUtils.java:292)
... 159 more

]]
[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.rss.RSSControlFeedFetcher] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] Exception while getting the control feed stream from content server: http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default.

[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.rss.RSSControlFeedFetcher] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] Error Message : Error occured while processing HTTP Response, URL : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default, error message : Error occured while processing HTTP Response, URL : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default, error message : HTTP connect attempt failed after 2 attempts. Aborting connection attempt to url : http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default

[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.rss.RSSCrawlerManager] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] Initialization of crawler control feed failed.

[<TIMESTAMP>] [WC_Portal] [WARNING] [] [oracle.webcenter.doclib.crawl.rss.RSSCrawlerManager] [tid: [ACTIVE].ExecuteThread: '20' for queue: 'weblogic.kernel.Default (self-tuning)'] [userId: <USERID>] [ecid: <ECID>] [APP: webcenter] [partition-name: DOMAIN] [tenant-name: GLOBAL] [DSID: <DSID>] Exiting crawling , Error Message : Exception while getting the control feed stream from content server: http://<HOST:PORT>/cs/idcplg?IdcService=SES_CRAWLER_DOWNLOAD_CONTROL&source=default.
...

 

 

 

Changes

 

Cause

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


In this Document
Symptoms
Changes
Cause
Solution


My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.