locked
Connection to Amazon Redshift database with R using RPostgreSQL2 package RRS feed

  • Question

  • Hi,

    We are trying to use Redshift as source of data for our ML Studio experiments and haven't been able to. We are loading the RPostgreSQL2 R package loaded as a custom package. A simple select to a 500 records table keeps running for 11 hours without any response.

    Any help will be well received

    Juan Carlos Lozano

    Data Architect - Adext.com


    • Edited by jclozanoc Thursday, September 14, 2017 3:02 PM
    Thursday, September 14, 2017 2:06 PM

All replies

  • Hey Juan,

    Apologies for the late reply - the R and Python processes are sandboxed, and thus we don't support general network access (though a few HTTP-only scenarios are known to work). Could you provide the output log for the runs by clicking on the module and then View Output Log? Does any error jump out at you?

    I wouldn't expect an 11hr wait for the records to return, but that it's more likely that the package is incompatible with the sandboxing and the process is in a bad state. Have you considered using the ADF task for data movement from Redshift to a supported source?

    https://docs.microsoft.com/en-us/azure/data-factory/v1/data-factory-amazon-redshift-connector

    Regards,

    AK

    • Proposed as answer by Hai Ning Tuesday, October 3, 2017 2:17 AM
    Monday, October 2, 2017 9:26 PM