Skip to Content
0
May 22, 2020 at 01:44 PM

SAP Data Intelligence 3.0 trial. Another error accessing the Data Lake

206 Views

Hello,

I have uploaded files using "ML Data Manger" creating a Dataset and Collection.

Then I have created a Jupyter Notebook. In the "Data Browser" I can see both files, I generate the code snippet and when I run it crashes.

I think the error my be related with the error in my previous post. I hope this provides more information.

---------------------------------------------------------------------------
KeyError                                  Traceback (most recent call last)
/opt/conda/lib/python3.7/site-packages/sapdi/internal/datalake/datalake_client.py in _get_datalake_connection_details(di_client)
     55             scheme = "https" if connection_detail['protocol'] == 'swebhdfs' else "http"
---> 56             url = "{scheme}://{host}:{port}".format(scheme=scheme, host=connection_detail['host'],
     57                                                     port=connection_detail['port'])

KeyError: 'host'

During handling of the above exception, another exception occurred:

DataLakeException                         Traceback (most recent call last)
<ipython-input-1-0d17cbb02709> in <module>
      3 ws = sapdi.get_workspace(name='SampleData')
      4 dc = ws.get_datacollection(name='SampleDataCol')
----> 5 with dc.open('california_housing_test.csv').get_reader() as reader:
      6     df = pd.read_csv(reader)

/opt/conda/lib/python3.7/site-packages/sapdi/data/datacollection.py in open(self, rel_path)
    299         else:
    300             val = self.location
--> 301         return _StorageFactory.get_instance().get_handler(val)
    302 
    303     def register_as_artifact(self, scenario=None):

/opt/conda/lib/python3.7/site-packages/sapdi/internal/common/storage/storage_factory.py in get_instance()
     16 
     17         if StorageFactory._instance is None:
---> 18             StorageFactory._instance = DataLakeStorage()
     19         return StorageFactory._instance

/opt/conda/lib/python3.7/site-packages/sapdi/internal/common/storage/datalake/datalake_storage.py in __init__(self)
     12     def __init__(self):
     13         # return singleton
---> 14         self.data_lake_client = _datalake_client.DataLakeClient.get_instance()
     15 
     16     def get_handler(

/opt/conda/lib/python3.7/site-packages/sapdi/internal/datalake/datalake_client.py in get_instance()
     27     def get_instance():
     28         if DataLakeClient._instance is None:
---> 29             DataLakeClient._instance = DataLakeClient()
     30         return DataLakeClient._instance
     31 

/opt/conda/lib/python3.7/site-packages/sapdi/internal/datalake/datalake_client.py in __init__(self, di_client)
     63 
     64     def __init__(self, di_client=None):
---> 65         dl_connection_info = DataLakeClient._get_datalake_connection_details(di_client)
     66 
     67         session = requests.session()

/opt/conda/lib/python3.7/site-packages/sapdi/internal/datalake/datalake_client.py in _get_datalake_connection_details(di_client)
     60         except Exception as e:
     61             message = "Error while retrieving Data Lake URL.  Reason: {}".format(str(e))
---> 62             raise DataLakeException(message)
     63 
     64     def __init__(self, di_client=None):

DataLakeException: Error while retrieving Data Lake URL.  Reason: 'host'

Attachments

error.png (49.4 kB)