如何从谷歌存储云读取数据到谷歌云数据实验室

问题描述

我有一些 CSV 文件存储在 Google 存储中,我想将它们读入 Google datalab.到目前为止,我不知道该怎么做.我发现 这个 并遵循第一个答案,但没有奏效并提出了

I have a few CSV files storing in Google storage and I want to read those into Google datalab. So far, I have no idea how to do it. I found this and followed the first answer but didn't work and raised

  File "<ipython-input-1-5e9607fa3f65>", line 5
    %%gcs read --object $data_csv --variable data
    ^
SyntaxError: invalid syntax

任何帮助将不胜感激.


解决方案

如果减去 % 符号之一,它应该可以工作.最小的例子:

If you subtract one of the % symbols it should work. Minimal example:

import google.datalab.storage as storage
import pandas as pd
from io import BytesIO

mybucket = storage.Bucket('BUCKET_NAME')
data_csv = mybucket.object('data.csv')

uri = data_csv.uri
%gcs read --object $uri --variable data

df = pd.read_csv(BytesIO(data))
df.head()

相关文章