Create Hive tables and load data from Azure Blob Storage
Upload data to Azure Blob Storage
We assume that the data for the Hive tables is in an uncompressed tabular format and that it has been uploaded to the default container (or to an additional container) of the storage account used by the Hadoop cluster.
If you want to practice on the NYC Taxi Trip Data, you need to:
- download the 24 NYC Taxi Trip Data files (12 Trip files and 12 Fare files),
- unzip all files into .csv files, and then
- upload them to the default container (or another appropriate container) of the Azure Storage account; options for such an account are described in the topic Use Azure Storage with Azure HDInsight clusters. The process for uploading the .csv files to the default container of the storage account can be found on this page.
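
The unzip step, and the storage location that the uploaded files end up at, can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions, not the procedure from the linked pages: the helper names `unzip_csvs` and `wasb_uri` are hypothetical, as are any container, account, and folder names passed to them. The upload itself is left out, since it depends on the tool you choose.

```python
import shutil
import zipfile
from pathlib import Path


def unzip_csvs(zip_dir, out_dir):
    """Extract every .csv member of every .zip archive in zip_dir into out_dir.

    Hypothetical helper: assumes the downloaded archives sit in one local folder.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    extracted = []
    for archive in sorted(Path(zip_dir).glob("*.zip")):
        with zipfile.ZipFile(archive) as zf:
            for member in zf.namelist():
                if not member.lower().endswith(".csv"):
                    continue  # skip anything that is not a .csv file
                # Flatten any folder structure stored inside the archive.
                target = out / Path(member).name
                # Stream the copy so large files are not held in memory.
                with zf.open(member) as src, open(target, "wb") as dst:
                    shutil.copyfileobj(src, dst)
                extracted.append(target)
    return extracted


def wasb_uri(container, account, blob_path):
    """Build the wasb:// URI under which a Hadoop cluster addresses a blob.

    The container, account, and blob_path values are placeholders (assumptions).
    """
    path = blob_path.lstrip("/")
    return f"wasb://{container}@{account}.blob.core.windows.net/{path}"
```

For example, `unzip_csvs("downloads", "csv")` would return the list of extracted .csv paths, and `wasb_uri("mycontainer", "myaccount", "nyctaxi/" + path.name)` would give the location to reference for each file once it has been uploaded to that container.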
Author: Hustbill billyzhang2010@gmail.com
Last updated: 2021-11-09