Maximizing HDInsight throughput to Azure Blob Storage

Posted: Nov 25, 2013  0 comments  

Average Rating

Share this Post

The HDInsight service supports both HDFS and Windows Azure Storage (BLOB Service) for storing data. Using BLOB Storage with HDInsight gives you low-cost, redundant storage, and allows you to scale your storage needs independently of your compute needs. However, Windows Azure Storage allocates bandwidth to a storage account that can be exceeded by HDInsight clusters of sufficient size. If this occurs, Windows Azure Storage will throttle requests. This article describes when throttling may occur and how to maximize throughput to BLOB Storage by avoiding throttling. Note : In HDInsight, HDFS is intended to be used as a cache or for intermediary storage. When a cluster is deleted, data in HDFS will be discarded. Data intended for long-term storage Read More...

Read the complete post here

Submit a Comment

  • Plain text is accepted.
  • URLs starting with http:// are converted to links.