Windows Azure HDInsight has been available for a little while now, but I haven’t had a chance to work with it. Tonight as I was browsing the Patterns & Practices Website, I noticed that they were working on a new book for the Cloud Series. It’s an ongoing project about developing Big Data solutions using the Windows Azure HDInsight and related technologies.
The book can be downloaded from the Patterns & Practices Windows Azure Guidance site.
Topics Covered in This Book
- What is Big Data?
- Getting Started with HDInsight
- Collecting and Loading Data
- Performance Queries and Transformations
- Consuming and Visualizing Results
- Debugging, Testing, and Automating HDInsight Solutions
- Managing and Monitoring HDInsight
- Iterative Data Exploration
- Extra, Transform, Load
- HDInsight as a Data Warehouse
- Enterprise BI Integration
The book is currently a preview and will probably change over time.
What’s Windows Azure HDInsight?
HDInsight is Microsoft’s Hadoop-based service that brings a 100% Apache Hadoop solution to the cloud. HDInsight gives you the ability to gain the full value of Big Data with a modern, cloud-based data platform that manages data of any type, whether structured or unstructured, and of any size.
With HDInsight you can seamlessly store and process data of all types through Microsoft’s modern data platform, which provides simplicity, ease of management, and an open Enterprise-ready Hadoop service all running in the cloud. You can analyze your Hadoop data with PowerPivot, Power View and other Microsoft BI tools, thanks to integration with Microsoft data platform. [More]