Diagram of Hadoop on Google Cloud Platform. HDFS and the NameNode are optional when storing data in Google Cloud Storage.
In the 10 years since we first introduced Google File System (GFS) — the basis for Hadoop Distributed File System (HDFS) — Google has continued to improve our storage system for large data processing. The latest iteration is Colossus.
Today’s launch brings that work to Hadoop. Using a simple connector library, Hadoop can now run directly against Google Cloud Storage, an object store built on Colossus. That means you benefit from Google’s expertise in large data processing.
Here are a few other benefits of running Hadoop with Google Cloud Storage:
- Compatibility: The Google Cloud Storage connector for Hadoop is code-compatible with Hadoop. Just change the URL to point to your data.
- Quick startup: Your data is ready to process. You don’t have to wait extra minutes or more while your data is copied over to HDFS and the NameNode comes out of safe mode, and you don’t have to pay for the VM time for data copying either.
- Greater availability and scalability: Google Cloud Storage is globally replicated and has higher availability than HDFS because it’s independent of the compute nodes and the NameNode. If the VMs are turned down (or, cloud forbid, crash) your data lives on.
- Lower costs: Save on storage and compute: storage, because there’s no need to maintain two copies of your data, one for backups and one for running Hadoop; compute, because you don’t need to keep VMs going just to serve data. And with per-minute billing, you can run Hadoop jobs faster on more cores and know your costs aren’t getting rounded up to a whole hour.
- No storage management overhead: Whereas HDFS requires routine maintenance -- like file system checks, rebalancing, upgrades, rollbacks and NameNode restarts -- Google Cloud Storage just works. Your data is safe and consistent with no extra effort.
- Interoperability: By keeping your data in Google Cloud Storage, you can benefit from all of the other Google services that already play nicely together.
- Performance: Google’s infrastructure delivers high performance from Google Cloud Storage that’s comparable to HDFS -- without the overhead and maintenance.
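The "just change the URL" point in the Compatibility bullet above can be sketched in a few lines of Python. This is a minimal illustration, not part of the connector: the helper name `to_gcs_uri`, the NameNode address, and the bucket name `example-bucket` are all hypothetical. It assumes only that the connector addresses data with `gs://` URIs, with the bucket name sitting where the NameNode host and port would be in an `hdfs://` URI.

```python
from urllib.parse import urlsplit


def to_gcs_uri(hdfs_uri: str, bucket: str) -> str:
    """Rewrite an hdfs:// URI as a gs:// URI in `bucket`, keeping the path.

    Hypothetical helper for illustration only; it is not a connector API.
    """
    parts = urlsplit(hdfs_uri)
    if parts.scheme != "hdfs":
        raise ValueError(f"expected an hdfs:// URI, got {hdfs_uri!r}")
    # The hdfs authority (NameNode host:port) is dropped. In a gs:// URI
    # the authority position holds the bucket name instead.
    return f"gs://{bucket}{parts.path}"


if __name__ == "__main__":
    # Hypothetical input path and bucket name.
    print(to_gcs_uri("hdfs://namenode:8020/logs/2014/part-00000", "example-bucket"))
```

In a real job, the resulting `gs://` path would be passed wherever the `hdfs://` path was used before, such as the job's input and output arguments; the job code itself stays the same.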
To see the benefits for yourself, give Hadoop on Google Cloud Platform a try by following the simple tutorial.
We would love to hear your feedback and ideas on how to make Hadoop and MapReduce run even better on Google Cloud Platform.
-Posted by Jonathan Bingham, Product Manager