EMC Greenplum makes big Big Data news
If you've ever been to a Big Data conference, Community Development and Open Source is what it's all about.
If you've ever been to an EMC Conference, or followed EMC software, not so much.
But all of that changed today.
We’re going to be delivering an enterprise distribution of Hadoop, Luke Lonergan and Scott Yara of EMC Greenplum told a room full of industry journalists this morning.
For anyone who doesn't know, Hadoop is the Apache Software Foundation's analytics engine. It is Java based and was built by numerous parties, one of its major contributors is Yahoo.
Jaws in the press briefing room might have dropped upon hearing this news, except that EMC CEO Joe Tucci had leaked the news (to an audience of 10,000) an hour earlier. What can the Greenplum guys do, he writes their paychecks.
All small talk aside, this is big news for the Big Data community. EMC is the first billion-dollar company to come to the community and say, 'Let's work together on solving these 'big data' problems, and let's do it the right way," explained Yara.
It would have been interesting to be a fly on the wall listening to Yara, Lonergan and Tucci talking about this.
Before pasting in EMC's press release, I'd be remiss if I failed to mention an interesting question that was raised during the Q&A session. It asked what kind of position this might put Cloudera in; after all, EMC and Cloudera announced an alliance for tackling Big Data last September.
More to come on this later, here's the press release:
EMC DELIVERS HADOOP ‘BIG DATA’ ANALYTICS TO THE ENTERPRISE
EMC Software Distribution, Support and New Appliance Solidify Apache Hadoop as an Enterprise-Ready Tool Enabling NewReal-Time Data Capabilities
EMC WORLD 2011— LAS VEGAS — May 9, 2011 — Extending its leadership in providing customers with the most powerful and efficient ways to extract value from Big Data, EMC Corporation (NYSE: EMC), the world leader in information infrastructure solutions, today announced a comprehensive strategy for distributing, integrating and supporting the Apache Hadoop open-source software used for data-intensive distributed applications. The companyis introducing the world’s first purpose-built,high-performance, data co-processing Hadoop appliance — the GreenplumHD Data Computing Appliance. The appliance marries Hadoop with the EMC Greenplum Database, allowing the co-processing of both structured and unstructured data within a single, seamless solution. In addition, EMC announced the availability of the Hadoop-based EMC Greenplum HD Community Edition and EMC Greenplum HD Enterprise Edition software. Combined with product certification by a dozen leading partners, these will enable technology innovations such as real-time data interaction, offer greater reliability, and make Hadoop much easier to deploy and use.
Apache Hadoop has rapidly emerged as the preferred solution for Big Data analytics across unstructured data. Organizations looking for opportunity in an ever-changing business environment are finding that Big Data analysis is the competitive advantage. Hadoop-based batch processing of unstructured and structured data at massive scale using commodity hardware has led to a profound change in analytics. By extracting the knowledge wrapped within unstructured machine-generated data, organizations can make better decisions that drive revenue, improve service and reduce costs.
The EMC Greenplum HD product family enables an organization to take advantage of Big Data analytics without the overhead and complexity that comes with the cumbersome tools and solutions on the market today. Available in two editions — Community and Enterprise —Greenplum HD software provides a complete platform including installation, training, global support and value add beyond simple packaging of the Apache distribution.
EMC’s unique value and capabilities for Hadoop include:
- EMC GreenplumHD Data Computing Appliance
- EMC Greenplum HD Enterprise Edition
- Data management features such as snapshots and wide area replication
- Simple data loading and access using a native network file system (NFS) interface
- End-to-end manageability including simple cluster deployment, automatic failure detection and notification, multi-site management and rolling upgrades
- Best of all, these capabilities are delivered along with two to five the performance improvement over the standard packaged versions of Apache Hadoop.
- EMC Greenplum HD Community Edition
In addition to its Hadoop offerings, EMC has created a vibrant and powerful ecosystem with twelve companies offering business intelligence, data transfer and other technology capabilities. These companies are Concurrent, CSC, Datameer, Informatica, Jaspersoft, Karmasphere, Microstrategy, Pentaho, SAS, SnapLogic, Talend, and VMware. This breadth of support is testament to the value EMC brings to Hadoop. Technology companies and enterprises can now extend the trust they have in EMC to the open source data analytics tool.
EMC Global Services has developed a series of professional services, support and training for data warehousing and business analytics, including a new Enterprise Business Analytics Assessment Service to review and understand data and its role across an organization, its processes and technology. EMC professionals will help customers deploy and optimize the new Greenplum Data Computing Appliance and design an environment for complex correlation across massive data sets. In addition, EMC will assist data migration and consolidation requirements from their Oracle, Teradata and other existing database systems onto the Greenplum Data Computing Appliance.
Monday, May 9, 2011 at 02:11PM 
Reader Comments (1)
Hello,
We facilitate the provision of independent analysis to support expert testimony, regulatory or legislative engagements. Frequently, this work includes economic, financial and statistical studies of varying data analysis, technical and http://www.stlouisbridal.com.