Newsletters
News & Information for Technology Purchasers NewsFactor Sites:       NewsFactor.com     Enterprise Security Today     CRM Daily     Business Report     Sci-Tech Today  
   
This ad will display for the next 20 seconds. Click for more information, or
Home Enterprise I.T. Cloud Computing Applications Hardware More Topics...
Build Apps 5x Faster
For Half the Cost
Enterprise Cloud Computing

On Force.com
Applications
DDoS Protection Powered By Verisign
Average Rating:
Rate this article:  
Cloudera Announces Real-Time Query Engine for Hadoop

Cloudera Announces Real-Time Query Engine for Hadoop
By Barry Levine

Share
Share on Facebook Share on Twitter Share on Linkedin Share on Google Plus

Cloudera's Apache-licensed, open-source query engine, Cloudera Impala, is specifically designed for real-time query of data stored in a Hadoop Distributed File System, or HDFS, and in HBase, a non-relational distributed database, and the company said it is the result of two years of in-house development. The queries for Impala can be expressed as SQL.
 


There's a new tool for Big Data analysis. On Wednesday, Cloudera announced a real-time query engine for Apache Hadoop, resulting from two years of in-house development efforts.

The engine is an enhancement to Cloudera's Big Data platform, known as Cloudera Enterprise. In describing the query-engine's uniqueness, Cloudera claims this is the first time both real-time and batch operations are available for unstructured and structured data in one massively scalable system.

Cloudera offers a commonly used version of Hadoop, an open-source data framework designed for handling Big Data.

In its announcement, Cloudera said that the new query engine will enable organizations to "process data at petabyte scale and, on the same system, interact with that data in real time to deliver 'speed-of-thought' insights." In short, the company said, the new tool will allow organizations to "ask bigger questions" of their data.

SQL Queries

The Apache-licensed, open-source query engine, Cloudera Impala, is specifically designed for real-time query of data stored in a Hadoop Distributed File System (HDFS) and in HBase, a non-relational distributed database. Interactive queries for Impala can be expressed as SQL.

The company said that Impala operates 10 times as fast as the existing Hive/MapReduce, and can be even faster, depending on the workload. It pointed to cost savings for analyzing Big Data with real-time queries, by using this open-source technology with commodity hardware.

Cloudera said that, in a recent survey it conducted of more than 100 customers, over 70 percent were looking at how to extract value from Big Data. Operational IT efficiency and competitive advantage were cited by the customers as reasons for adopting Hadoop, but the vast majority also indicated they needed faster methods of querying than the batch operations that had been available.

'Most Exciting' Since Hadoop

In its announcement, the company pointed to one of its clients, travel Web site Expedia, which said that it uses the Cloudera Enterprise platform to manage more than 4 petabytes of data. With Impala added, Expedia said the enhanced Enterprise Real-Time Query platform allows the creation of one single platform for Big Data, instead of having to maintain several systems for archiving, extracting, transforming, loading, and analytics.

Cloudera CEO Mike Olson said in a statement that, "until now, enterprises had to limit the work they did with Hadoop because batch-mode processing using MapReduce was just too slow for some business problems." Impala, he explained, will enable organizations to store all their data in Hadoop and "use the same hardware to do both powerful analytics and run real-time queries using industry-standard tools and the SQL language."

In fact, Cloudera co-founder and Chief Scientist Jeff Hammerbacher characterized Impala as "the most exciting open-source project since Hadoop," adding that it was "the most important framework beyond MapReduce for analyzing data stored in HDFS and Hbase."
 

Tell Us What You Think
Comment:

Name:



Salesforce.com is the market and technology leader in Software-as-a-Service. Its award-winning CRM solution helps 82,400 customers worldwide manage and share business information over the Internet. Experience CRM success. Click here for a FREE 30-day trial.


 Applications
1.   Popular Mailbox App Comes to Mac
2.   9 Norton Security Products Are Now 1
3.   Infor Buys Cloud CRM App Saleslogix
4.   Plan Your Move from Windows 7 Now
5.   Health Agencies Use Dynamics CRM


advertisement
China Puts Microsoft Under the Lens
Official anti-monopoly probe launched.
Average Rating:
Popular Mailbox App Comes to Mac
Takes to-do list approach to the inbox.
Average Rating:
9 Norton Security Products Are Now 1
Symantec takes software-as-service tack.
Average Rating:
Product Information and Resources for Technology You Can Use To Boost Your Business

Network Security Spotlight
Cost of Target Data Breach: $148 Million
The now infamous Target data breach is still costing the company -- and its shareholders -- plenty. In fact, the retailing giant forecast the December 2013 incident cost shareholders $148 million.
 
Aruba Networks Handles Black Hat with Aplomb
It's not an easy job. Aruba Networks' task throughout the Black Hat USA conference in Las Vegas this month was to ensure thousands of attendees could connect without malicious attacks.
 
Chinese Hackers Nab Info on Millions of U.S. Patients
A group of Chinese hackers has stolen the personal information, including names and Social Security numbers, of about 4.5 million patients at hospitals operated by Community Health Systems.
 

Enterprise Hardware Spotlight
Three New Lenovo PCs Aimed at Business Users
Businesses everywhere want computing solutions that do more for less money, and Lenovo has unveiled three new desktop PCs that offer solid computing at a budget-minded price.
 
Aruba Networks Handles Black Hat with Aplomb
It's not an easy job. Aruba Networks' task throughout the Black Hat USA conference in Las Vegas this month was to ensure thousands of attendees could connect without malicious attacks.
 
Compression, Deduplication Come to Violin Concerto 2200
Violin Memory has announced that data deduplication and compression capabilities are now available on its Concerto 2200 solution. Typically, users will experience deduplication rates between 6:1 and 10:1.
 

Mobile Technology Spotlight
Apple Stock Soars Ahead of iPhone 6 Launch
The imminent release of the iPhone 6 -- and maybe even an iWatch -- has sent Apple's stock soaring to new heights. Considering what else the firm could have up its sleeve -- the stratosphere may be the limit.
 
HTC Debuts Windows Phone Version of One M8 Smartphone
HTC is bringing the Windows Phone mobile OS to its flagship One M8 device -- the first time any mainstream flagship smartphone has been offered with a choice of operating systems.
 
Verizon Earns Top Rating in Mobile Network Comparison
A new report says Verizon Wireless was the top-performing U.S. cellphone service provider in the first half of 2014, on a nationwide and state-by-state basis, as well as in metro areas.
 

Navigation
NewsFactor Network
Home/Top News | Enterprise I.T. | Cloud Computing | Applications | Hardware | Mobile Tech | Big Data | Communications
World Wide Web | Network Security | Data Storage | CRM Systems | Microsoft/Windows | Apple/Mac | Linux/Open Source | Personal Tech
Press Releases
NewsFactor Network Enterprise I.T. Sites
NewsFactor Technology News | Enterprise Security Today | CRM Daily

NewsFactor Business and Innovation Sites
Sci-Tech Today | NewsFactor Business Report

NewsFactor Services
FreeNewsFeed | Free Newsletters

About NewsFactor Network | How To Contact Us | Article Reprints | Careers @ NewsFactor | Services for PR Pros | Top Tech Wire | How To Advertise

Privacy Policy | Terms of Service
© Copyright 2000-2014 NewsFactor Network. All rights reserved. Article rating technology by Blogowogo. Member of Accuserve Ad Network.