Announced this week, TurboRush for Hive is Pervasive Software's latest play in its efforts to try and make Hadoop-driven Hive data warehouse infrastructure queries run faster on less hardware.
As Pervasive now opens up its early access program, TurboRush for Hive is the first in a series of "big data accelerators" currently being developed on the company's parallel processing multicore processor-aligned DataRush platform.
"Hive is the data warehouse system built on top of Hadoop," said Ray Newmark, vice president of sales and marketing for Pervasive DataRush. "Pervasive TurboRush for Hive accelerates Hive by using the Pervasive DataRush dataflow engine on the back end, providing faster execution of Hive programs without needing to modify any code. It also provides the power of Pervasive DataRush parallelism to the full spectrum of SQL and Hive developers."
The company says that its engineers and early access users have seen Hive acceleration of between two to four times using the Pervasive TurboRush accelerator. Pervasive now intends to extend this model to deliver out-of-the-box accelerators for other languages and frameworks as well.
"I am seeing more and more organizations turn to the Hadoop ecosystem to see what kind of practical strategies they can adopt for processing big data," said David S. Linthicum, CTO of Blue Mountain Labs, and an internationally known cloud computing and SOA expert. "This is a promising offer to deliver enterprise-scale performance for Hive developers tackling big data challenges."
There will also be a community edition of Pervasive DataRush for Hadoop in the coming weeks.