Throughout Asia Pacific and Japan, our customers are experiencing the disruptive nature of big data technologies and the world of the Internet of Things. As the number of connected devices is expected to hit 20 billion by 2020*, the amount of available data is also expected to reach an all-time high. With this trend on the horizon, companies are faced with the consequences of disruptive competition and evolving business processes.
As organizations in all industries start to explore the tangible benefits of Big Data and the Internet of Things, it is imperative for companies to invest in the right technologies to stay competitive and relevant.
The Backbone: Hadoop & Spark
If your organization has invested in Big Data and the Internet of Things, then you’re most likely familiar with Hadoop. This open-source ecosystem offers an infrastructure to store and analyze massive volumes of structured and unstructured data. Underpinned by the Hadoop Distributed File System (HDFS) for data storage and powered by the Apache Spark in-memory processing engine, it is extremely attractive for companies seeking a Big Data management framework because it scales out easily and costs little.
More information on the Apache Hadoop Ecosystem.
Globally, Hadoop has seen wide-scale adoption, and the market is expected to grow at a CAGR of 58.2 percent until 2020.**
The Massive Divide
The Hadoop ecosystem offers an affordable and scalable solution for Big Data, but its architecture makes it impractical for the online transaction processing that enterprise business applications require. That transactional data is therefore stored either in traditional relational databases or, more recently, in next-generation data platforms like SAP HANA, which enables the convergence of transaction processing and analytics in one real-time system. (Read more on this specific topic in my blog SAP HANA – The Ultimate Simplifier.)
While both data frameworks are exceptional at what they were designed for, integrating them is imperative for innovative applications. This integration has proven technically challenging and has resulted in even more data silos. The outcome is added complexity and a massive divide between enterprise business data and Big Data, which stifles innovation and business differentiation.
Bridging The Gap
To bridge the divide between enterprise business data and Big Data, SAP introduced SAP HANA Vora, an in-memory query engine that is an implementation of Apache Spark for the Hadoop environment. SAP HANA Vora enables organizations to seamlessly integrate their enterprise business data with Big Data in Hadoop, delivering real-time precision analytics on both sets of data to gather meaningful business insights. With SAP HANA Vora, customers can leverage their existing Hadoop infrastructure to build innovative new applications that truly differentiate their business.
To learn more about SAP HANA Vora join me at Strata + Hadoop World in Singapore on December 1 – 3. I will be giving a keynote on Thursday, December 3 at 11:50 am in room 333.
I look forward to hearing your thoughts and answering any questions you may have on Twitter: @pmmarriott
* Source: Gartner, Connected “Things”
** Source: ICT Media, Global Hadoop Market