Hadapt and Revelytix are acquired by Teradata for big data boost

Admin — Tue, 26 Aug 2014 11:38:18 +0000

Teradata adds data-analysis, data-prep and data-management capabilities by acquiring two notable companies from the big data arena.

Loom is a metadata management system developed by Revelytix. It is compatible with a number of Hadoop distributions, including those from Hortonworks, IBM, Cloudera, Pivotal, Apache and MapR.

Loom is geared toward faster analysis by helping data scientists prepare information in Hadoop. Hadapt is known for its software that integrates the SQL query language with Hadoop. SQL is a simple and common skill among database administrators, who may not be familiar with Hadoop.

There are a number of other SQL-on-Hadoop implementations on the market, such as Pivotal’s HAWQ and Cloudera’s Impala. Recently Oracle, a rival of Teradata, announced Big Data SQL, which can run a single SQL query against Oracle’s relational database as well as Hadoop and NoSQL data stores.

Organizationally, the people and intellectual property of Revelytix and Hadapt will become part of Teradata Labs, according to an official statement released on Tuesday. The terms of the deals were not disclosed.

Teradata added in the statement that the acquisition of Revelytix and Hadapt underscores its commitment to innovation and customer value by enhancing the Teradata Unified Data Architecture and extending its big data portfolio.

Teradata’s twin acquisition of Revelytix and Hadapt gives an indication of where the big data market is heading.

Google Service – Live streaming data analysis

Admin — Mon, 25 Aug 2014 11:29:31 +0000

Companies and sites have many options for analyzing the data that gets uploaded to them, but Google appears to be one step ahead: it is planning a Google Cloud Dataflow service that can analyze both live streaming data and batch data. This can help users of the service respond to current trends and plan their moves accordingly.
According to Brian Goldfarb, head of the Google Cloud Dataflow service marketing, as more and more varied data gets created, it becomes really important to ingest and secure the data that matters. Analyzing large data sets involves a number of different programming models and technologies. In the end, though, those managing the service will be able to learn and implement many new services.
Google Cloud Dataflow:
This is a fully managed service that lets users create data pipelines to ingest and store data in either live-streaming mode or batch mode. The service is meant for analyzing arbitrary amounts of data. It lets the user focus on the analysis rather than on pipeline maintenance and processing infrastructure. It can be used as a security tool to detect unusual activity, or by companies to analyze consumer sentiment toward their products on a particular social networking platform. It can also be included in many other business applications and can serve as an alternative to ETL (extract, transform, load).
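To make the difference between the two modes concrete, here is a minimal sketch in plain Python. It does not use the Dataflow SDK (which the article says is Java-based), and all function names are hypothetical. It only illustrates the idea that one transform step can serve both a bounded batch input and an unbounded stream.

```python
from collections import Counter


def parse(lines):
    """Transform step shared by both modes: split lines into lowercase words."""
    for line in lines:
        yield from line.lower().split()


def run_batch(lines):
    """Batch mode: consume the whole bounded input, then report once."""
    return Counter(parse(lines))


def run_streaming(lines):
    """Streaming mode: yield an updated snapshot after every incoming line."""
    counts = Counter()
    for line in lines:
        counts.update(parse([line]))
        yield dict(counts)


if __name__ == "__main__":
    posts = ["love this phone", "love the camera"]
    print(run_batch(posts))            # one result after all input is read
    for snapshot in run_streaming(posts):
        print(snapshot)                # one result per incoming line
```

In the streaming variant a result is available as soon as the first line arrives, which is what makes trend-watching on live data possible.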
Advantages:
This service is based on the MapReduce programming model, which is currently used in Apache Hadoop, and on technologies that Google developed for internal use. Hadoop can analyze large amounts of data spread across many servers, and it pioneered this area of data analysis, even though it initially focused on writing data, and only in batch mode. Its limitation is that all the data needs to be collected before it can be analyzed.
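For readers unfamiliar with MapReduce, the following single-machine sketch in Python shows the three phases of the model on a word-count task. It is an illustration only, not Hadoop or Google code, and the function names are made up for this example.

```python
from collections import defaultdict


def map_phase(records):
    """Map: emit a (word, 1) pair for every word in every record."""
    for record in records:
        for word in record.lower().split():
            yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: combine the values collected for each key into one result."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(records):
    """Run map, shuffle and reduce in sequence over a complete input."""
    return reduce_phase(shuffle(map_phase(records)))


if __name__ == "__main__":
    print(word_count(["big data boost", "big data analysis"]))
```

Note that `shuffle` has to see every emitted pair before `reduce_phase` can start. That is the batch limitation described above: nothing can be reported until the whole input has been collected.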
Google has been taking a different approach to live streaming data analysis, incorporating technologies including Flume and MillWheel, which were built by the company itself. Flume was developed for building pipelines over large amounts of data, while MillWheel provides a platform for analyzing streaming data.
How it will work:
The service will provide a software development kit (SDK) that can be used to develop complex pipelines and perform analysis. The SDK will be based on the Java programming language; currently it does not support any other language. It will work as a library that lets users ingest large amounts of data from different sources and analyze it later. The data can be queried with Google’s own BigQuery service. Users of the service can analyze current trends by writing modules to examine the stored data.
Although the service is currently available only to selected Google users, it may be opened to the public later.
