27 February, 2014
There’s a lot of confusing jargon and buzzwords in this new field. It helps to know who some of the major players are and…
Publication: Data Science Central
24 February, 2014
A question I receive quite a bit from people who are newly introduced to NoSQL platforms and the overall concept of multi-structured data sets…
Publication: Information Management
18 February, 2015
Data projects are now moving from the experimental stage to providing real returns on investment, with tools such as Hadoop and Cassandra forming an integral part of organizations’ enterprise-wide analytics platform. However, big data is a big investment, both in terms of money and time.
The faster companies are…
Publication: Datacenter Dynamics
3 February, 2015
The idea of gaining knowledge through specialized analysis of mass data started with data collection in the 1960s, and has steadily increased both in the amount of data processed and the sophistication of questions businesses try to answer.
Through this progression from static to dynamic and now to proactive provision…
1 December, 2014
We’re being overrun with Big Data and that has created a need to increase computing and networking power to make it possible to manage the vast amount of information available.
Toward that goal, a new generation of Information and communications technology (ICT) systems has been inspired by the operating…
Publication: Science 2.0
17 November, 2014
With data production accelerating to unprecedented rates, many organizations have turned to Hadoop as an inexpensive way to store and process that data. But those new to Hadoop often find themselves confronting a technology as inscrutable as its name. What is Hadoop? What does it do? What’s with the…
8 October, 2014
When we deployed the first production Hadoop cluster in 2006, we were looking to build a more efficient and cost-effective web search index at Yahoo. Very quickly, other groups at Yahoo started using Hadoop for research jobs and revenue-driving applications, such as click prediction for sponsored search. Since…
22 September, 2014
Hadoop, the open source big-data framework, has gradually evolved from being a shiny object in the laboratory to an increasingly serious tool finding its place in the Enterprise. At Gigaom, we’ve covered Hadoop’s increasing maturity, and completeness as an enterprise technology, because that’s been the story…
17 September, 2014
Hadoop has been gaining popularity as a business intelligence solution and the number of data analytics solutions offering Hadoop is growing exponentially. What does this mean for the traditional data warehouse?
Hadoop is an open-source, distributed file system developed by the Apache Software Foundation to solve the problems that…
Publication: Data Informed
17 September, 2014
Data mining techniques come in two main forms: supervised (also known as predictive or directed) and unsupervised (also known as descriptive or undirected). Both categories encompass functions capable of finding different hidden patterns in large data sets.
Although data analytics tools are placing more emphasis on self service, it’s…
Publication: Cloud Tweaks
18 August, 2014
Big Data isn’t just Hadoop and in-memory anymore. Big data technologies and tools have grown significantly over the past few years — so much so that it’s hard to keep up with them.
If you’d like to get up to snuff and are primarily interested in open…
31 July, 2014
Many Hadoop experts believe an integrated data warehouse (IDW) is simply a huge pile of data. However, data volume has nothing to do with what makes a data warehouse. An IDW is a design pattern, an architecture for an analytics environment. First defined by Barry Devlin in 1988, the architecture…