NoSQL databases grew in popularity for use in highly distributed web applications that needed scale-out architectures but didn’t require the tabular relations used by traditional SQL relational ...
Hadoop is big, but there’s no doubt that the game changer will be marrying SQL— the primary language used by business analysts for ad hoc analysis—with Hadoop. If you don’t want the information in ...
Streaming is hot. The demand for real-time data processing is rising, and streaming vendors are proliferating and competing. Apache Kafka is a key component in many data pipeline architectures, mostly ...
It's been amusing to watch the NoSQL movement transition from a “We don’t need no stinking SQL” attitude to a “Can I please have some SQL with that?” philosophy. The nonrelational databases that ...
Historically, if you wanted to report against all of the business operations of your company, it was a very expensive ordeal. At ClearVoice, we needed to be able to collect data across many platforms, ...
Developers can now leverage real-time data using standard ANSI SQL, with new functionality including elastic storage separated from compute, strict-serializability, active replication and horizontal ...
Don't be afraid of using your chosen database's command-line client. I might as well say this up front: I don't like using GUI (aka non-command-line or graphical) tools with my databases. This is ...
One of the critical decisions facing companies embarking on big data projects is which database to use, and often that decision swings between SQL and NoSQL. SQL has the impressive track record, the ...
Speculation has been running rampant that Oracle may introduce its own NoSQL database at the OpenWorld conference, to be held next week in San Francisco. While Oracle thus far has been quiet about the ...
At the heart of Apache Spark is the concept of the Resilient Distributed Dataset (RDD), a programming abstraction that represents an immutable collection of objects that can be split across a ...