Predictions for Big Data in 2013 by Arvind

2012 is coming to an end, which means it's that time of year when the media explode with summaries and recaps. However, to me, looking forward is more exciting than looking back, so I thought I would be remiss if I didn't throw in my four ...

Why Use a Graph-Oriented Database?

Suppose you worked for a business analysis software company, and your CEO wanted you to look into the possibility of developing a product that would help investment banks detect insider trading. Further suppose that the CEO wanted you to brief her on ...

A Truth Table for SPARQL Query Datasets

While contributing to the development of the uRiKA platform, I was asked by our database team to describe the various conditions affecting a query's dataset in a SPARQL query. I came up with the following truth table, which describes the result of a ...

Systematizing Serendipity

I was in a meeting with a customer the other day discussing Cray's value proposition when they pointed out that what Cray is doing with graph analytics is "systematizing serendipity." Poetic alliteration aside, I thought that the idea was spot-on, ...

Calculating a histogram in SPARQL

Recently a colleague and I were working on some networking data, and we wanted to explore a bit to see if it seemed reasonable. One of the first things he thought of was to make a histogram of packet sizes. We did a quick web search expecting to see ...

Extending SPARQL with CONSTRUCT Sub-queries

Recently some of my colleagues and I were discussing something that we consider a key limitation of SPARQL right now, which is that it provides no direct mechanism to create a temporary graph from the existing data to use in your query. Yes, you ...

Hitting the Moving Target

The W3C [1] is a fantastic organization that has done more than any other organization to help promote standards and interoperability on the web. It is also the body responsible for many of the core standards that our uRiKA system utilizes, such as RDF [2] ...

Issues with Data Normalization in RDF

Back around 2004 to 2008, I worked on a project that took the approach of modeling data via ontologies using some proprietary methods. As I sought out more standard methods for data modeling I found some of the new methods we find in today's ...

Practical SPARQL Benchmarking

There is a certain amount of misguided belief in the market that Semantic Web technologies simply aren't performant enough for the needs of a business, and I often hear this presented as a reason for not choosing these technologies over a traditional ...