12

6

What triplestore are you using and what are its strengths and weaknesses?

How well does it scale? Is it easy to set up? Does it have good documentation?

Please only put one triplestore per answer if a triplestore is already mentioned provide additional information in comments.

flag

12 Answers

5

If you look for a comparison of query performance of different open source triple stores, take a look at the Berlin SPARQL Benchmark

Currently, the benchmark offers results for the following triple stores/RDF backends:

link|flag
3

I've only use 4store. http://4store.org/

  • It wasn't too hard to get installed. I needed a whole heap of libraries, but they were only an apt-get away.
  • It's Open Source
  • Command line tools
  • Developed by a commercial company (Garlik) rather than hobbyists or academics (not sure if that's a pro or con)

(for the record, the 4store developers are old friends of mine)

link|flag
2

Open Link Virtuoso Triple Store is another triple store. It can RDFize any XML or XHTML content. Easy to install on Linux. Windows installation is tougher. I actuallay started using it very recently. Am able to store my Triples, do SPARQL queries on those. Will have to further explore its other potentials. chk out here http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/

link|flag
Windows installation is actually pretty easy, of the Triple Stores I've installed it's probably the easiest – Rob Vesse Dec 14 at 9:27
Hi Rob, But does it not need Cygwin ? – Sashikiran Challa Dec 21 at 18:37
There are prebuilt 32-bit and 64-bit binaries for Windows, so it's just unzip and run. – PÄ“teris Caune Feb 9 at 17:53
2

BigOWLIM from Ontotext is another triple store that really worth a try. OWLIM is a high-performance semantic repository developed in Java. It is packaged as a Storage and Inference Layer (SAIL) for the Sesame RDF framework. Supports SPARQL. Offers high-performance reasoning and reliable persistence strategy. http://www.ontotext.com/owlim/index.html

link|flag
2

If you use Java, you can use Jena, which offers two storage backends.

First, there is SDB, which is based on a backing RDBMS like MySQL, Postgres, etc. The second alternative is TDB, which is a native, disk-based storage system (although not 1.x yet).

I've worked with RDB, the predecessor of SDB, and I've tested SDB, too. With the provided documentation, they are easy to use, and if there are any questions, there is a quite active community at the jena-dev group.

If you don't want to access the stores programmatical, the can also be used as backing storage for the Joseki SPARQL server, to offer a SPARQL endpoint.

link|flag
2

Sesame: in my experience it's very easy to setup and run, and the query performance is good. The docs and community are helpful. I've hardly bothered to look at other stores in months, which is both an endorsement of sesame but also a note that I can't compare it very well to the other projects :)

There are some missing optimizations for certain types of queries, especially with sorting. Using one of the RDBMS backends may help with that, but I've still had trouble with seemingly-simple things like "find the N most recent images".

link|flag
I am testing Sesame too at the moment. I also found it very easy to setup or to use as a library. – jorisslob Dec 22 at 14:28
2

I've been using the free version of AllegroGraph. Happy so far.

You might find this paper useful: "Comparison of Triple Stores" http://www.bioontology.org/wiki/images/6/6a/Triple_Stores.pdf

Also, check out the Tools page of the SemanticWeb wiki: http://semanticweb.org/wiki/Tools

link|flag
2

I think sesame is your best bet as far as ease of use and setting up. 4Store also is really quite good, and probably scales the best, I've used it in the past and have not hit a ceiling with it yet, but it suffers from some annoying missing features, like not being able to remove triples (you can only remove named graphs).

link|flag
2

Using Talis platform, have tried 4store and looked at a few others.

Talis Platform Pros: management is easy as its SaaS, no commitments to the platform, free developer account, good docs & community

Cons: everything goes via http maybe? you don't control every aspect of how things work

link|flag
1

I'm using rdflib + mysql as a backend storage. It's not particularly fast, but it's easy to setup. I plan to migrate after proper benchmarking and profiling.

link|flag
1

For PHP environments and small-sized applications (maybe up to 1MT) I can definitely advise to use ARC2. I have used it in several applications (such as in corrib) and am very happy with it.

link|flag
0

Hi Sashikiran,

The Virtuoso open source 32 & 64 bit Windows binaries can be downloaded from, http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VOSDownload

link|flag

Your Answer

Get an OpenID
or

Not the answer you're looking for? Browse other questions tagged or ask your own question.