Community Page
- phildawes.net/blog/
Recent Comments
- Hi, do you feel that your agility in Factor has improved since this post? Roger
- Thanks for the pointer - I've cleaned up the spam and regrettably added some moderation
- I'm loving the comments thread for this post. Can't decide whether to get my upholstery cleaned or do something about my fast food obesity.
- Cool - thanks Eric
- I pasted some code that does the moving sum in Factor: http://paste.factorcode.org/paste?id=569#282
Claire’s out tonight, so another evening spent on bulk RDF importing. I've managed to get the import of the original 120,705-statement dataset down to 77.6 seconds - that’s ~1500 triples a second!
The extra speed was mainly due to removing the need for database URI to id look ...
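As a quick sanity check on the rate quoted above, here is a minimal Python sketch of the arithmetic. The function name `triples_per_second` is made up for illustration and is not part of the importer.

```python
def triples_per_second(statements: int, seconds: float) -> float:
    """Average import throughput in triples per second."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return statements / seconds


# Figures from the post: 120,705 statements imported in 77.6 seconds.
rate = triples_per_second(120705, 77.6)
print(f"{rate:.0f} triples/s")  # prints "1555 triples/s"
```

So the "~1500 triples a second" figure is a slight round-down of the measured average.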
4 years ago
I believe the only way to load such amounts of data within a reasonable time on reasonable hardware is to use the underlying database's bulk loading facilities - I gather you chose a similar approach. We can load 6,000 triples per second, with most of the time spent building all the indexes...
4 years ago
When you say 6,000 triples a second, is this from RDF/XML, or already parsed into some sort of optimized format?