<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Phil Dawes' Stuff - Latest Comments in importing statements at speed</title><link>http://phildawesstuff.disqus.com/</link><description></description><atom:link href="https://phildawesstuff.disqus.com/importing_statements_at_speed/latest.rss" rel="self"></atom:link><language>en</language><lastBuildDate>Thu, 23 Sep 2004 03:28:06 -0000</lastBuildDate><item><title>Re: importing statements at speed</title><link>http://www.phildawes.net/blog/2004/09/22/importing-statements-at-speed/#comment-2752886</link><description>&lt;p&gt;Wow, those figures are looking good. &lt;br&gt;If you're looking at big data it may be worth keeping one eye on developments in RFC3229 and feeds as a possible means of syncing big stores, see:&lt;br&gt;&lt;a href="http://bobwyman.pubsub.com/" rel="nofollow noopener" target="_blank" title="http://bobwyman.pubsub.com/"&gt;http://bobwyman.pubsub.com/&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">danja</dc:creator><pubDate>Thu, 23 Sep 2004 03:28:06 -0000</pubDate></item><item><title>Re: importing statements at speed</title><link>http://www.phildawes.net/blog/2004/09/22/importing-statements-at-speed/#comment-2752885</link><description>&lt;p&gt;That's it - many thanks.&lt;/p&gt;&lt;p&gt;Have been thinking about chunked imports too - makes sense with the bulk-import approach, since most of the time is spent parsing and preparing the data rather than actually dumping it to the db. Chunking it would enable parallelization of the time-consuming preparation stage. Would work best with something like 3store, which stores md5 hashes for URIs in its main triples table - no centralization required to agree on IDs for URIs.&lt;/p&gt;&lt;p&gt;Unfortunately (from this perspective), my store uses generated IDs for URIs to enable a 1:many logical-resource -&amp;gt; URI mapping for smushing.
Would probably need to import in chunks and then reconcile IDs in a subsequent sweep.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Phil Dawes</dc:creator><pubDate>Thu, 23 Sep 2004 02:59:06 -0000</pubDate></item><item><title>Re: importing statements at speed</title><link>http://www.phildawes.net/blog/2004/09/22/importing-statements-at-speed/#comment-2752884</link><description>&lt;p&gt;Hi Phil! Nice work. Re data, you might be thinking of &lt;a href="http://rdfdata.org/" rel="nofollow noopener" target="_blank" title="http://rdfdata.org/"&gt;http://rdfdata.org/&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Re imports, I'd wondered about doing this sort of thing chunked into, say, 10000-triple blocks, for large imports. But yes, having a benchmarking framework would take out some of the guesswork...&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Dan Brickley</dc:creator><pubDate>Wed, 22 Sep 2004 19:58:26 -0000</pubDate></item></channel></rss>