Data Import Process

Importing log data generated by the Distributed.Net proxy server into the Personal Proxy Statistics database has always been the slowest and most disk-intensive part of creating stats. Every system administrator who has ever run ppstats has emailed me to ask whether this could be fixed somehow, perhaps with a cache file of some sort. Because it is actually a fairly complicated problem, my standard response has always been "Wait for version 8.0". Well, this is version 8.0, and now we have to deal with this problem head-on.

There are a few key goals to keep in mind when designing the data import process. Some are obvious, but they are worth restating.

