1 wikiq: a WikiMedia XML data dump to .tsv parser
3 author: Erik Garrison <erik@hypervolu.me>
5 wikiq is written in C using expat. It is designed to enable researchers to
6 rapidly extract revision histories (minus text and comments) from impossibly
9 To use, first make sure you have libexpat installed, then:
12 % ./wikiq -h # prints usage
13 % 7za e -so hugewikidatadump.xml | ./wikiq >hugewikidatadump.tsv