added wikipedia_namespaces.csv file
authorBenjamin Mako Hill <>
Thu, 23 Aug 2018 23:38:01 +0000 (16:38 -0700)
committerBenjamin Mako Hill <>
Thu, 23 Aug 2018 23:38:01 +0000 (16:38 -0700)
This file was required to run the scripts but was accidently not included.

wikipedia_namespaces.csv [new file with mode: 0644]

diff --git a/README b/README
index 7995e407d65ec4a6706a84af414afe83bc1ea0a8..342776d76f494ce2e78ca5a5e857a7bf48cbde99 100644 (file)
--- a/README
+++ b/README
@@ -75,6 +75,15 @@ Running the Software
 - GNU R
 - `data.table` R package available on CRAN
+There is also a dependency on a file called `wikipedia_namespaces.csv`
+which is included in this repository and which is drawn from data on
+this page:
+This file is taken from English Wikipedia in 2015. If you are working
+with different wikis or with an updated dump, you will likely to need
+to update this file.
 1. Download Dumps
diff --git a/wikipedia_namespaces.csv b/wikipedia_namespaces.csv
new file mode 100644 (file)
index 0000000..7474d7f
--- /dev/null
@@ -0,0 +1,39 @@
+3,User talk,FALSE
+5,Wikipedia talk,FALSE
+7,File talk,FALSE
+9,MediaWiki talk,FALSE
+11,Template talk,FALSE
+13,Help talk,FALSE
+15,Category talk,FALSE
+101,Portal talk,FALSE
+109,Book talk,FALSE
+119,Draft talk,FALSE
+446,Education Program,FALSE
+447,Education Program talk,FALSE
+711,TimedText talk,FALSE
+829,Module talk,FALSE
+5,Project talk,TRUE
+7,Image talk,TRUE

Benjamin Mako Hill || Want to submit a patch?