added a perl script to clean out some common broken encoding stuff

Benjamin Mako Hill || Want to submit a patch?