From: Benjamin Mako Hill Date: Fri, 9 Aug 2013 01:12:31 +0000 (-0400) Subject: finished version of talk for presentation X-Git-Url: https://projects.mako.cc/source/state_of_wikimedia_research_2013/commitdiff_plain/HEAD?ds=inline finished version of talk for presentation --- diff --git a/20130809-wikimania_research.tex b/20130809-wikimania_research.tex index 58bb079..f7e4993 100644 --- a/20130809-wikimania_research.tex +++ b/20130809-wikimania_research.tex @@ -248,7 +248,7 @@ So anyway, I had a 45 minute talk so it worked out to 3.45 seconds to per paper... - And believe it or, this year is even bigger. + And believe it or not, this year is even bigger. And my talk is even shorter.} @@ -300,9 +300,9 @@ \item Research by people who are \e{not at Wikimania}. \end{itemize} - Within these goals, the selections are \e{incomplete}, and \e{wrong}. + \note{This is my disclaimer slide... - \note{This is my disclaimer slide...} + Within these goals, the selections are \e{incomplete}, and \e{wrong}.} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% @@ -340,7 +340,7 @@ \begin{itemize} \larger \larger \larger \item Compulsive collection - \item Stigmertic accumluative + \item Stigmergic accumluation \item Corporate production \end{itemize} @@ -360,22 +360,20 @@ references accumulation. In the past, this meant piracy and building off of others. In Wikipedia, it means revision, incorporation of other sources, and more. - \item \e{Corporate productin} means working together with many + \item \e{Corporate production} means working together with many other people. Diderot took advantage of at least 140 different - authors. Think the OED collecting information from + authors. Think of the OED collecting information from others. Wikipedia of course uses a similar model. \end{itemize} In each case, they think that Wikipedia's model is not a total - break from the past in the way many people talk abou it.} + break from the past in the way many people talk about it.} - \note<2>{Now my own bias as a reseacher is to look to more - quantitative or easy to apply work. - - \e{Takeaway:} But I think is a great example how much of the more humanities + \note<2>{\e{Takeaway:} But I think is a great example how much of the more humanities focused work on Wikipedia can do a wonderful job of providing us context and a better way to think about and talk about what we're doing.} + \end{frame} @@ -384,7 +382,7 @@ %% SLIDE: Citation %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% -\begin{frame}{Wikipedia as Data Source} +\begin{frame}{Wikipedia as a Data Source} \larger \larger @@ -451,12 +449,9 @@ \note{There is little industry of articles designed to evaluate Wikipedia's quality. There are literally dozens of these each - year. And one that thing that frustrates me is that its very rare - that the people doing these coordinate with Wikipedia or that - Wikipedians systematically reach out to the people doing these to - learn. - - This is an example of one from pediatric otolayrnology. That is, + year. + + This is an example of one from pediatric otorhinolayrnology. That is, the study of dieases of the ear, nose, and throat -- in children.} \end{frame} @@ -503,13 +498,13 @@ } \note{Like many of these studies, this study cmpares Wikipedia to - other sites. In this case, eMedicne, and Medicine Plus. They used + other sites. In this case, eMedicine, and Medicine Plus. They used a series of textbooks and experts to evaluate the the content errors and they used some standard systems to evaluate usability and reading level. They find that Wikipedia has the most errors, the least accuracy, - aa medium reading level. But similar in most cases to MedLinePlus. + and a medium reading level. But in most cases it is similar to MedLinePlus. And Wikipedia had a rather good user interface compared to the others. @@ -517,15 +512,17 @@ I'm not sure what that says about the others user interface. \e{Takeaway:} We need to be better about getting these datsets and - helping integrate these into improving the encyclopedia.} + helping integrate these into improving the encyclopedia. One thing that frustrates me is that it's very rare + that the people doing these studies coordinate with Wikipedia or that + Wikipedians systematically reach out. At a minimum, we could try to collect a list of errors papers like this discover and correct them.} \end{frame} \subsection{Perception of Quality} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% -%% SLIDE: Perception of Quality +%% SLIDE: Viewership %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% -\begin{frame}{Perception of Quality} +\begin{frame}{Viewership} \larger \larger Towne, W. Ben, Aniket Kittur, Peter Kinnaird, and James Herbsleb. “Your Process Is Showing: Controversy Management and @@ -565,20 +562,20 @@ \begin{itemize} \item When discussion is shown, quality rating were significantly lower. \item When discussion involving conflict was displayed, article - quality ratings were even lower. + quality ratings were even lower yet. \item If the editors involved in the conflict resolved it through a positive collaboration approach, the negative effects of conflict disappeared. \item Participants reported that reading the discussion raised their perceptions of both the article’s quality and Wikipedia in general. (i.e., they were not aware of the rating-lowering - effect of the discussion, and generally.) + effects of the discussion.) \end{itemize} \e{Takeaway:} There's a deep and interesting tradeoff that cuts to the core of Wikimedia's two missions to empower folks by getting involved in the process to display material. This kind of work - explores big important questions at the heart of the foundations + explores big important questions at the heart of the foundation's work.} \end{frame} @@ -626,7 +623,7 @@ represents a set of tools and techniques from which the community could directly benefit. - \e{Takeaway:} We need to get better about working with all the + \e{Takeaway:} Again, we need to get better about working with all the people, like this, building tools for our communities.} @@ -711,15 +708,15 @@ \item \e{Much More} \end{itemize} - \note{Those are my six postcards. + \note{Those are my six exemplary studies from the past year. There has been just tons and tons of work in this area. Trying to talk about this in 20 minutes strikes me as increasingly crazy every year I try to do it. The most important source, now going for a couple years, is the - Wikimedia Research Newsletter which is published monthly in the - signpost. + Wikimedia Research Newsletter which is published monthly in the (English) + Signpost. But there are other resources as well. And I encourage you to get involved.} diff --git a/Makefile b/Makefile index 82020c3..ac30864 100644 --- a/Makefile +++ b/Makefile @@ -5,7 +5,6 @@ pdf: all %.pdf: %.tex rubber -fd $< - extract-notes $< # by default, we produce combined notes/slides output notes.config: diff --git a/extract-notes b/extract-notes deleted file mode 100755 index 0d11083..0000000 --- a/extract-notes +++ /dev/null @@ -1,48 +0,0 @@ -#!/usr/bin/perl -w - -foreach $filename (@ARGV) { - chomp $filename; - open(TEXFILE, "<$filename"); - - $notes_filename = $filename; - $notes_filename =~ s/\.tex$/.pdfpc/; - - $pdf_filename = $filename; - $pdf_filename =~ s/\.tex$/.pdf/; - - open(NOTESFILE, "\>$notes_filename"); - print NOTESFILE "[file]\n$pdf_filename\n[notes]\n"; - - $note_record = 0; - $slide_counter = 1; - while ($line = ) { - # if this is a new frame - if ($line =~ /^\\begin\{frame\}/) { - $slide_counter++; - } - - if ($line =~ /^\% pdfpc\:end/) { - $note_record = 0; - $note_text =~ s/^\s*(.*?)\s*$/$1/; - print NOTESFILE "\#\#\# $slide_counter\n$note_text\n"; - } - - # record if we're between a begin and an end - if ($note_record) { - $line_tmp = $line; - $line_tmp =~ s/^\%\s*//; - $note_text = $note_text . $line_tmp; - } - - # if this a new note - if ($line =~ /^\% pdfpc\:begin/) { - $note_text = ""; - $note_record = 1; - } - } - - close(TEXFILE); - close(NOTESFILE); - -} - diff --git a/figures/multiple_issues.png b/figures/multiple_issues.png new file mode 100644 index 0000000..5f26cf5 Binary files /dev/null and b/figures/multiple_issues.png differ diff --git a/figures/multiple_issues_screenshot.png b/figures/multiple_issues_screenshot.png new file mode 100644 index 0000000..5f26cf5 Binary files /dev/null and b/figures/multiple_issues_screenshot.png differ diff --git a/figures/wikipeda_citations_bytime.png b/figures/wikipeda_citations_bytime.png new file mode 100644 index 0000000..d5377a3 Binary files /dev/null and b/figures/wikipeda_citations_bytime.png differ