A Vision for Performing Social and Economic Data Analysis using Wikipedia's Edit History

Lade...
Vorschaubild
Dateien
Dahm_2-1l512au51xto19.pdf
Dahm_2-1l512au51xto19.pdfGröße: 1.04 MBDownloads: 350
Datum
2017
Herausgeber:innen
Kontakt
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
ArXiv-ID
Internationale Patentnummer
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Open Access Green
Core Facility der Universität Konstanz
Gesperrt bis
Titel in einer weiteren Sprache
Forschungsvorhaben
Organisationseinheiten
Zeitschriftenheft
Publikationstyp
Beitrag zu einem Konferenzband
Publikationsstatus
Published
Erschienen in
BARRETT, Rick, ed.. WWW'17 Companion : Proceedings of the 26th International Conference on World Wide Web. New York, NY, USA: ACM Press, 2017, pp. 1627-1634. ISBN 978-1-4503-4914-7. Available under: doi: 10.1145/3041021.3053363
Zusammenfassung

In this vision paper, we suggest combining two lines of research to study the collective behavior of Wikipedia contributors. The rst line of research analyzes Wikipedia's edit history to quantify the quality of individual contributions and the resulting reputation of the contributor. The second line of research surveys Wikipedia contributors to gain insights, e.g., on their personal and professional background, socioeconomic status, or motives to contribute toWikipedia. While both lines of research are valuable on their own, we argue that the combination of both approaches could yield insights that exceed the sum of the individual parts. Linking survey data to contributor reputation and content-based quality metrics could provide a large-scale, public domain data set to perform user modeling, i.e. deducing interest pro les of user groups. User pro les can, among other applications, help to improve recommender systems. The resulting dataset can also enable a better understanding and improved prediction of high quality Wikipedia content and successfulWikipedia contributors. Furthermore, the dataset can enable novel research approaches to investigate team composition and collective behavior as well as help to identify domain experts and young talents. We report on the status of implementing our large-scale, content-based analysis of the Wikipedia edit history using the big data processing framework Apache Flink. Additionally, we describe our plans to conduct a survey among Wikipedia contributors to enhance the content-based quality metrics.

Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
004 Informatik
Schlagwörter
Wikipedia; Author Reputation; Article Quality; Editor Types
Konferenz
26th International Conference on World Wide Web, 3. Apr. 2017 - 7. Apr. 2017, Perth, Australia
Rezension
undefined / . - undefined, undefined
Zitieren
ISO 690DAHM, Erik, Moritz SCHUBOTZ, Norman MEUSCHKE, Bela GIPP, 2017. A Vision for Performing Social and Economic Data Analysis using Wikipedia's Edit History. 26th International Conference on World Wide Web. Perth, Australia, 3. Apr. 2017 - 7. Apr. 2017. In: BARRETT, Rick, ed.. WWW'17 Companion : Proceedings of the 26th International Conference on World Wide Web. New York, NY, USA: ACM Press, 2017, pp. 1627-1634. ISBN 978-1-4503-4914-7. Available under: doi: 10.1145/3041021.3053363
BibTex
@inproceedings{Dahm2017Visio-41870,
  year={2017},
  doi={10.1145/3041021.3053363},
  title={A Vision for Performing Social and Economic Data Analysis using Wikipedia's Edit History},
  isbn={978-1-4503-4914-7},
  publisher={ACM Press},
  address={New York, NY, USA},
  booktitle={WWW'17 Companion : Proceedings of the 26th International Conference on World Wide Web},
  pages={1627--1634},
  editor={Barrett, Rick},
  author={Dahm, Erik and Schubotz, Moritz and Meuschke, Norman and Gipp, Bela}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/41870">
    <dc:creator>Gipp, Bela</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-03-21T09:50:22Z</dcterms:available>
    <dc:contributor>Meuschke, Norman</dc:contributor>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/41870/1/Dahm_2-1l512au51xto19.pdf"/>
    <dc:creator>Schubotz, Moritz</dc:creator>
    <dcterms:title>A Vision for Performing Social and Economic Data Analysis using Wikipedia's Edit History</dcterms:title>
    <dc:language>eng</dc:language>
    <dc:contributor>Dahm, Erik</dc:contributor>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:abstract xml:lang="eng">In this vision paper, we suggest combining two lines of research to study the collective behavior of Wikipedia contributors. The  rst line of research analyzes Wikipedia's edit history to quantify the quality of individual contributions and the resulting reputation of the contributor. The second line of research surveys Wikipedia contributors to gain insights, e.g., on their personal and professional background, socioeconomic status, or motives to contribute toWikipedia. While both lines of research are valuable on their own, we argue that the combination of both approaches could yield insights that exceed the sum of the individual parts. Linking survey data to contributor reputation and content-based quality metrics could provide a large-scale, public domain data set to perform user modeling, i.e. deducing interest pro les of user groups. User pro les can, among other applications, help to improve recommender systems. The resulting dataset can also enable a better understanding and improved prediction of high quality Wikipedia content and successfulWikipedia contributors. Furthermore, the dataset can enable novel research approaches to investigate team composition and collective behavior as well as help to identify domain experts and young talents. We report on the status of implementing our large-scale, content-based analysis of the Wikipedia edit history using the big data processing framework Apache Flink. Additionally, we describe our plans to conduct a survey among Wikipedia contributors to enhance the content-based quality metrics.</dcterms:abstract>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2018-03-21T09:50:22Z</dc:date>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:creator>Meuschke, Norman</dc:creator>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/41870"/>
    <dcterms:issued>2017</dcterms:issued>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/41870/1/Dahm_2-1l512au51xto19.pdf"/>
    <dc:creator>Dahm, Erik</dc:creator>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dc:contributor>Gipp, Bela</dc:contributor>
    <dc:contributor>Schubotz, Moritz</dc:contributor>
  </rdf:Description>
</rdf:RDF>
Interner Vermerk
xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter
Kontakt
URL der Originalveröffentl.
Prüfdatum der URL
Prüfungsdatum der Dissertation
Finanzierungsart
Kommentar zur Publikation
Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Diese Publikation teilen