WikiDragon

The Neo4Wikipedia module was developed in order to make Wikipedia accessible as a linguistical resource for present-day language corpora. Neo4Wikipedia supports local, high-quality, memory-efficient representations of any number of versions of Wikipedia in a single database. The WikiDragon API [1], developed specifically for this purpose, creates and stores Wikipedia-releases in a Neo4J-based graph database. The WikiDragon API makes it possible to access, with full-text search, all revisions of wiki pages, their meta-data, and associated link- and category-structure. The data processed by the WikiDragon API undergoes a stochastic network analysis [2, 3].

Process chain and architecture of Neo4Wikipedia (Gleim in preparation)


[1] [pdf] A. Mehler, C. Stegbauer, and R. Gleim, “Latent Barriers in Wiki-based Collaborative Writing,” in Proceedings of the Wikipedia Academy: Research and Free Knowledge. June 29 – July 1 2012, Berlin, 2012.
[Bibtex]
@InProceedings{Mehler:Stegbauer:Gleim:2012:b,
  Author         = {Mehler, Alexander and Stegbauer, Christian and Gleim,
                   Rüdiger},
  Title          = {Latent Barriers in Wiki-based Collaborative Writing},
  BookTitle      = {Proceedings of the Wikipedia Academy: Research and
                   Free Knowledge. June 29 - July 1 2012},
  Address        = {Berlin},
  month          = {July},
  pdf            = {https://www.texttechnologylab.org/wp-content/uploads/2015/08/12_Paper_Alexander_Mehler_Christian_Stegbauer_Ruediger_Gleim.pdf},
  year           = 2012
}
[2] [pdf] A. Mehler and C. Stegbauer, “On the Self-similarity of Intertextual Structures in Wikipedia,” in Proceedings of the HotSocial ’12: The First ACM International Workshop on Hot Topics on Interdisciplinary Social Networks Research, Beijing, China, 2012, pp. 65-68.
[Bibtex]
@InProceedings{Mehler:Stegbauer:2012,
  Author         = {Mehler, Alexander and Stegbauer, Christian},
  Title          = {On the Self-similarity of Intertextual Structures in
                   Wikipedia},
  BookTitle      = {Proceedings of the HotSocial '12: The First ACM
                   International Workshop on Hot Topics on
                   Interdisciplinary Social Networks Research},
  Editor         = {Xiaoming Fu and Peter Gloor and Jie Tang},
  Pages          = {65-68},
  Address        = {Beijing, China},
  pdf            = {http://wan.poly.edu/KDD2012/forms/workshop/HotSocial12/doc/p64_mehler.pdf},
  website        = {http://dl.acm.org/citation.cfm?id=2392633&bnc=1},
  year           = 2012
}
[3] A. Mehler, C. Stegbauer, and R. Gleim, “Zur Struktur und Dynamik der kollaborativen Plagiatsdokumentation am Beispiel des GuttenPlag Wiki: eine Vorstudie,” in Die Dynamik sozialer und sprachlicher Netzwerke. Konzepte, Methoden und empirische Untersuchungen am Beispiel des WWW, B. Frank-Job, A. Mehler, and T. Sutter, Eds., Wiesbaden: VS Verlag, 2013.
[Bibtex]
@InCollection{Mehler:Stegbauer:Gleim:2013,
  Author         = {Mehler, Alexander and Stegbauer, Christian and Gleim,
                   Rüdiger},
  Title          = {Zur Struktur und Dynamik der kollaborativen
                   Plagiatsdokumentation am Beispiel des GuttenPlag Wiki:
                   eine Vorstudie},
  BookTitle      = {Die Dynamik sozialer und sprachlicher Netzwerke.
                   Konzepte, Methoden und empirische Untersuchungen am
                   Beispiel des WWW},
  Publisher      = {VS Verlag},
  Editor         = {Frank-Job, Barbara and Mehler, Alexander and Sutter,
                   Tilman},
  Address        = {Wiesbaden},
  year           = 2013
}