Skip to content
PiRSquared17 edited this page Sep 23, 2014 · 9 revisions

MediaWiki

List of wikis

Already exists.

Dump algorithm

Already exists.

Dump format.

MediaWiki XML dump + extras. Notes:

DokuWiki

List of wikis

Dump algorithm

Sketch:

====== ~~INFO:syntaxmodes~~ ======
~~INFO:syntaxmodes~~
====== ~~INFO:syntaxtypes~~ ======
~~INFO:syntaxtypes~~
====== ~~INFO:syntaxplugins~~ ======
~~INFO:syntaxplugins~~
====== ~~INFO:adminplugins~~ ======
~~INFO:adminplugins~~
====== ~~INFO:actionplugins~~ ======
~~INFO:actionplugins~~
====== ~~INFO:rendererplugins~~ ======
~~INFO:rendererplugins~~
====== ~~INFO:helperplugins~~ ======
~~INFO:helperplugins~~
====== ~~INFO:helpermethods~~ ======
~~INFO:helpermethods~~
====== ~~INFO:authplugins~~ ======
~~INFO:authplugins~~
====== ~~INFO:remoteplugins ~~ ======
~~INFO:remoteplugins~~
====== ~~INFO:version~~ ======
~~INFO:version~~

Dump format

Compressed https://www.dokuwiki.org/devel:dirlayout directory. cache, index, locks, tmp probably not needed.

MoinMoin

...

UseModWiki, OddMuseWiki, etc.

List of wikis

Dump algorithm

  • Check if raw=1 is available.
  • Get list of pages
  • Use http://usemod.com/cgi-bin/wiki.pl?action=index (add &raw=1 if available).
  • Download current version only
  • For each page title, either get action=browse&id=FooBar&raw=1 (preferable) or action=edit&id=FooBar. If raw not available, scrape textarea content of edit box.
  • Loop
  • Get history of each page (note: UseModWiki history is not permanent!)
  • Use action=history&id=FooBar
  • Parse.
  • For each revision, download raw content:
  • If action=browse&id=Foo&revision=123&raw=1 is available, use that. Otherwise, use action=edit&id=Foo&revision=123
  • Get images.
  • Go through each saved page text, and search for image URLs defined using same regex as UseModWiki uses.
  • Save image.
  • Save site version/metadata.
  • Save action=version. In UseModWiki, this is not very useful, but it's cool to have for Oddmuse. Example: http://communitywiki.org/?action=version

Dump format

http://www.usemod.com/cgi-bin/wiki.pl?DataBase

Welcome to the WikiTeam documentation wiki! We are a group dedicated to archiving wikis around the Internet, and you are invited to be part of it! Find out more.


Clone this wiki locally