Making a English Wikipedia server: Difference between revisions
m Fixed link syntax |
m Struck through obsolete MWDumper - PHP import method is now the best way and will be covered at some point |
| Line 1: | Line 1: | ||
[https://www.mediawiki.org MediaWiki] makes dumps of the English Wikipedia about once a month. As it is | [https://www.mediawiki.org MediaWiki] makes dumps of the English Wikipedia about once a month. As it is free and open source content, you can use these dumps to make your own server with the English Wikipedia content in it. As the English Wikipedia is the largest Wikipedia, it does take a while to import the dumps, but it is by no means impossible. | ||
This guide will help you along the way. | This guide will help you along the way. | ||
| Line 14: | Line 14: | ||
* MySQL/MariaDB | * MySQL/MariaDB | ||
* A dump of your Wikipedia of choice | * A dump of your Wikipedia of choice | ||
* MWDumper | * <s>MWDumper</s> | ||
* The latest Java JRE and Java JDK from the [https://www.java.com Oracle Java] website | * <s>The latest Java JRE and Java JDK from the [https://www.java.com Oracle Java] website</s> | ||
=== Before you start === | === Before you start === | ||
| Line 27: | Line 27: | ||
=== Downloading the dumps === | === Downloading the dumps === | ||
# The dumps for English Wikipedia are available from | # The dumps for English Wikipedia are available from [https://dumps.wikimedia.org/enwiki/ here]. When there, you'll obviously want to select the latest date. | ||
# Once there, you'll need to download the following: | # Once there, you'll need to download the following: | ||
#* <code>enwiki-<date>-pages-articles.xml.bz2</code> (This is the latest revision of every Wikipedia page, article and template - the basis you need to get going) | #* <code>enwiki-<date>-pages-articles.xml.bz2</code> (This is the latest revision of every Wikipedia page, article and template - the basis you need to get going) | ||
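As a rough sketch of where that file lives, the snippet below builds the download URL for a given dump date. It assumes the usual layout of the dump server (<code>https://dumps.wikimedia.org/enwiki/<date>/</code>, with the file named <code>enwiki-<date>-pages-articles.xml.bz2</code>). The date <code>20240101</code> is only a placeholder, and <code>dump_url</code> is a hypothetical helper, not part of any Wikimedia tool.

```python
def dump_url(date: str, wiki: str = "enwiki") -> str:
    """Build the URL of the pages-articles dump for one dump date.

    `date` is the dump directory name in YYYYMMDD form. The directory
    layout used here is an assumption based on how the dump server is
    normally organised; check the listing page before relying on it.
    """
    if len(date) != 8 or not date.isdigit():
        raise ValueError("date must look like YYYYMMDD, e.g. 20240101")
    filename = f"{wiki}-{date}-pages-articles.xml.bz2"
    return f"https://dumps.wikimedia.org/{wiki}/{date}/{filename}"


if __name__ == "__main__":
    # Placeholder date: replace it with the latest date shown on the dump server.
    print(dump_url("20240101"))
```

The resulting URL can be handed to whatever download tool you prefer. One that can resume interrupted transfers is a sensible choice, as the file is very large.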
| Line 37: | Line 37: | ||
=== Downloading MWDumper === | === Downloading MWDumper === | ||
# MWDumper is available from many places around the Internet, both in source form and as pre-built Java packages. You will need to download a copy from [https://integration.wikimedia.org/ci/view/Java/job/MWDumper-package/ Jenkins]; this one is pre-built by MediaWiki. MWDumper 1.16 (26/06/2013) was the latest at the time of writing. | # <s>MWDumper is available from many places around the Internet, both in source form and as pre-built Java packages. You will need to download a copy from [https://integration.wikimedia.org/ci/view/Java/job/MWDumper-package/ Jenkins]; this one is pre-built by MediaWiki. MWDumper 1.16 (26/06/2013) was the latest at the time of writing. | ||
# You will need to remove any versions of OpenJDK already installed (remove <code>libreoffice-calc-extensions</code> and <code>libreoffice-writer-extensions</code> before OpenJDK so that it doesn't try to install another version of Java). | # You will need to remove any versions of OpenJDK already installed (remove <code>libreoffice-calc-extensions</code> and <code>libreoffice-writer-extensions</code> before OpenJDK so that it doesn't try to install another version of Java). | ||
# You will then need to install the latest Oracle Java JRE and JDK packages (64-bit packages are safe and better for this as we don't need the web plugin) | # You will then need to install the latest Oracle Java JRE and JDK packages (64-bit packages are safe and better for this as we don't need the web plugin)</s> | ||
=== Downloading and installing MediaWiki === | === Downloading and installing MediaWiki === | ||