Misplaced Pages

Help:Downloading pages: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 08:00, 18 October 2009 editIslanders27 (talk | contribs)1,970 edits + lead section← Previous edit Revision as of 01:06, 9 November 2009 edit undoLeevanjackson (talk | contribs)Extended confirmed users, Pending changes reviewers6,619 edits correcting wikilinks and removing one that has goneNext edit →
Line 3: Line 3:
== Downloading a MediaWiki page == == Downloading a MediaWiki page ==


] shows the possibilities for saving a local copy of a webpage. Alternatively (or in addition) one can copy the ], i.e. the text in the edit box (the source code within the database). ] shows the possibilities for saving a local copy of a webpage. Alternatively one can copy the ], i.e. the text in the edit box (the source code within the database).


Information in the webpage but not in the wikitext: Information in the webpage but not in the wikitext:
Line 19: Line 19:
*comments (even though HTML also allows comments) *comments (even though HTML also allows comments)


See also ], ]. See also ]


== Downloading some linked pages == == Downloading some linked pages ==
Line 40: Line 40:
== See also == == See also ==


*] *]

Revision as of 01:06, 9 November 2009

Downloading pages refers to the saving of a local copy of a page by a Misplaced Pages user. For more detailed information, see the sections below.

Downloading a MediaWiki page

Saving a web page shows the possibilities for saving a local copy of a webpage. Alternatively one can copy the wikitext, i.e. the text in the edit box (the source code within the database).

Information in the webpage but not in the wikitext:

  • images
  • content of templates referred to
  • values of variables
  • existence at the time of saving of linked internal pages
  • date and time of the last edit before saving
  • in the Image namespace (Image description pages): the image itself, the image history and the list of pages linking to the image
  • in the Category namespace: the lists of subcategories and pages in the category.

Information in the wikitext but not in the webpage:

  • comments (even though HTML also allows comments)

See also XML export

Downloading some linked pages

When saving a local copy of some MediaWiki pages, note the following.

A link to e.g. the train article in Misplaced Pages is given in the HTML-code as /Train . This refers to http://en.wikipedia.org/Train . Depending on your browser settings, the former may be changed into the latter when saving the page. To avoid this, apply View Source and save that.

Put the copy in folder C:\wiki (another drive letter is also possible, but wiki should not be a sub-folder) and do not use any file name extension. This way the links work. One inconvenient aspect is that you cannot open a file in a folder listing by clicking on it, because of the lack of a file name extension.

A problem with saving the source code is that images are not saved automatically with the page. Saving them separately in a place corresponding to the HTML code is cumbersome, e.g. the first image of the train article would have to be C:/upload/thumb/c/c2/250px-Tile_Hill_train_550.jpg

If the images are more important than the mutual links, then use the browser option to save the webpage with images.

Of course variations are possible by changing the HTML-code oneself, e.g. changing http://en.wikipedia.org to C: and/or adding the file name extension .htm .

On some sites outside Wikimedia, instead of "\wiki" another folder has to be used, see Help:URL.

When downloading pages from different sites to the same folder \wiki on the same drive, note that a page name can only be used once.

See also