Misplaced Pages

Help:Downloading pages: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editContent deleted Content addedVisualWikitext
Revision as of 04:47, 25 November 2005 editUncle G's 'bot (talk | contribs)Extended confirmed users7,076 edits TRANSWIKI: Copied from Meta:Help:Downloading pages.← Previous edit Latest revision as of 08:54, 30 October 2021 edit undoJohn of Reading (talk | contribs)Autopatrolled, Extended confirmed users, Pending changes reviewers767,601 edits Reverted 1 edit by 2600:1700:7B1:3B30:C07A:DBA6:CED1:3D82 (talk): That's off-topic for this Misplaced Pages help pageTags: Twinkle Undo 
(22 intermediate revisions by 18 users not shown)
Line 1: Line 1:
{{Misplaced Pages how-to|H:DP}}
{{H:h|reader toc}}
'''Downloading pages''' is saving a MediaWiki page as a local copy.


== Downloading a MediaWiki page == == Downloading a page as HTML file ==


] shows the possibilities for saving a local copy of a webpage. Alternatively (or in addition) one can copy the ], i.e. the text in the edit box (the source code within the database). Saving a ] shows the possibilities for saving a local copy of a webpage.


=== A set of linked pages ===
Information in the webpage but not in the wikitext:
When saving a local copy of pages, please note the following.


A link to e.g. the train article in Misplaced Pages is given in the HTML-code as <code>/Train</code> . This refers to <code>http://en.wikipedia.org/Train</code> . Depending on your browser settings, the former may be changed into the latter when saving the page. To avoid this, apply View Source and save that.
*images
*content of ]s referred to
*values of ]s
*existence at the time of saving of linked internal pages
*date and time of the last edit before saving
*in the Image namespace (]s): the image itself, the image history and the list of pages linking to the image
*in the Category namespace: the lists of subcategories and pages in the category.


Put the copy in folder <code>C:\wiki</code> (another drive letter is also possible, but wiki should not be a sub-folder) and do not use any file name extension. This way the links work. One inconvenient aspect is that you cannot open a file in a folder listing by clicking on it, because of the lack of a file name extension.
Information in the wikitext but not in the webpage:


A problem with saving the source code is that images are not saved automatically with the page. Saving them separately in a place corresponding to the HTML code is cumbersome, e.g. the first image of the train article would have to be <code>C:/upload/thumb/c/c2/250px-Tile_Hill_train_550.jpg</code>
*comments (even though HTML also allows comments)


If the images are more important than the mutual links, then one can use the browser 'save page' option to save the webpage with images.
See also ], ].


Of course variations are possible by changing the HTML-code oneself, e.g. changing <code>http://en.wikipedia.org</code> to <code>C:</code> and/or adding the file name extension .htm .
== Downloading some linked pages ==
When saving a local copy of some MediaWiki pages, note the following.


== Downloading the wikitext of a page ==
A link to e.g. the train article in Misplaced Pages is given in the HTML-code as /Train . This refers to http://en.wikipedia.org/Train . Depending on the browser (settings) the former may be changed into the latter when saving the page. To avoid this, apply View Source and save that.


Alternatively one can copy the ], i.e. the text in the edit box (the source code within the database).
Put the copy in folder C:\wiki (another drive letter is also possible, but wiki should not be a subfolder) and do not use any file name extension. This way the links work. Inconvenient is that you can not open a file in a folder listing by clicking on it, because of the lack of a file name extension.


This has a limited use. There is more information in the webpage than conveyed by the wikitext:
A problem with saving the source code is that images are not saved automatically with the page. Saving them separately in a place corresponding to the HTML code is cumbersome, e.g. the first image of the train article would have to be C:/upload/thumb/c/c2/250px-Tile_Hill_train_550.jpg


*images
If the images are more important than the mutual links, then use the browser option to save the webpage with images.
*content of ]s referred to (to access a full wikitext, with all templates being expanded, use ]; it has the effect of the recursively applied "]:" operation).
*values of ]s
*existence at the time of saving of linked internal pages
*date and time of the last edit before saving
*in the Image namespace (]s): the image itself, the image history and the list of pages linking to the image
*in the Category namespace: the lists of subcategories and pages in the category.


Information in the wikitext but not in the webpage:
Of course variations are possible by changing the HTML-code oneself, e.g. changing http://en.wikipedia.org to C: and/or adding the file name extension .htm .


*comments (even though HTML also allows comments)
On some sites outside Wikimedia, instead of "\wiki" another folder has to be used, see ].


See also ].
When downloading pages from different sites to the same folder \wiki on the same drive, note that a page name can only be used once.


==See also== == See also ==


*] *]
*], for downloading to another MediaWiki site


{{Help navigation}}
{{h:f|langs=|enname=Downloading pages}}

Latest revision as of 08:54, 30 October 2021

This help page is a how-to guide.
It explains concepts or processes used by the Misplaced Pages community. It is not one of Misplaced Pages's policies or guidelines, and may reflect varying levels of consensus.
Shortcut

Downloading pages is saving a MediaWiki page as a local copy.

Downloading a page as HTML file

Saving a webpage shows the possibilities for saving a local copy of a webpage.

A set of linked pages

When saving a local copy of pages, please note the following.

A link to e.g. the train article in Misplaced Pages is given in the HTML-code as /Train . This refers to http://en.wikipedia.org/Train . Depending on your browser settings, the former may be changed into the latter when saving the page. To avoid this, apply View Source and save that.

Put the copy in folder C:\wiki (another drive letter is also possible, but wiki should not be a sub-folder) and do not use any file name extension. This way the links work. One inconvenient aspect is that you cannot open a file in a folder listing by clicking on it, because of the lack of a file name extension.

A problem with saving the source code is that images are not saved automatically with the page. Saving them separately in a place corresponding to the HTML code is cumbersome, e.g. the first image of the train article would have to be C:/upload/thumb/c/c2/250px-Tile_Hill_train_550.jpg

If the images are more important than the mutual links, then one can use the browser 'save page' option to save the webpage with images.

Of course variations are possible by changing the HTML-code oneself, e.g. changing http://en.wikipedia.org to C: and/or adding the file name extension .htm .

Downloading the wikitext of a page

Alternatively one can copy the wikitext, i.e. the text in the edit box (the source code within the database).

This has a limited use. There is more information in the webpage than conveyed by the wikitext:

  • images
  • content of templates referred to (to access a full wikitext, with all templates being expanded, use Special:ExpandTemplates; it has the effect of the recursively applied "subst:" operation).
  • values of variables
  • existence at the time of saving of linked internal pages
  • date and time of the last edit before saving
  • in the Image namespace (Image description pages): the image itself, the image history and the list of pages linking to the image
  • in the Category namespace: the lists of subcategories and pages in the category.

Information in the wikitext but not in the webpage:

  • comments (even though HTML also allows comments)

See also XML export.

See also

Misplaced Pages help pages

About Misplaced Pages (?)
Help for readers (?)
Contributing
to Misplaced Pages
 (?)
Getting started (?)
Dos and don'ts (?)
How-to pages and
information pages (?)
Coding (?)
Directories (?)
Missing Manual
Ask for help on your talk page (?)
Category: