jawiki dump progress on 20191201
This is the Wikimedia dump service.
Please read the copyright information.
See Meta:Data dumps for documentation on the provided data formats.
The 7zip decoder on Windows is known to have problems with some bz2-format files for larger wikis; we recommend the use of bzip2 for Windows for these cases.
Please report problems with these dumps on Phabricator and add the Dumps-generation tag.
See the list of all databases.
Last dumped on 2019-11-20
For a machine-readable version of the information on this page, see the json status file.
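A minimal sketch of how a script might read such a status file. The layout used here (a top-level "jobs" object whose entries carry a "status" field) and the job names in the sample are assumptions made for illustration; check the actual file before relying on them.

```python
import json

def unfinished_jobs(status):
    """Return the names of jobs whose status is not 'done'.

    Assumes (hypothetically) a layout like:
    {"jobs": {"<job name>": {"status": "...", "updated": "..."}}}
    """
    jobs = status.get("jobs", {})
    return sorted(
        name for name, info in jobs.items()
        if info.get("status") != "done"
    )

# Hypothetical sample document; job names are placeholders, not real ones.
sample = json.loads("""
{
  "jobs": {
    "example_articles_job": {"status": "done", "updated": "2019-12-02 23:43:07"},
    "example_history_job": {"status": "in-progress", "updated": "2019-12-07 22:05:16"}
  }
}
""")

print(unfinished_jobs(sample))
```

An empty result would correspond to the "Dump complete" state reported on this page.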
Dump complete
Verify downloaded files against the md5 and sha1 checksums to check for corrupted files.
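The check can be scripted with a standard hashing library. The sketch below uses Python's hashlib and reads the file in chunks so that large dump files do not have to fit in memory; the function names and the example file name are hypothetical.

```python
import hashlib

def file_digest(path, algorithm="sha1", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

def matches_checksum(path, expected_hex, algorithm="sha1"):
    """Compare a file's digest with the value from the published checksum list."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

Usage, with a hypothetical local file name and the digest copied from the checksum file: `matches_checksum("downloaded-dump.xml.bz2", "<hex digest>", "md5")`. A result of False suggests the download is corrupted or incomplete.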
- 2019-12-03 10:31:22 done Recombine multiple bz2 streams
- 2019-12-03 10:03:03 done Articles, templates, media/file descriptions, and primary meta-pages, in multiple bz2 streams, 100 pages per stream
- 2019-12-08 05:28:58 done All pages with complete edit history (.7z)
- 2019-12-07 22:05:16 done All pages with complete page edit history (.bz2)
  getting/checking text 68050901 failed (Received text is unplausible for id 68050901) (Will retry 4 more times)
- 2019-12-07 00:23:59 done Recombine Log events to all pages and users
- 2019-12-07 00:23:04 done Log events to all pages and users.
- 2019-12-04 15:50:14 done Recombine all pages, current versions only.
- 2019-12-04 15:08:47 done All pages, current versions only.
- 2019-12-03 00:35:47 done Recombine articles, templates, media/file descriptions, and primary meta-pages.
- 2019-12-02 23:43:07 done Articles, templates, media/file descriptions, and primary meta-pages.
- 2019-12-02 07:08:54 done Recombine first-pass for page XML data dumps
- 2019-12-02 06:46:27 done First-pass for page XML data dumps
- 2019-12-07 00:17:21 done Recombine extracted page abstracts for Yahoo
- 2019-12-07 00:16:47 done Extracted page abstracts for Yahoo
  2019-12-07 00:02:52: jawiki (ID 55346) 2000 pages (34.5|73.0/sec all|curr), 2000 revs (34.5|36.5/sec all|curr), ETA 2019-12-08 08:36:26 [max 4044256]
- 2019-12-06 21:02:15 done List of all page titles
- 2019-12-06 21:01:51 done List of page titles in main namespace
- 2019-12-06 21:01:30 done Namespaces, namespace aliases, magic words.
- 2019-12-02 11:56:39 done Wiki page-to-page link records.
- 2019-12-02 11:47:12 done Redirect list
- 2019-12-02 12:05:03 done User group assignments.
- 2019-12-02 12:02:41 done A few statistics such as the page count.
- 2019-12-02 12:04:00 done Wiki category membership link records.
- 2019-12-02 12:04:26 done Past user group assignments.
- 2019-12-02 11:59:39 done Tracks which pages use which Wikidata items or properties and what aspect (e.g. item label) is used.
- 2019-12-02 12:00:31 done Wiki interlanguage link records.
- 2019-12-02 11:48:38 done List of annotations (tags) for revisions and log entries
- 2019-12-02 11:51:58 done Wiki template inclusion link records.
- 2019-12-02 11:48:57 done Newer per-page restrictions table.
- 2019-12-02 12:04:47 done List of pages' geographical coordinates
- 2019-12-02 12:01:42 done Category information.
- 2019-12-02 11:46:27 done This contains the SiteMatrix information from meta.wikimedia.org provided as a table.
- 2019-12-02 11:49:25 done Interwiki link tracking records
- 2019-12-02 12:00:56 done Metadata on current versions of uploaded media/files.
- 2019-12-02 11:50:08 done Base per-page data (id, title, old restrictions, etc).
- 2019-12-02 11:57:28 done Nonexistent pages that have been protected.
- 2019-12-02 12:01:17 done Language proficiency information per user.
- 2019-12-02 11:50:36 done Annotation (tag) names and ids.
- 2019-12-02 11:58:57 done Wiki external URL link records.
- 2019-12-02 12:02:19 done Wiki media/files usage records.
- 2019-12-02 11:47:57 done Name/value pairs for pages.
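The "multiple bz2 streams, 100 pages per stream" entry in the list above describes a file made of independently compressed bz2 streams placed back to back, so one stream can be decompressed without reading the rest. The sketch below shows the idea with Python's bz2 module on a small in-memory file built for the demonstration. The function name is hypothetical, the page contents are placeholders, and the byte offset of each stream is assumed to be known already (for example from an index).

```python
import bz2
import io

def read_stream(fileobj, offset, chunk_size=64 * 1024):
    """Decompress the single bz2 stream that starts at byte `offset`."""
    fileobj.seek(offset)
    decomp = bz2.BZ2Decompressor()
    out = []
    # decomp.eof becomes True once the end of this stream is reached;
    # any bytes belonging to the next stream are left undecoded.
    while not decomp.eof:
        chunk = fileobj.read(chunk_size)
        if not chunk:
            break  # file ended before the stream did (truncated input)
        out.append(decomp.decompress(chunk))
    return b"".join(out)

# Build a tiny multistream file in memory: one bz2 stream per part.
parts = [b"<page>A</page>\n", b"<page>B</page>\n", b"<page>C</page>\n"]
buf = io.BytesIO()
offsets = []
for part in parts:
    offsets.append(buf.tell())     # remember where each stream starts
    buf.write(bz2.compress(part))  # each call produces a complete stream

print(read_stream(buf, offsets[1]))
```

Decompressing the concatenated file as a whole, for instance with `bz2.decompress(buf.getvalue())`, yields all parts in order, which is why the multistream file can also be treated as an ordinary bz2 file.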