Structured Wikivoyage Exports

A small utility that converts the wikitext data from the Wikivoyage dumps into a structured format. The goal is to make the data easier to work with and to extract useful information from it programmatically.

Usage

Docker

This script is intended to be run with Docker. A Docker image is available from the GitHub Container Registry (ghcr.io). For example, to run it with the filesystem handler: docker run -e HANDLER=filesystem -e HANDLER_FILESYSTEM_OUTPUT_DIR=/output -v ./output:/output --ulimit nofile=65536:65536 ghcr.io/bcye/structured-wikivoyage-exports. For all available options, refer to the docs.
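
When the container is launched from a script rather than typed by hand, it can help to build the same invocation as an argument list. The following is a minimal sketch, not part of this project: the helper name build_docker_command is hypothetical, and every flag and value is copied from the example command above. The only change is that the host directory is resolved to an absolute path before it is used in the bind mount.

```python
import shlex
from pathlib import Path


def build_docker_command(output_dir: str = "./output") -> list[str]:
    """Assemble the `docker run` invocation from the README as an argument list.

    Hypothetical helper: the flags mirror the README example one to one.
    """
    # Resolve the host side of the bind mount to an absolute path.
    host_dir = Path(output_dir).resolve()
    return [
        "docker", "run",
        "-e", "HANDLER=filesystem",
        "-e", "HANDLER_FILESYSTEM_OUTPUT_DIR=/output",
        "-v", f"{host_dir}:/output",
        "--ulimit", "nofile=65536:65536",
        "ghcr.io/bcye/structured-wikivoyage-exports",
    ]


command = build_docker_command()
# Print a shell-quoted version for inspection; nothing is executed here.
print(shlex.join(command))
# To actually start the container: subprocess.run(command, check=True)
```

Passing the command as a list avoids shell quoting problems if the output path contains spaces.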

Types

TypeScript types for consuming the JSON output are available; you can install them from the @bcye/structured-wikivoyage-types npm package. Refer to the docstrings included in types/index.d.ts for details.
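
Outside TypeScript, the output can be read as plain JSON. The sketch below is an illustration under assumptions rather than a documented interface: it assumes the filesystem handler has written files ending in .json somewhere beneath the output directory, and it makes no claim about the shape of each document (types/index.d.ts remains the reference for that). The function name iter_exports is hypothetical.

```python
import json
from pathlib import Path
from typing import Any, Iterator


def iter_exports(output_dir: str) -> Iterator[tuple[Path, Any]]:
    """Yield (path, parsed JSON) for every *.json file below output_dir.

    Assumption: exported files use the .json extension; the directory
    layout is not specified here, so the search is recursive.
    """
    for path in sorted(Path(output_dir).rglob("*.json")):
        with path.open(encoding="utf-8") as fh:
            yield path, json.load(fh)
```

A caller might loop over iter_exports("./output") and inspect each parsed document. Because the function is a generator, only one file is held in memory at a time.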

Documentation

See docs for more information on how to use this utility.

Description
Fetch and transform Wikivoyage data dumps into structured JSON trees
License: AGPL-3.0