1. Introduction

1.1. About

twikiget is a tool to download twiki pages and archive them in .warc format. It uses wget underneath and so it includes all its downloading features.

1.2. Features

  • download and archive specific TWiki page and all its attachments
  • create WARC files for long-term preservation purposes
  • save local cache for faster and periodic reprocessing
  • (planned) extract specific metadata from TWiki document markup according to configurable templates