Continued problems with warcit and wacz

If you need a smaller file as a test, feel free to use this WACZ of browsertrix.cloud (192KB)

Fun fact, (this isn’t well documented yet, I plan to do a pass on the ReplayWebpage docs in the nearish future) but ReplayWebpage (when embedded, not in the UI on the site) supports loading multiple WACZ files! You don’t actually have to rip them apart and re-create them (though that can be nice for portability).

You can do this by passing a JSON file to the sourceurl of ReplayWebpage using the following spec (should work :wink: )

{
  "name": "Collection Name",
  "description": "Description of all files for curation reasons",
  "modified": "2023-09-23T04:57:33",
  "crawlCount": 2,
  "tags": ["collectiontag1", "collectiontag2"],
  "resources": [
    {
      "name": "archive1.wacz",
      "path": "path/to/archive1.wacz",
      "hash": "e511cc962b156a37d3c7546d8e0533b31e7b49f8ef902f5cdd0c0e093a10522f",
    },
    {
      "name": "archive2.wacz",
      "path": "path/to/archive2.wacz",
      "hash": "caac5f6789d7f112a2717574f276861cf583db2eb145ac67d0c1d9b4f2713b1a",
    }
  ]
}

file hash, description (though encouraged!), tags, and modified timestamp should be optional??

Note that as this is undocumented some things are subject to change, also I’ve omitted some of the fields we fill in automatically in Browsertrix so if you try it out with only these fields LMK if it works for you! :slight_smile: