Trying to archive video from https://www.govinfo.gov/collection/january-6th-committee-final-report
Link to video.
Steps to reproduce.
Goto Committee page
Select Supporting Materials - Video Exhibits
Turn on recorder
Click on MP4 to watch.
Wait for recording to complete.
Try to replay.
edsu
January 31, 2023, 1:29pm
2
Hi @boconnor I noticed the same problem when following your steps with ArchiveWebPage. The resulting WACZ can be viewed here temporarily.
I noticed that when creating the archive that clicking on the first MP4 link:
https://www.govinfo.gov/content/pkg/GPO-J6-VIDEO-EXH-103/video/GPO-J6-VIDEO-EXH-103.mp4
opens a new tab, and that the server then sends a 302 Found response which redirects to:
https://customer-uh7tqhki3bpanql6.cloudflarestream.com/815f9de14a4a3efc7cecaa21a955b060/watch
I don’t understand why the first URL and its response aren’t being written to archive. I don’t see it in the CDX index that is bundled in the WACZ and that’s why it comes back as not found when you click on it.
But fortunately the video itself is archived, for example here is the first one?
In case its helpful I created an ArchiveWebPage issue to track this.