Replay issue w/ embedded Vimeo video

Hi all,

I’m just getting started with Browsertrix, and I’m experiencing a replay issue with embedded Vimeo videos captured on this site: https://sephardivoices.com/iw/.

For example, I ran a crawl of this single page – https://sephardivoices.com/iw/nina-hallack-lebanon/ – which generated a WACZ of about 13 MB.

The crawl did capture an .mp4 video, which is accessible and playable in replayweb.page if you filter by audio/video URLs, but the video is not playable from the seed page above. A static image is displayed in the frame, but there are no player controls to start the video.

The video is captured–which is the most important thing!–but for public access we would ideally want it to be playable from the seed page.

Any thoughts?

PS - I used default Browsertrix settings for the crawl, except for the following (which I don’t think should make any difference)


include:

  • https://sephardivoices.com/iw/wp-content/uploads/.*

behaviorTimeout: 0
generateWACZ: true

I have similar issues with Vimeo videos. Thanks for bringing out.

Peter

https://sephardivoices.com/iw/ no longer seems to work?

Here is a sample of my issue: https://swap.stanford.edu/was/20240108161642/https://eastwindezine.com/utom-vibrant-new-music-from-florante-aguilar/

Thanks for circling back on this, Ed. The /iw part of the site does seem to be inaccessible now, but we did manage to get quite a good capture of it through a combination of Archive-It (our usual crawling platform) and WebRecorder (supplemental WARCs uploaded to Archive-It).

If interested, the capture is here.

Looking forward to more experimentation in this area. There is potentially a big role for WebRecorder and Browsertrix for complex sites and as supplement to our Archive-It crawls.