So, I’ve created an archive of an blog that we’re looking to decommission as Browsertrix collection. It works very well but we’re realizing we’d like navigating to external URLs during a replay to automatically load the live page. I don’t see anything the ReplayWeb.page options that appear to allow that but it’s possible I’m missing something. Is this possible? And if so, how is it configured?
Replayweb runs in a sandbox to explicitly block external links - so what is in your WARC is what you see. We have a Python script which allows WARC reply and allows external links for Dev work but Replayweb is the ‘Gold Standard’ for us to be sure our WARCs are valid.
Hope that helps.