Hi there! I have been wondering: where is the data collected by the Webrecorder browser extension stored? In the browser itself? Somewhere on the PC?
I noticed that I don’t have to be connected to the internet to download the collected data as a WACZ or WARC file, so I take it the data is not stored anywhere online, correct?
The answer is probably yes (I’m not sure exactly where it is located), but ArchiveWeb.page internally stores data in a database, which means you can’t just search the drive for WARC or WACZ files and find them.
The best way to go about this is probably exporting and importing your archives to the other machine IMO.
I am using ArchiveWeb.page (Webrecorder), both the Google Chrome extension and the standalone app. I archived 2 GB worth of data and then deleted it from the extension and app pages, but no space has been freed on my Mac. Where do the extension and app store data on the computer, and how can I access and delete it?
I have deleted the archived pages and sites from both the extension and the app, but the space is not being freed, and there is no kind of recycle bin to delete them from permanently. Once you press x the item vanishes, but no dialog such as "permanently delete" shows up.
For browsers, the data is not stored in the locations you provided. It’s in the browser’s own folder in Library, but is it stored there as .wacz, or in what format? We can’t export a .wacz of 13+ GB from browsers; the export always gets cancelled after 10-11 GB. It also starts fast, but later the speed gets so slow that it gets cancelled. I have tried thousands of times and get the same issue every time.
I can assure you that your data is, in fact, stored in IndexedDB for both the browser extension and desktop app. Unlike Browsertrix, ArchiveWeb.page keeps archived data in a database, not in individual WACZ files.
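If you want to check how much disk space those databases are actually using, here is a rough Python sketch that lists the largest subfolders of a browser profile’s IndexedDB directory. The path in it is an assumption on my part (the default Chrome profile on macOS); a different profile, browser, or the desktop app will keep its data elsewhere, so adjust `root` accordingly. The helper names are just for illustration.

```python
from pathlib import Path


def dir_size(path: Path) -> int:
    """Total size in bytes of all regular files under path."""
    return sum(f.stat().st_size for f in path.rglob("*") if f.is_file())


def largest_subdirs(root: Path, top: int = 10) -> list[tuple[str, int]]:
    """Return (name, bytes) for the biggest immediate subfolders of root."""
    sizes = [(d.name, dir_size(d)) for d in root.iterdir() if d.is_dir()]
    return sorted(sizes, key=lambda item: item[1], reverse=True)[:top]


if __name__ == "__main__":
    # ASSUMPTION: default Chrome profile on macOS. Change this for other
    # profiles, other browsers, or the desktop app.
    root = Path.home() / "Library/Application Support/Google/Chrome/Default/IndexedDB"
    if root.is_dir():
        for name, size in largest_subdirs(root):
            print(f"{size / 1e9:8.2f} GB  {name}")
    else:
        print(f"Not found: {root}")
```

This only reads file sizes and does not delete anything. Running it before and after deleting an archive in the extension should show whether the space is really being released.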
In order to create WACZ or WARC files out of your data you’ll need to do so by exporting it from the app / extension. If you haven’t tried exporting a WARC, that might be something to give a shot? It’s a little tricky to find, but the download option for WARC files is available in the pages list.
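Once you have an exported WACZ, you can check what is inside it without loading it in the replay viewer, since a WACZ is a ZIP file. Below is a minimal Python sketch that totals the uncompressed size of each top-level entry (typically things like `archive/`, `indexes/`, `pages/` and `datapackage.json`). The function name and the command-line usage are my own choices for illustration, not part of any Webrecorder tool.

```python
import sys
import zipfile


def summarize_wacz(path: str) -> dict[str, int]:
    """Map each top-level entry in the WACZ to its total uncompressed size in bytes."""
    totals: dict[str, int] = {}
    with zipfile.ZipFile(path) as wacz:
        for info in wacz.infolist():
            if info.is_dir():
                continue  # skip explicit directory entries
            top = info.filename.split("/", 1)[0]
            totals[top] = totals.get(top, 0) + info.file_size
    return totals


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (hypothetical file name): python summarize_wacz.py my-archive.wacz
    for name, size in sorted(summarize_wacz(sys.argv[1]).items()):
        print(f"{size / 1e6:10.1f} MB  {name}")
```

If an export was cancelled partway through, the file will most likely fail to open here with a `zipfile.BadZipFile` error, which is a quick way to tell a complete export from a truncated one.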
Why is playback so slow? I tried it with Instagram, and even an archived page with only a few scrolls takes too long to load posts, in both the extension and the standalone app. It’s not practical to view them all. Can’t they be viewed instantly, instead of having to wait a few minutes every time for the posts to load?
Unfortunately, because of how our crawler and replay systems work, replay can be slow for sites like Instagram that have a long waterfall of requests. We capture the full browser session, including all separate requests, rather than simply capturing a snapshot of the page as it is at a given moment. That makes for more accurate capture, but it also means that when sites (such as Instagram) use many consecutive requests to render the page, playback can sometimes be slow. When viewing a WACZ file with a very large number of pages, this issue is sometimes compounded by the file compression used by the WACZ file format.
All that said, it’s not something I’ve particularly noticed in my own Instagram captures, but it’s possible the bottleneck is your system storage or CPU. Without more details it’s a bit tricky to say for sure.
It fails to open captured pages even at 3 GB+, and a 1 GB capture also takes minutes to load each post, so it’s not usable at all. I also tested recording a profile that had 10k posts; after the capture completed the size was 13 GB, and the browser crashed when replaying. Not even the first page loaded properly. I also can’t download it: the download gets cancelled no matter which browser I try, and I have tried hundreds of times. What’s the use, then? It’s frustrating and unreliable on so many levels.
Autopilot on the profile’s main page is also not working.