Join the ArchiveTeam project to help index Reddit and other sites

PoorlyShavedApe@beehaw.org · 2 years ago

Join the ArchiveTeam project to help index Reddit and other sites

rektifier · edit-2 2 years ago

This is true. If you run the reddit-grab project directly without using the warrior (sudo docker run -d --name reddit --label=com.centurylinklabs.watchtower.enable=true --restart=unless-stopped atdr.meo.ws/archiveteam/reddit-grab --concurrent 6 yourname), you can set up to --concurrent 20, and some projects do work well with higher concurrent, but not reddit. 6 is already pushing the limit.

I’m running reddit-grab on 25 VMs on azure (trying to burn my $200 free credit that expires in 10 days) and I can only run --concurrent 4 safely on most of them. The only VMs that can run --concurrent 6 are the ones in India, which seem to be soft-ratelimited by their higher latency anyway.

Join the ArchiveTeam project to help index Reddit and other sites

Join the ArchiveTeam project to help index Reddit and other sites

ArchiveTeam Warrior