https://reddit.com/r/redditdev/comments/13wsiks/_/jmnj9xc/?context=1

  • FuckyWucky [none/use name]
    ·
    1 year ago

    one could make a pretty decent reddit scraper-to-api script in python in like a day. im sure that will be very efficient. :sicko-hexbear:
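
    A rough sketch of the kind of script being described, assuming the `requests` library and Reddit's public `.json` listing pages (the function names, pacing, and page counts here are just placeholders, not anyone's actual code):

    ```python
    # Hypothetical one-afternoon scraper: walk a subreddit's listing pages
    # via the public .json endpoints and collect the post data.
    import time

    import requests

    HEADERS = {"User-Agent": "my-archive-script/0.1"}  # the default UA tends to get blocked

    def fetch_listing(subreddit, after=None):
        """Fetch one page of a subreddit's newest posts as JSON."""
        params = {"limit": 100}
        if after:
            params["after"] = after
        resp = requests.get(
            f"https://www.reddit.com/r/{subreddit}/new.json",
            headers=HEADERS,
            params=params,
            timeout=30,
        )
        resp.raise_for_status()
        return resp.json()["data"]

    def scrape(subreddit, pages=5):
        """Follow the 'after' cursor across pages and collect post dicts."""
        posts, after = [], None
        for _ in range(pages):
            data = fetch_listing(subreddit, after)
            posts.extend(child["data"] for child in data["children"])
            after = data.get("after")
            if not after:
                break
            time.sleep(2)  # pause between requests instead of hammering the site
        return posts

    if __name__ == "__main__":
        for post in scrape("programming", pages=2):
            print(post["title"])
    ```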

    • KnilAdlez [none/use name]
      ·
      1 year ago

      honestly I've made things with the public reddit api, it's so bad I could almost believe a scraper is more efficient
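
      For contrast, going through the official API instead tends to look something like this, assuming the PRAW wrapper and a registered OAuth app (the credential strings are placeholders):

      ```python
      # Sketch of the official-API route via PRAW; client_id/client_secret
      # come from a registered Reddit OAuth app and are placeholders here.
      import praw

      reddit = praw.Reddit(
          client_id="YOUR_CLIENT_ID",
          client_secret="YOUR_CLIENT_SECRET",
          user_agent="my-api-script/0.1",
      )

      # Read-only use: pull the newest submissions, subject to the API's rate limits.
      for submission in reddit.subreddit("programming").new(limit=25):
          print(submission.title, submission.url)
      ```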

    • save_vs_death [they/them]
      ·
      1 year ago

      they probably keep track of problem IPs doing that and autoban them; there are probably a lot of people interested in scraping reddit

          • SerLava [he/him]
            ·
            1 year ago

            Hey, when the legit method costs 20 million dollars, that leaves a lot of room for some very well funded botnets

      • Owl [he/him]
        ·
        1 year ago

        Scraping enough to make a reddit clone won't make more / different requests than someone using the desktop site. Each person using the app is a different IP.

        • save_vs_death [they/them]
          ·
          edit-2
          1 year ago

          Scraping usually implies that one server (or more, but usually a small number) is making a large number of requests to some other website with the ultimate goal of duplicating as much of its content as possible. Something that only accesses whatever content you want to see on demand, interactively, is not really a scraper. Yeah, this reply is pedantic weenieism, I'm sorry. Something like, idk, Invidious, is called an "alternative frontend."

          Point being, a website can always screw with alternative frontends in various ways, including going out of business and shutting down, whereas once the scraper has done its job there's no screwing with it: the content is duplicated. Obviously scraping the entirety of reddit or what have you is a very dumb idea, because there's a lot of garbage data that nobody will care about in 3 days' time. Except for selling the data as an AI training set, no sane user would want to use that data interactively.

          Bottom line is, scraping makes it so data is requested only once, and then recorded somewhere. Interactive frontends ask for the data from the source website every time someone wants to see it.
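
          A toy illustration of that difference, assuming the `requests` library (the URL, file path, and function names are made up for the example): the scraper hits the source once and keeps a local copy, while the frontend goes back to the source on every view.

          ```python
          # Contrast: scrape-once-and-record vs. fetch-on-every-view.
          import json
          import pathlib

          import requests

          HEADERS = {"User-Agent": "toy-example/0.1"}
          URL = "https://www.reddit.com/r/programming/top.json"  # placeholder source

          def scrape_once(path="archive.json"):
              """Scraper: request the data a single time and record it locally."""
              data = requests.get(URL, headers=HEADERS, timeout=30).json()
              pathlib.Path(path).write_text(json.dumps(data))

          def read_archive(path="archive.json"):
              """Later reads hit the local copy; the source site can no longer interfere."""
              return json.loads(pathlib.Path(path).read_text())

          def frontend_view():
              """Alternative frontend: every page view asks the source site again."""
              return requests.get(URL, headers=HEADERS, timeout=30).json()
          ```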