https://reddit.com/r/redditdev/comments/13wsiks/_/jmnj9xc/?context=1

  • save_vs_death [they/them]
    ·
    edit-2
    1 year ago

    Scraping usually implies that one (or more, but usually a small number) of servers are making a large number of requests to some other website with the ultimate goal of duplicating as much of its content as possible. Something that only accesses whatever content you want to see on-demand, interactively, is not really a scraper. Yeah this reply is pendantinc weenieism, I'm sorry. Something like, idk, Invidious, is called an "alternative frontend." Point being a website can always screw with alternative frontends in various ways, including going out of business and shutting down, whereas, once the scraper has done its job, there's no screwing with it, the content is duplicated. Obviously scraping the entirety of reddit or what have you is a very dumb idea because there's a lot of garbage data that nobody will care about in 3 days time. Except for selling the data as an AI training set, no sane user would want to use that data interactively.

    Bottom line is, scraping makes it so data is requested only once, and then recorded somewhere. Interactive frontends ask for the data from the source website every time someone wants to see it.