• irelephant [he/him]🍭@lemm.ee
    link
    fedilink
    English
    arrow-up
    6
    ·
    7 hours ago

    A small publisher’s ebook platform recently started blocking firefox for me, did a bit of digging and found that if pages aren’t requested with the right headers (which work in chrome and msedge) it will respond with a 302, suggesting you go to another page which takes a few minutes and then times out.

    This is probably to stop scraping, and could be because I started testing some scraping scripts on it.

    Anyway, this hasn’t even stopped me scraping, I just copied the headers and use those in my script.