The linked page has two “PDF” docs, but they are not really PDFs. If you wget them, you get HTML with embedded JavaScript.

So we can no longer simply download a PDF. Apparently we must run a JavaScript application to get the PDF in a browser tab, then use pdf.js to save it. WTF? This breaks my script (which stores the URL as metadata on every PDF I fetch).
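For anyone whose fetch script hits the same thing, here is a rough sketch of a guard that at least catches the fake PDFs before they get stored. It relies only on the fact that a real PDF starts with the bytes `%PDF-`. The names `looks_like_pdf` and `fetch_pdf` are made up for illustration and are not from my actual script, and this only detects the HTML wrapper; it does nothing to get the real PDF out of the JavaScript app.

```python
import urllib.request

# Signature that a real PDF file begins with.
PDF_MAGIC = b"%PDF-"


def looks_like_pdf(data: bytes) -> bool:
    """Return True if the payload begins with the PDF header signature."""
    return data.startswith(PDF_MAGIC)


def fetch_pdf(url: str) -> bytes:
    """Download url and refuse anything that is not actually a PDF.

    Hypothetical helper: raises ValueError when the server hands back
    something else (e.g. an HTML page with embedded JavaScript).
    """
    with urllib.request.urlopen(url) as resp:
        data = resp.read()
    if not looks_like_pdf(data):
        raise ValueError(f"{url} did not return a PDF; starts with {data[:20]!r}")
    return data
```

With a check like this, the script fails loudly on a wrapper page instead of quietly saving HTML under a `.pdf` name.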

Other sites do this too. I’ve seen websites for restaurants pull this shit with their menus.

What’s the point?

  • autonomousPunkOP · 20 days ago

    Thanks for the insight. Apparently Mozilla is okay with this.

    I suspect it violates open data law to impose JS execution as a precondition to reaching public documents.