I bought an online course recently. They mention on their websites that a note will be provided. But in the names of note they provide a 2-page summary for download, and the original note PDFs are not downloadable. I tried to download PDF from their sites via IDM but file corrupted. Even I tried to force copy and pasting into other documents but not successful as it pastes some encoded language. Only way I found to click screenshots of each and every pages on pc and paste into the word. It is very time-consuming. Is there another way to download PDF from their sites? Edit: look like the best way to capture it through screenshot.
Without telling us the exact site, the only answer you can expect to get is a resounding “maybe”.
it is a indian website and all the content are DRM protected.
You’re not helping if you don’t tell us the exact name of the website. Every DRM implementation works different.
Print to PDF?
pdf open only to their websites in a window… it is DRM protected content.
I tried, not working… It is not a pdf, some how all the pages are in different div container and text inside the page are in the span container but in distorted form.
Search internet for the file name maybe?
their notes are like compilation of some books regarding a particular exam. I can find books online but not their notes.
If you are not able to share the name of the website, could you maybe share a screenshot of how the PDF is displayed on the website? That would a bit more helpful towards solving your problem.
https://temp-file.org/jcwX1twb7yjgByb/preview https://temp-file.org/hIjgiABXlvinKlh/preview https://temp-file.org/TiC5Kp8vpya5YsM/preview https://temp-file.org/Z3df1Yzh47CvtIu/preview it look like this… it looks like every page in the div container, and every element or text of 1 page in the different span container, and even the span container the word is encoded not showing accurately…
Looks like the link you shared is not set to public. Just a heads-up—Google Drive will show your account name and profile picture for shared files. If you don’t want that, you could upload the image directly here in a Lemmy comment, or use something like Pixelfed.
They may not expose the actual PDF to you at all, just some software rendering of it. In that case I’d focus on making screenshotting efficient. Find a program that lets you save the whole screen to file automatically at once (one button press), or use Firefox ctrl+shift+s -> click on the page area -> save -> enter.
I think the same… because I tried every way that I know… On developer menu it shows every page inside a div container, and every element on that page word or text in the span container. even it is not directly written in the span container it is distorted the way words are written… https://temp-file.org/jcwX1twb7yjgByb/preview https://temp-file.org/hIjgiABXlvinKlh/preview https://temp-file.org/TiC5Kp8vpya5YsM/preview https://temp-file.org/Z3df1Yzh47CvtIu/preview have a look
Try reverse engineering the DRM. Most DRMs are stupid simple to break. Look at the JS code or the network tab under devtools. Maybe you can get the key.
If all fails, login through a headless browser session using python + selenium and use some quick and dirty scripting to screenshot each page.
https://drive.google.com/drive/folders/1ussgvci_rGPZLrm4eLTEfkbJRv5Sgh3s?usp=sharing … take a look into this screenshot… it looks like all the page are in div container, every text in span container but in distorted format
I can’t view this because the screenshot is not public and I don’t have a Google account to request access. In general you will want to right click the div, copy its selector (or if you wanna be specific, the XPATH) and use that in selenium to save the text or the image content.
Seems like you can remove the header and the sidebar HTML nodes, set the width of the
pdfViewer
class to 100% and hitCtrl+P
. Alternatively you may save one “page” at a time usingCtrl+Shift+S
as described by @tumulus_scrollsyes I think the best and easy way is to take screenshot… document is secured and text are cipher. I tried to download their videos but they blocked me from their site too…
I found another site that seems to use the same backend and pdf renderer. I looked at the network tab, and it downloads a .epdf file, but I’m not sure how to open it. If you want to have a go at cracking it, here’s a link to it: https://pastebin.com/w6JkNDYH