No, it’s not a pdf indeed, but a html. It does include images, styles, etc. I’d prefer a pdf, but I guess saving a website to pdf is not straightforward, and html does the trick. For me the most important thing is to have a backup copy in case the website goes down, and to know exactly what a website looked like on a certain date.
+1 for this feature. Saving as pdf or MHTML single file instead of zotero way which is to save legacy MHTML with related directory to save non-html external resources, images, scripts, etc. PDF is always better as long as it is readable format with print-friendly css.
+1 from me too.
In relation to this request, or perhaps partly to satisfy it, it would be great if Paperpile could be somehow integrated with WebCite.
The dream would be that Paperpile – all from simply adding a webpage as a reference – would:
- Archive the URL in WebCite,
- Save the archived URL as the reference in Paperpile,
- Create (and save in Paperpile) a PDF version of the archived webpage,
- Serious dreaming now, but, if Paperpile could work with WebCite to generate a DOI for the archived webpage!
That would be a feature seriously worth paying for. Citing webpages (blogs etc) is becoming increasingly important, and this would be a really excellent way of doing it.
I think that would be a fantastic feature needed by most people.
I am not sure whether method mentioned is also saves webpage with date and also can it save with the images available in the webpage. Please suggest
Where do you store the pdf file that you save and link?
+1! Readcube used to do this too, which was terribly convenient. Even better if you could make a permanent URL to the PDF screenshot for sharing.
This thread seems to have gone dark. I am a new subscriber to paperpile. It’s wonderful, with this huge exception of not archiving web pages. Has this gone anywhere over the past couple of years, or does it remain an unacknowledged feature request?
We do not currently have plans for this feature; we have limited resources and other features - such as the mobile app and the Word integration have taken priority. That said, we understand the utility of such an automated process and encourage users who would like this feature to voice their interest and explain their use cases in this thread.
I certainly understand that this might not be top priority for Paperpile, esp. given its academic roots and focus, focus on a mobile app, and small staff. I am an entrepreneur and am constantly researching new topics for presentations, reports, white papers and other documents. Many of the ideas that I pull come from non-academic sources, and I get a tremendous amount of research and inspiration from web-based sources. If a site changes or pages disappear, the value of citation is zero. It is currently a cumbersome process to save a page or print it into pdf format, then add the citations. This is clearly an area where Evernote is exceptionally strong; however, their system is not at all geared to reference work, and I frequently get lost without details on document sources.
Adding the ability to pull in html/web-based material would make Paperpile a killer app outside of the academic world.
I agree with you completely and would add that academics frequently rely on non-academic primary and secondary sources when researching and writing. I can’t believe that no-one has tackled this as of 2019. Evernote development team maintain a great web clipper, but are blind to the writing, citing and referencing workflow.
Paperpile already does reference material metadata clipping, automatic sourcing of full text PDFs and inline citations incredibly well – better than the competition – if they add the capability to clip clutter-free HTML pages and/or clip to PDF they would potentially create a market leading opportunity to steal tens of thousands of Evernote Premium subscribers, as well as significant market share from other academic referencing tools like Endnote, Mendeley, Papers and Zotero.
You are spot on. There is a very big market opportunity here. I hope that paperpile’s founders take note! There are many precedents of other applications that sit on top of Google, and others, that have killed it by being more flexible, easier, lighter, and inherently cloud-based.
I create a pdf of the webpage and save it in Evernote. I make a notation in Paperpile of the date I created the pdf and the Notebook I saved it in. This isn’t ideal - I sometimes return to a website, and don’t always remember to create a pdf or screenshot every time.
Government and Congressional websites are constantly changing, and lately data is being removed and archived so this is becoming a critical task for me.
I agree that this would be very helpful, and easy to implement. I use it for gray literature - ie blogs and online-only open-access journals, reports, legal cases, and white papers. I actually find myself citing web pages very often. My work-around :
- print the web page as a pdf file
- create the PP entry, as a type-website
- manually update the details
- manually add the pdf from the file on my pc
here’s just a few of my recent entries. As you can see, automating the process would help. Ive got a keyboard macro program… mb I should make a macro for this!!!
you can save a webpage as pdf or image in https://www.webpageto.com
I think the best option is likely to print to PDF, then add PDF from within paperpile. I think this is a reasonable solution to preserve the information. Good luck!
this is the workaround that I use, but PP REALLY should do this for the user. Web pages change and disappear, so preserving it as a PDF in the database should be a mandatory (i.e., correct, the only permitted) function.
I agree that it should be a built-in feature, but one advantage of printing separately is having more control of the PDF. I use this Chrome plugin to clean up the crud from webpages before printing, making a much easier-to-read experience.
I am a new user and this is one of two features so far that are missing. Paperpile was highly recommended by a fellow post graduate researcher and I’m in the arts where webpages are actually primary sources quite often so have to save a pdf so that info doesn’t get lost even if the website just updates (an artist changes their portfolio).
Right now I save it as a PDF then upload it to paperpile, but a one click when adding a citation as a webpage would be super helpful and much faster!
longtime user about to switch to Zotero permanently due exactly to this ONE feature. Well – two. I can hit a shortcut extension in Brave and save a Zotero link without using the mouse, plus I get the html saved. with paperpile I have to move mouse, and I don’t get html. I have over 8k items in paperpile.