Full text search in PDFs

This will never happen

2 Likes

Iā€™ve written some words on the progress of things in another thread. Improved search of all kind (your references, full-text of PDFs and external databases) is part of our long term strategy. Also Jirka posted a quick intro, he is our new database and backend engineer working on search infrastructure.

http://forum.paperpile.com/t/is-paperpile-development-dead/2897/19

http://forum.paperpile.com/t/is-paperpile-development-dead/2897/24

3 Likes

I would like to see Full text search too

2 Likes

I have been using pdf managers for some years. I was just about to start with Paperpile, when I found this thread. Full text searching is the most basic function for a researcher. While you have many other sexy projects underway, all pale in importance to basic text searching. I encourage you to do some focus groups with folks who use these products to better match your development priorities with the needs of your potential customers.

6 Likes

@Edward_Richards exactly my thoughts from Sept 2014!..

I donā€™t think itā€™s fair to say that the software isnā€™t being updated. That said, itā€™s true that there are some functionalities that have been for a long time in the pipeline. Full text search would be my top one pick, but so it was the IOS version, and they are delivering it. Iā€™m happy with Paperpile and do not plan to switch.

2 Likes

I second that, and I do not plan to switch either (btw, I am not aware of any alternatives for managing references in Google Docs). But at the same time, similar to what I said 2-3 years ago, it is still very unclear how Paperpile defines priorities for their development.

1 Like

Yes, thereā€™s stuff thatā€™s been on the making for a long time. Still, Iā€™ve tried many other alternatives (Mendely, Zotero, ReadCube, Papers, Sente, Bibdesk, EndNote) and I think that Paperpile is the best option.

1 Like

It is great to hear that advanced search is on the roadmap, as search and a word plug-in are still my top two remaining features.

Would any of the staff be able to provide any kind of updates on this? @jirka

Any word on the development schedule of full text search? Along with multi-document annotation mapping and export this is one of the key features thatā€™s really holding Paperpile back.

Napier, please, do not abuse the flagging feature.

We do take the feedback into account, which is why we have the Mobile App beta and Word Plugin beta running. We are also updating our infrastructure behind the scenes to make sure we can support all the features we would like to implement. When there is any news to share in regards to the fulltext search, we will do so here on the forums.

1 Like

I apologise and stand corrected ā€“ I did not realise that flagging such a long running topic for your attention and requesting an update from Paperpile constituted anything other than common sense. No developer response has been provided since October '18 and full-text search has been discussed but not developed since 2014. Being able to search the indexed content of a collected research library is of key importance to any conceivable research and writing workflow.

I am glad that you are laying the infrastructure groundwork for future improvements to Paperpile but Iā€™m sure that you can also understand the growing impatience of many of your long standing paying users who rely on your software academically and professionally. I donā€™t intend to be overly critical ā€“ Paperpile is very good ā€“ itā€™s just that there are some glaring feature gaps that prevent it from being brilliant.

3 Likes

@Napier - well said! The lack of this feature is the reason I need Mendeley to search my Paperpile PDF library. For whatever reason, and as you should be able to see from the very start of this thread, Paperpile did not consider this feature as important, and never prioritized its development. Unfortunately so.

2 Likes

Hereā€™s a workaround to search PDFs stored in Paperpile.

Google Drive contains a Paperpile folder with all your documents. Instead of opening the PDF in Paperpile on your browser (which takes you to Paperpileā€™s PDF reader without search capacity), try opening them from the Paperpile folder (Google Drive -> Paperpile -> search for document and click open). Then you get the option to choose a PDF reader other than Paperpileā€™s. For example, I use Preview, which allows me to search any individual PDF.

From Finder on a Mac, I can also focus a text search on just the PDFs in my Paperpile folder. Working for me so far.

1 Like

The beta PDF annotator from Paperpile does offer search. You can turn this on from the settings menu, under ā€œbrowser integrationsā€ > PDF Viewer > ā€œviewer with annotations (beta)ā€.

Wow, thanks. Seems like it works great. (Software developer husband says thumbs up!). I think others would appreciate this too.

1 Like

My workaround, and I realize this might not work for most people, is to copy the papers I need to search onto my laptop or cloud storage, tag the papers with a project specific tag and do the search from there. This does work from multiple different laptops and computers, but it doesnā€™t allow a search on files that are in Paperpile but not in a folder that is obviously related to my current project.
If you have an exact phrase you are searching for Google Scholar can sometimes be helpful.

I agree this is an important feature, and would very much like to have it.

Iā€™ve now tried multiple options for a full-text search workaround:

I used DocFetcher (portable, free, open source) to index the Paperpile folder in my Google Drive.
With that I get super fast full text search, but the preview is only plain text, which is ok, just doesnā€™t look nice. Sometimes you get strange PDF mumbo jambo effects like doubled text.

Foxit Reader and Acrobat DC also also allow to perform a text string search on a folder, but that can take a long time since these apps do not create an index, Acrobat DC Pro offers this feature I think, but Adobeā€™s pricing is bs.
The nice effect here is that you get a collapsible list of preview snippets and when you click on them they get opened directly in the PDF at the right position. Which would be the dream feature for Paperpile, especially because I have this nice free white space on the right in Paperpile :stuck_out_tongue:

3 Likes

Thanks Daniel - this is a great workaround. I really canā€™t believe this feature is still completely missing and is seemingly being ignoredā€¦18+ months since any type of developer feedback on the issue

I understand how useful this feature would be and workarounds are never as good as comprehensive, fast, and accurate search right in our app. Thatā€™s still our goal and we are far from ignoring the issue.

Originally, the files were only stored in Google Drive which made it impossible for us to efficiently index them. Thatā€™s why we spent the last year or so rewriting our backend and transferring tens of millions of PDFs to a new backend infrastructure which will allow us to search the files eventually. But for technical reasons you will see some other search improvements first before the full-text will be added.

3 Likes