Error: duplicating references

Now that I’ve gotten 20 references, if I try to cite a reference (say reference number 16), then Paperpile will not reference the same reference number (16) that it does earlier and will instead make a new reference (number 21) so I will have duplicate references. Can you please fix this? Thanks!

I’m having the same problem. May have been because my collaborator worked on the document as a separate .doc, then I applied their changes to my googledoc and each of the references that overlapped made a duplicate.

Would love a fix or even a workaround for the meantime.
Thanks!

I have the same problem only it occurs whenever I open by document on a different machine than I’m currently working on. I’ve gone from a bibliography with 36 items to 53 just from duplicate & triplicate copies of my references! Manually merging them in a huge pain.

Please fix this!

I also have this problem!

Yes please fix this, or provide a straightforward solution.

I have the same issue.

Hi, I’m having the same problem with duplication of references.
Did anybody find a way or receive an answer to solve that?
Thanks, Valentina

We have figured out how this is most likely happening: when creating a copy of a document, the sidebar is unable to detect duplicates until the document has been formatted at least once. The current best workaround is to always run “Update Citations and Bibliography” after making a copy of a Google Doc.

For existing errors, please remove all in-text citations, go to the gear in the sidebar --> Manage references. Then remove one of the duplicates and re-insert the citations.

I am having the same problem, but this is not fixed by the above suggestions.

The “View all references/Manage References” page will always reflect the current (and past) citations in the document following a formatting. If there are not duplicate entries there, then there will be none in the bibliography. Are you sure that you have removed the duplicate from this page?

hey, just to confirm if I get this right

  • the problem is: multiple co-authors contributing the identical references to the document will cause duplictes in the reference list after formatting (which what I am facing).
  • the workaround is to unformat citations, remove all in-text citations (manually by going through the text?) and reinserting them all again?

In general, when you add a citation to a document we check it against the existing references in the document and don’t allow you to add duplicates.

However, in some cases we have limited or outdated knowledge about which references are in the document, and this can result in duplicates:

  1. Copying references from another document. When you paste text and references from another document, we are not immediately aware of it - we only become aware of it when you format citations, at which point we retrieve the metadata. This may result in duplicates because we make no prioritization among recognized and unrecognized references when formatting the document.

  2. Adding references to a copy of a document before it has been formatted. This is similar to the above case. When you make a copy of a document, we are not aware that it contains citations (since we don’t want to process every document you open). It is therefore possible to add duplicates of citations already in the document. From our point of view, it seems like you are writing a new document, then when you format citations, like you copied in all the paragraphs/references from the old document, leading to the same problem. This second cause can be avoided by always formatting a document when you make a copy.

Note that the common use case of colleagues working on the same document does not result in duplicates.

When stuck with duplicates this may be the easiest solution:

  1. Go to the view all references page. Locate the duplicate you want to remove and edit its metadata to something which will clearly identify it (i.e. change the year to 9999).

  2. Format the citations in the document.

  3. Remove all copies of the duplicate citation from the text (manually).

  4. Remove the undesired copy of that citation from the (“View all references”) page.

  5. Re-insert the removed citations as normal.

I am trying not to be rude, but it is very… surprising to me that a piece of paid software has such a huge issue, and there is no easier fix than the complicated procedure of removing duplicates and re-inserting them. For a substantial manuscript, this is hours of work to get right, and the stakes are high. Perhaps an effective solution would be to add some feature to merge references / citation keys, e.g. from the “View all references” page.

I unfortunately started this current project using Paperpile, and so it is very difficult to switch away now. But I can’t imagine using it again for future projects, due to the number of headaches this issue has caused. Judging based on the comments above, I am not the only one with this problem. Asking your (paying) users to go through an /incredibly/ tedious process to correct this error does not seem like a fair response.

I have to agree with Chethan. This seems like a huge core functionality problem. Copy-Pasting sections of text with citations from document to document is not uncommon. Many academics are writing repeatedly on similar topics, and sections of background material are often easily reused. The manual work-around to fix this is more aggravating than writing from scratch. We have had a set of licenses for our lab members for a few months, and one just moved back to Zotero after discovering this issue. After chatting about it, a few more of our lab also started debating moving to other platforms.

It seems it must be possible for you to recognise matching citations. The fact that it populates the bibliography with duplicate references with an added a, b, c letter implies that you pulled the same info from the library multiple times. Seems much safer in that case to assume they are meant to be the same citation, and edit the in-line citations to match a single bibliography entry.

1 Like

I am redoing all my references with Zotero since Paperpile messed up with my references. The whole process is taking me so long. Never using Paperpile again. I strongly recommend anyone who is reading this to leave Paperpile and use a proper reference manager. And did I mention I am never using Paperpile again?

I just came aware of this thread here. I also was not aware of this particular issue and that this causes such frustration. I’m sorry about that.

We’ve been fine-tuning duplicate detection in many other contexts for years and we definitely do duplicate detection when copy/paste in Google Docs. It seems there is a loophole/bug left that needs to be addressed.

I will discuss with my team and post updates here once we know more.

As promised quick update: We’ve identified the issue and fixed it. It will be released with our next update. I will provide more background when the fix is live and after we have done more testing.

1 Like

Hi Stefan. I see this error fix was included in the most recent update notes (Oct 5). Perhaps you can comment in more detail here about the extent of the fix, and if there’s are any residual issues remaining? I’ve just tested the issue in its most simple form, and can confirm it’s working as I would expect. Thanks for the fix!

Hi!
It’s 2021, and the problem has not been resolved.:slight_smile: Could you please tell me if there is any solution I missed? Thanks in advance!
Best,
Maria

I found a simple solution- it is necessary to create an account, then it generates less errors and overwrites. After this, I updated the double links (from b, c, etc.) and everything works fine now.