‘If Citations Could Talk’: Extracting, Structuring And Linking References To Reveal Earlier Research Findings
Researchers have been making their early-stage research available on preprint servers since the early 90s, but it’s really over the past year or two that preprints have gone mainstream.
Unless you insist that authors write their preprints in LaTeX or XML, the majority of these preprints are not in Web friendly formats. Ben Firshman has done some great work with arxiv-vanity, which turns LaTeX articles from arXiv.org into responsive web documents with citations as hyperlinks. And bioRxiv is investing significant funds to convert its PDFs to XML.
But in the main, preprints are Word or PDF files that have:
no linked citations
little or no metadata
no easy navigation
In this post, we’ll look at a possible solution to the first point – the lack of linked citations – and we’ll cover metadata and navigation in future posts.
In the example above, you can see how the citations in the original flat PDF have been turned into links to various resources, including the full text. The context of the citation is also highlighted.
We use the Scite API to display the citation sentiment (how many times it’s been supported or contradicted), and link to the full report card on the Scite.ai website.
Scholarcy summarises the findings of cited sources
It’s also useful to be able to compare what the citing author is claiming vs. what previous, cited work has claimed. This allows us to build on the excellent work that Scite.ai has done on identifying subjective sentiment – we can also show the objective differences between studies.
In the example above, we can see that the Scott and Schramke study had 207 participants, whereas the current study (below) has 2403, and the makeup of the cohorts in each study was different. These differences might partly explain why the current study does not replicate the Scott and Schramke findings, beyond the broad ‘supporting’ or ‘contradicting’ categories usefully provided by Scite.
Scholarcy identifies study participants
Taking this a step further, we aim to automatically flag where the cited findings differ from those of the current study – and even identify if the citing paper is misreporting the cited study in some way.
In future, the summary information from each citation could be incorporated into the citation graph if expressed in a concise way. Sources would then carry metadata about their own findings in a machine-readable format, which would make comparing studies much easier.
Unlocking 100 Years Of Scientific Papers Through Machine Learning
Reference mining is fundamental to the creation of citation networks and rich,... read more
Literature Review Pitfalls
One of the activities common to all researchers – from undergraduates to post-docs – is conducting literature reviews.... read more
The Power Of Open Citations For Researchers And Publishers
Open citations benefit researchers, journals and publishers
When researchers cite previous work, they... read more
What's the most challenging part of working towards your postgraduate degree? For many students, combing through mountains of research presents... read more
A fundamental component of scholarly research is reading, and citing, other people’s work. One effective way to get more context... read more
XThis website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Read MoreRejectAcceptSettings
Privacy & Cookies Policy
Privacy Overview
This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may have an effect on your browsing experience.