Using the Scholarcy Chrome Extension
When you right-click on the Scholarcy extension and choose ‘Options’, you can modify the way Scholarcy processes your PDF files.
By default, all options apart from Extract figures are enabled.
This creates the Scholarcy summary section, comprising a summary of the main document text with a fixed word limit set by the word count option, or as a fraction of the original document set by the % option.
This can be set to stable, which gives a good result for a broad range of documents, or experimental, which may work better for longer, more narrative texts.
Rewrite in 3rd person
This rewrites the Scholarcy summary section into a neutral third person (for example ‘The authors’ results suggest …’) to make it easier to quote from and reference sections of the paper.
Skip dialogue/quotes in summary
When selected, any dialogue or long sections of quoted text will not be considered as input to the summary engine, so only descriptive text will be summarised.
This identifies the main sections in the document (Introduction, Methods, Conclusion etc) and either creates summary sections for each of these when Extract snippets is checked (default), or outputs the full section text when Extract snippets is unchecked.
This identifies population, intervention and outcome measures and reports these in a Participants and statistics section.
This identifies tabular data and their captions, showing the captions in a Tables section along with a download button to save the tables to an Excel file on your computer.
This extracts figures and their captions into a Figures section. You can then click on each figure to show a larger version in a new browser tab. Each Figure will be linked to callouts in the text. Figure extraction is very CPU intensive and, for large documents with high-resolution images, may not always be successful, so it is disabled by default.
This highlights in yellow important facts stated by the paper.
Individual facts may span a few words and may lack context, so Smart highlighting extends the fact to a complete clause.
This highlights in pink the main goals that the authors set out to achieve and the main contributions of the paper.
This creates links between inline citations such as [3, 4] or (Smith 2007a) to the relevant bibliographic entry.
Determines how many key points are displayed in the Scholarcy highlights section – 5 (default) is a good starting point for most articles.
You probably won’t want to or need to change these options very often, but you may like to experiment with them to speed up processing or improve the results.
This locates metadata from external sources such as CrossRef, arXiv and others to correct the results of the automated extraction of title, authors and abstract where necessary.
Reflowing text in the correct order from PDF files is surprisingly difficult. You can change this to Experimental which may give improved results on ‘difficult’ PDFs.
Background reading list
- Fast: maps key terms to a Wikipedia search query but does not attempt to verify or disambiguate the results
- Precise (default): maps key terms to the correct Wikipedia entry, which may add a few seconds to processing time
- Broad: A combination of the Fast and Precise, and includes Wikipedia entries that include the key term in the title
- Broader: As for broad, but also includes Wikipedia entries that include the key term in the entry.
- Narrow (default): Identifies specific key terms in the document
- Broad: As for narrow but also includes acronyms
- Broader: As for broad but also includes frequent noun phrases
- Representative (default): Harvests key terms from a representative sample of the document
- Full text: Harvests key terms from the full document