The project will address sustainable ways of increasing the growth rate of White Rose Research Online by improving repository ingest and output options. Metadata quality will be evaluated. Features within the EPrints 3 software platform will be used to improve repository workflow. The project will be underpinned by the principle that metadata should be created only once; we will investigate metadata sourcing (inside and outside the institutions) and the reuse of metadata and full text from the repository to feed into other services, including the ESRC repository. The project will assess methods of bulk and automated ingest and report on any issues raised.

Increasing repository content through automation and services (IncReASe)


Start date: 1 July 2007

End date: 31 December 2008

Funding programme: Repositories and Preservation programme

Project website: http://eprints.whiterose.ac.uk/increase/

JISC theme(s): e-Research, e-Resources, Information environment

Repositories Enhancement project

The IncReASe project builds upon the first phase of White Rose Research Online, a shared institutional repository for the Universities of Leeds, Sheffield and York. We will address some of the issues identified over the past couple of years that have restricted the repository's growth rate. Our immediate aim is to create a better fit between researchers’ behaviour and repository deposit so that we can obtain a greater proportion of the new research being produced across the Consortium. We also need to incorporate older publications to make the repository more attractive to depositors.

Aims and Objectives

The project aims to increase content in White Rose Research Online, to automate aspects of the repository ingest process and to start to embed the repository within research workflows by lowering barriers to deposit. Repository-based services which may be useful to researchers will also be investigated. The project aims to produce reports and scenarios which will be helpful to other institutions working towards embedding a repository within their own workflows.

Project Methodology

The project can be divided into four phases:

  • Investigation of metadata sources across the Consortium and identification of pilot departments for workflow analysis and bulk data upload
  • Metadata enhancement as part of the repository ingest process
  • Building repository deposit into the standard research workflow
  • Offering services back to departments

The early phases (1 and 2) will enhance the existing core repository service by implementing features of the EPrints 3 software and identifying opportunities for bulk ingest. Workflow analysis will enable the development of a strategy for the embedding phase (3) and, as a greater mass of content is achieved, the repository-based services of phase 4 will become feasible.

Anticipated Outputs and Outcomes

  • Researcher workflow case studies and scenarios
  • Database import case studies
  • Bulk import report
  • Web scraper Perl script
  • ESRC workflow report
  • Automation issues report

Technology / Standards Used

 

Name of standard or specification    Version    Notes
OAI-PMH                              2.0
W3C Recommended standards
JSR-168                                         portal
Dublin Core
RSS 2.0
LDAP
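
As an illustration of how the OAI-PMH 2.0 and Dublin Core standards listed above can support metadata reuse, the following is a minimal sketch, not the project's own code. It parses an OAI-PMH ListRecords response in the oai_dc format and extracts a few fields that another service might reuse. The sample response, the identifier oai:repository.example.org:1 and the function name parse_records are all hypothetical; a real response would be fetched from a repository's OAI-PMH endpoint.

```python
import xml.etree.ElementTree as ET

# Namespace URIs defined by OAI-PMH 2.0, the oai_dc format and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical, hand-written sample of a ListRecords response (assumption:
# a real response would come from a repository's OAI-PMH endpoint).
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:1</identifier>
        <datestamp>2008-01-15</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An example article</dc:title>
          <dc:creator>Author, A.</dc:creator>
          <dc:creator>Writer, B.</dc:creator>
          <dc:date>2008</dc:date>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def parse_records(xml_text):
    """Return one dict per record with identifier, title, creators and date."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.findall("oai:ListRecords/oai:record", NS):
        dc = rec.find("oai:metadata/oai_dc:dc", NS)
        if dc is None:
            # Deleted records carry a header but no metadata; skip them.
            continue
        records.append({
            "identifier": rec.findtext("oai:header/oai:identifier", namespaces=NS),
            "title": dc.findtext("dc:title", namespaces=NS),
            "creators": [c.text for c in dc.findall("dc:creator", NS)],
            "date": dc.findtext("dc:date", namespaces=NS),
        })
    return records


if __name__ == "__main__":
    for record in parse_records(SAMPLE_RESPONSE):
        print(record["identifier"], record["title"], record["creators"])
```

Because the metadata is read from the repository rather than re-keyed, a sketch like this is consistent with the principle that metadata should be created only once.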

   
Lead institution
Project partners

Project staff

Project Manager
  • Rachel Proudfoot, Universities of Leeds, Sheffield and York, Edward Boyle Library, University of Leeds, LS2 9JT, 0113 343 7167, r.e.proudfoot@leeds.ac.uk
Project Team
  • Project Officer, To be appointed
  • Technical Developer, To be appointed

Last updated on 09/01/09 by Lisa Clifford