Archive

Posts Tagged ‘jorum’

OAI harvest and PORSCHE: Testing intraLibrary 3.5

December 15, 2011 1 comment

Quick recap: a requirement of PORSCHE was to transfer metadata between Jorum and the NHS eLearning repository:

The primary aim of the PORSCHE project is to provide “seamless access to academic and clinical learning resources to healthcare students”… “establishing the basis for a long term national partnership between the NHS and academia by sharing of appropriately licensed content between Jorum and the NeLR”.  To achieve this, PORSCHE has been exploring metadata transfer between the two repositories using OAI-PMH. Jorum is currently running on a modified DSpace (v1.5.2) and the NeLR on intraLibrary (3.1) both of which will of course expose an OAI-PMH feed, however, there would be an additional requirement for each repository to ingest each other’s feeds which is functionality that is not currently supported by either platform, though it is supported by the NDLR running on a later version of DSpace (v1.6) while PORSCHE are currently liaising with Intrallect to develop similar functionality in intraLibrary.

https://acerep.wordpress.com/2011/06/21/oer-infrastructure-discussions-with-mimas-and-ndlr/

Along with the PORSCHE team I’ve recently been testing a pre-release intraLibrary 3.5 which now includes an OAI-PMH harvest facility so that the metadata from external repositories can appear as a “collection” in intraLibrary. In the context of our own ACErep project, for example, this would also allow us to harvest the ALPS collections from the repositories at the University of Leeds and York St John, thereby offering a form of federated searching as collections are all brought under the standard intraLibrary search mechanism.

harvest admin

Harvest admin allows multiple harvests to be set up with the relevant oai url; user selects the desired collection (oai set) from the drop-down.

Selecting the OER collection from the Leeds Met repository (corresponds to OAI set=23) I am able to select harvested metadata format (dc or lom) and type of harvest (All Records, Since Last Success, From Date), time to run, and select intraLibrary Group/Process and Collection:

Bingo!

Bingo!

Bingo!

…and then, crucially for ACErep, I’m able to search my harvested collection (called mrnick) by SRU:

http://harvest.intralibrary.com/IntraLibrary-SRU?recordSchema=lom&operation=searchRetrieve&version=1.1&maximumRecords=10&startRecord=1&query=rec.collectionIdentifier=mrnick

Not sure yet when we’ll be able to get intraLibrary 3.5 but it should then be straightforward to harvest ALPS collections from the University of Leeds and York St John and search all ALPS resources across the three institutions from an ALPS branded interface.

In the context of PORSCHE, when the NHS is running 3.5 they will be able to harvest relevant collections from Jorum (e.g. HE – Medicine and Dentistry) into the NeLR.

<set>
<setSpec>hdl_123456789_39</setSpec>
<setName>HE – Medicine and Dentistry</setName>
</set>

http://dspace.jorum.ac.uk/oai/request?verb=ListRecords&metadataPrefix=oai_dc&set=hdl_123456789_39

Work to achieve similar from Jorum is ongoing via liaison with PORSCHE – see also my post on the UKCoRR blog at http://ukcorr.blogspot.com/2011/12/jorum-steering-group.html and this recent post on the Jorum blog on Coming improvements to the Jorum UserExperience

Game-Informed Risk Assessment? – A Labyrinth based toolkit for managing a good practice risk assessment workflow and guidance for open educational resources

June 28, 2011 2 comments

Just a quick post live from #elhealth2011 – just seen Stewart Cromar from Edinburgh University talking about “labyrinth” as used by the MEDEV toolkit – http://www.medev.ac.uk/ourwork/oer/toolkits/

Also had very brief discussions with Stewart and PORSCHE team about the possibility of implementing a similar system at another institution – I think it has the potential to fulfil our deposit use-case for ACErep (and OER in general) . Basically the toolkit leads a user through the process of file-upload, metadata addition, appropriate licensing and syndication to several destinations – Jorum, other repositories*, Web 2.0 services like You-Tube.

*I’ve also been pestering one of the MEDEV developers about implementing SWORD which would be fulfil this requirement (as I understand, currently syndication to Jorum will be by RSS harvest.) He’s sat behind me so will try to get him to comment here!

Categories: Uncategorized Tags: , , ,

OER infrastructure – discussions with Mimas and NDLR

June 21, 2011 2 comments

As I posted recently over on repositorynews and as ukoer folk will still be aware, from 1st August, Jorum, historically overseen jointly by the national data-centres in Edinburgh (EDINA) and Manchester (Mimas), will be managed exclusively by Mimas and will liaise more closely with the NDLR in Ireland.

Yesterday I took part in a telephone conference organised by the PORSCHE project team with representatives of Mimas, the NDLR and their technical partners Enovation – this post is to summarise the discussion and describe the processes of data transfer required by PORSCHE and ACErep.

Just to emphasise at the outset that Mimas are extremely positive about moving forward with a community driven process for the development of Jorum – informed by the ukoer programme and projects like PORSCHE and ACErep – but it is still very early days and we should be careful not to raise expectations unrealistically – the service does not pass to Mimas until August who are still in the process of appointing a Jorum Technical Coordinator – a role that is obviously central to these discussions.

The primary aim of the PORSCHE project is to provide “seamless access to academic and clinical learning resources to healthcare students”… “establishing the basis for a long term national partnership between the NHS and academia by sharing of appropriately licensed content between Jorum and the NeLR”.  To achieve this, PORSCHE has been exploring metadata transfer between the two repositories using OAI-PMH. Jorum is currently running on a modified DSpace (v1.5.2) and the NeLR on intraLibrary (3.1) both of which will of course expose an OAI-PMH feed, however, there would be an additional requirement for each repository to ingest each other’s feeds which is functionality that is not currently supported by either platform, though it is supported by the NDLR running on a later version of DSpace (v1.6) while PORSCHE are currently liaising with Intrallect to develop similar functionality in intraLibrary.

N.B. There may be an additional requirement for PORSCHE to transfer files as well as metadata (i.e. full content packages) between the two systems and it was suggested that OAI-ORE (Open Archives Initiative Object Reuse and Exchange) might meet this requirement – I don’t know enough about this standard to comment.  (In intraLibrary I think it is possible to use a custom metadata template to expose the URI for the full content package in the OAI-PMH feed – might this be a possibility for file transfer?)

In lieu of a Jorum software upgrade to match the specification of their Irish counterpart, the NDLR have agreed to test the intraLibrary OAI-PMH feed (both from the NeLR and the Leeds Met repository) into their test-server which should be relatively straightforward (some metadata mapping notwithstanding). Moreover DSpace (v1.6) includes an API that supports search and retrieve and  v1.8 due for release in October of this year includes an improved (RESTful) API which may preclude the need for further development of the Jorum API – identified as a pre-requisite for both ACErep and PORSCHE.

In summary, a proof-of concept metadata exchange will be developed between the NHS Repository and the NDLR, future work by Jorum with hopefully mean that this can this concept can then be replicated:

OAI-PMH transfer between DSpace and intraLibrary

OAI-PMH transfer between DSpace and intraLibrary

Just a note that the NDLR is already ingesting OAI-PMH from Jorum but I’ve spotted an issue in that the resource URL isn’t necessarily being included in the simple record, appearing only in the full record (unlinked) in a dc:identifier field – see https://dspace.ndlr.ie/jspui/simple-search?query=ukoer for examples including this one from Leeds Met https://dspace.ndlr.ie/jspui/handle/10633/29188

N.B. It looks to me like this has arisen in instances where there are multiple dc:identifier fields where the first isn’t a URL – ours often have an intraLibrary ID in the first field (see below). I don’t expect this will be a major issue to resolve?

Unlinked Jorum handle in full metadata record

Unlinked Jorum handle in full metadata record

Trigger presentation at eLearning in Health 2011 Conference

June 7, 2011 Leave a comment

I will be presenting a trigger presentation at the eLearning in Health Conference to be held on 27th and 28th June at the Aston Conference Centre in Birmingham. Conference web-site is here, abstract below.

Intended Learning Outcomes:

– Learn more about the developing institutional and national OER infrastructures supported by JISC
– Discuss how these can be streamlined across the sector and improved/adapted  to more fully support subject specific requirements (e.g. health)

Abstract:

The ALPS CETL repository project (ACErep) is examining the requirements of a repository infrastructure in local, regional and national contexts to more effectively facilitate sharing and reuse of appropriately licensed medical teaching, learning and assessment material.

Three institutions in the ALPS consortium – the University of Leeds, Leeds Metropolitan University and York St John University – all use different commercial platforms for their teaching and learning repositories. In addition there is the national HE/FE repository Jorum, and the NHS National eLearning Repository (NeLR).

Informed by a user-group of institutional stakeholders, the goal of ACErep is to develop a clearly branded web-site from where staff at the respective institutions can:

  • deposit resources into their own institutional repository
  • cross-search the repositories from a single interface

In addition, we have developed ties with the JISC funded PORSCHE project at Newcastle University which aims to provide seamless access to academic and clinical learning resources for healthcare students primarily from the respective collections in Jorum and the NeLR.

The project utilises well-established repository technologies and has adapted an Open Source client to differentially deposit into a user’s institutional repository and liaised with Jorum to harvest metadata from multiple repositories. ACErep and PORSCHE are also working with Jorum on their development of an Open API (Application Programming Interface) upon which we can build a bespoke search portal. This work, in turn, builds on a prototype developed in collaboration with the Xpert repository at Nottingham University.

This approach will have the benefit of digital assets being preserved in one location (an institutional repository) while providing multiple points of access as well as allowing the ALPS branded web-site and the institutional repositories to “piggyback” on Jorum’s Google pagerank thereby improving discoverability.

The presentation will comprise a summary of the ACErep project to date; the emphasis will not be technical though technical implications and requirements will be discussed.

Attendees: eLearning professionals in health related fields

Project update

March 4, 2011 1 comment

One of the most challenging aspects of ACErep continues to be working across multiple institutions and organisations and recently progress has stalled somewhat as we liaise with our partners and wait for development work essential to the overall infrastructure.

In November last year we were able to build a prototype using Xpert which harvests OAI-PMH from our repository at Leeds Met and which we are then able to selectively search by keyword. We had initially hoped to work with Jorum but at that time there was no Open API, nor were there any plans to harvest by OAI-PMH, so we liaised instead with Pat Lockley of Xpert who was able to help with both requirements.

A lot has happened since November however, both at Xpert and Jorum. Pat is moving on to another (OER related) post which raises questions for us around the sustainability of that service now it has lost its key developer and I have been working with the PORSCHE project at Newcastle University which aims to provide seamless access to academic and clinical learning resources for healthcare students primarily from the respective collections in Jorum and the NHS National eLearning Repository (NeLR). As such, PORSCHE has also requested that Jorum harvest by OAI-PMH and provide an open API; the project team includes the Jorum service manager Hiten Vaghmaria as well as Kate Lomax of the NeLR (which, like Leeds Met’s repository, is also based on intraLibrary.) Jorum have now released a first iteration of an open API and hopefully both projects, and the UKOER community at large, can now work with the national OER service to develop that API to meet our requirements – PORSCHE’s Suzanne Hardy has set up a the Jorum API discussion space (wiki) at http://jorumapi.pbworks.com/w/page/35601929/Jorum-API-discussion-space. I am, in fact, yet to contribute to the discussion myself partly due to a lack of knowledge around API development – I’m far from clear, for example, how we may most effectively use an API to return just a subset of the resources in Jorum (i.e. ALPS/medical resources). In the Xpert prototype we just filtered on keyword but this feels a little unsatisfactory and Pat suggested a custom URL search would be better…

The other pieces of the jigsaw are repository-shaped and variously lacking the dove-tailing standards required to make the model workable. At Leeds University the learning and teaching repository is ExLibris’ DigiTool while York St John have a system called ArchivalWare, both systems are OAI-PMH compliant but do not support SWORD – development work is underway to rectify this at York St John while Jodie has set up an out of the box test install of e-prints to investigate OER object management.

Mike has also made some progress with EasyDeposit which is now working on a Leeds Met server and, after a few teething problems, will now successfully deposit a METS package to Jorum (DSpace), we will still need to test with EPrints (METS should be OK I think) and ArchivalWare when SWORD is integrated with that platform…probably METS again. There is an issue with intraLibrary, however, in that it only accepts IMSCP by SWORD, not METS, so we will also need to write an IMS content packager for EasyDeposit.

We are very grateful to Tamsin Treasure-Jones of ALPs for her continued support with this challenging project and I’m hopeful that all of these pieces can be put together over the Spring and Summer as we move towards a local, regional and national infrastructure for sharing medical teaching, learning and assessment material with an approach that will have the benefit of digital assets being preserved in one location (an institutional repository) while providing several points of access as well as allowing the ALPS branded web-site and the institutional repositories to “piggyback” on Jorum’s Google pagerank and improving discoverability.

Working search prototype and a SWORD fight

November 23, 2010 3 comments

Since the last meeting, Mike has done a great job in putting together a working (search) prototype that uses the Xpert API to search http://www.nottingham.ac.uk/xpert/ for just those resources tagged “alpsportal” i.e. ALPS resources that I have added to our repository and that have been harvested by Xpert.  Currently Xpert is only harvesting Leeds Met but it should now be fairly straightforward to also harvest YSJ and Leeds such that any resources appropriately tagged will be returned from the portal.  It’s on a restricted test server at the moment so this what it looks like (with a few explanatory slides):

And the SWORD fight? There are 3 repositories (possibly 4 if we count JORUM) that we need to be able to deposit into – this is a problem as currently only one of them – intraLibrary at Leeds Met – has fully functioning SWORD.  This did lead us to consider a hybrid deposit process allowing for manual package forwarding but given limited technical resources (this approach would have its own, not inconsiderable, overheads) and the clear advantages of utilising SWORD, it probably makes sense to focus on a prototype that can utilise the protocol and that can, hopefully, plug into the other repositories in the future.

One of the issues we face is that intraLibrary is based on IEEE LOM and only accepts IMS Content Packages by SWORD whereas the majority of (open source) repositories are based on Dublin Core metadata and only accept METS by SWORD; I have been working with Jorum to test their SWORD deposit – as a customised DSpace repository it accepts METS (not IMS) so we have had to map IMSCP for one of our resources -> METS in order for it to be accepted.  Currently I have only been able to achieve this with a very simple package comprising just title, description and author (I’ve also tried to add keyword and rights but these don’t seem to be picked up by Jorum – I’m not sure if this is a problem with Jorum or with my XML – probably the latter!)

It is still a moot point whether we will need METS or IMSCP for YSJ/Leeds until various questions around their specific repository implementations have been answered but we probably need to progress on the basis that we should support both. We will need to get Mike involved on the technical side and I’m hoping that he can start developing a web-based SWORD deposit client that, eventually, will be able to post to any SWORD service URL whether at Leeds Met, Leeds, YSJ or Jorum; as the only viable target repository at this stage is Leeds Met we will want to package as IMSCP in the first instance, ultimately with a view to also packaging as METS depending on the requirements of the destination repository.

MT – I may be asking for the moon on a stick here but I know you’ll put me straight and tell me what we can actually achieve and in what time-scale…just to summarise, below, what (I think) I understand about SWORD so far and what I anticipate might be required of this type of implementation:

SWORD works by means of a service URL that a package is posted to via the ATOM publishing protocol – this service URL allows a “service document” to be retrieved from the target repository which itemises available collections for SWORD deposit. The desktop client I have been using with our repository and with Jorum is available from Sourceforge – http://sourceforge.net/projects/sword-app/.  It enables me to post a .zip containing a resource + its metadata in an imsmanifest.xml to intraLibrary (or resource + mets.xml to Jorum)

Screenshots below:

Authenticate to repository / retrieve service doc (note SWORD service URL)

Authenticate to repository / retrieve service document (note SWORD service URL)

Service doc successfully retrieved – choose collection to deposit into:

Service doc successfully retrieved - choose collection to deposit into:

Confirm collection details for post operation:

Confirm colection details for post operation

Browse to zip on hard drive (already containing imsmanifest.xml) and click button to post to repository:

Browse to zip on hard drive (already containing imsmanifest.xml) and click button to post to repository

Resource and full metadata record successfully posted to repository:

Resource and full metadata record successfully posted to repository:

I imagine we would need a web-form that allows similar metadata to be captured and transformed into an imsmanifest.xml, then allows file upload and zips resource and manifest into an IMSCP that is then posted to a specified collection in the service doc at http://repository-intralibrary.leedsmet.ac.uk/IntraLibrary-Deposit/service.

Erm, sounds a bit tricky to me, Mike?

It might also be worth looking again at Stuart Lewis’ EasyDeposit (blogged about at http://repositorynews.wordpress.com/2010/06/04/easydeposit-the-sword-client-creation-toolkit/) which packages as METS for DSpace/EPrints etc.

Using Jorum (and/or Xpert?) for ACErep

October 12, 2010 10 comments

Thanks to Gareth and Hiten at Edina who spared their time to speak to Peter and I yesterday and answer our questions about Jorum to help us determine how we might integrate with ACErep.

In a nutshell we were interested in whether we can deposit into Jorum via SWORD – in addition to our respective institutional repositories – with a view to using the national repository to search across all ALPS resources from the three partner institutions. Ideally we would want to conduct a search from our own ALPS portal and display/format search results in our own environment. However, it seems there is no “open” search facility to query Jorum and return data in a format that we could process ourselves (i.e. XML) and while we may be able to conduct a search from the portal, our only option would then be to “jump-off” to the results in Jorum itself.

In view of limited resources it may be that this is the route we choose to go down but we will need to speak to our stakeholders first to see if it is acceptable to them – it’s obviously not ideal and the additional functionality we could add to the portal would be limited in this scenario (eg. comments/discussion on ALPS resources to bridge the theory/practice gap in health education as suggested by our stakeholders).

The approach that would give us the greatest flexibility, of course, is if we were able to harvest/index/search our three repositories ourselves and Peter will do some research to determine whether we might look into this.   Realistically, however, we may lack the resources (mostly time!) for this to be viable and there is another service that may be worth looking at first:

Xpert, as I blogged recently, is a service at Nottingham University that harvests RSS and OAI-PMH feeds from learning object/OER repositories including Leeds Met (and as of last week Jorum itself) to create a “distributed e-learning repository”.

So…our question for @xpert_project is: Are we be able to query Xpert with an appropriate level of sophistication (tbc!) and return XML that we can process, format and display ourselves?

Peter and I had a look at the APIs recently released as Xpert Labs which includes base URLs to return a variety of data formats including XML; I don’t really know enough about querying databases/data transfer to know whether this, in itself, is sufficient to solve our problem and Peter suggested that the XML returned by this service is called up by the user’s browser rather than being in a format that we could further process (?) but we would be very interested to speak with Xpert to see if there is any mileage in these ideas. If we were able to utilise Xpert in some way then a further caveat is that it would necessitate a delay between deposit and discovery to allow for harvest – Xpert harvests every night I think so we would be looking at an overnight delay – is this acceptable?  Any harvesting solution would also necessitate a delay of course; SWORD deposit to Jorum should mean resources are discoverable immediately.

In the short term, Peter and I intend to pursue both the Jorum and Xpert routes; so far I’ve just tried to sketch out the broad picture – as always the devil is in the detail and in this case that devil, in one way or another, is likely to reside in metadata Hell…or Hades at least.

We have had only the briefest discussion about an Application Profile for ALPS but it is probably desirable to adopt a lightweight AP based on UKOER (see previous post) – with the only additional requirement being that resources are presented utilising a bespoke taxonomy to accommodate “specific learning/assessment outcomes” (tbc).  While it should not be too difficult to map between our disparate systems’ metadata standards to arrive at an AP based on UKOER (Title, Description, Keyword, Classification, Contributor etc) there is a potential issue in that Jorum classifies by JACS and Gareth could not say, without some experimentation, whether non-JACS classification data is indexed and hence searchable; we intend to submit some test METS packages to Jorum by SWORD in order to test this.  There may also be issues around managing Content Packages – especially if we want to deposit them by SWORD (but also harvest by Xpert); our original intention was to deposit everything into Jorum via SWORD (which would require authentication with a UKFed user-account – best option probably to set up a specific ALPS account with an UKFed institutional email?) and this might still be the best option; even if we do go down the Xpert route would we want resources to be harvested from the institutional repositories or from Jorum?  Jorum only accepts METS by SWORD, so IMSCP could not be deposited via the standard and we would need to have some sort of contingency process whereby Content Packages are transferred to Jorum by an alternative mechanism which would also precipitate a (further) delay – such a process is already in place for Unicycle resources.

N.B. In actual fact, I suspect we are unlikely to get large numbers of IMSCPs but the contingency needs considering nevertheless!

At this stage, of course, it still a moot point whether we can query Xpert at all and return data in an appropriate format but we would also want to be sure that we could query a bespoke taxonomy… There may also be issues with respect to harvesting Content Packages (see discussion on this post)

(Attempting) to summarise:

Option 1: Submit to Jorum by SWORD – search Jorum from portal but jump-off to Jorum itself for results. In addition, submit to one of three institutional repositories (depending on user affiliation)

Pro: Integration with national OER infrastructure / deposited resources available immediateley / (relatively) low developmental overheads

Con: User taken out of portal to results in Jorum / limits development of additional functionality

Option 2: Submit to one of three institutional repositories (depending on user affiliation); harvest/index/search the metadata ourselves.

Pro: Maximum control

Con: Resources / timescale / delay associated with harvest / developmental overheads are potentially prohibitive

Option 3: Submit to one of three institutional repositories (depending on user affiliation); ensure all three repositories are harvested by Xpert – utilise API to search Xpert and return XML that we can display/format ourselves.

Pro: (relatively) low developmental overheads (compared to option 2)

Con: Unknown issues (is it feasible?) / delay associated with harvest / does Xpert have resources to help us in ACErep project timescale?

Option 4: Submit to Jorum by SWORD AND one of three institutional repositories (depending on user affiliation); records harvested from Jorum by Xpert – utilise API to search Xpert and return XML that we can display/format ourselves.

Pros/cons: As above (Options 1 & 3)