Date

Call-in Information

Time: 07:00 am, Eastern Time (New York, GMT-04:00)

To join the online meeting:

  • https://lyrasis.zoom.us/j/82670709536?pwd=MzF3NDladE1DKzEvUml4SGQ5eUFsQT09

    Meeting ID: 826 7070 9536
    Passcode: 008047
    One tap mobile
    +16699006833,,82670709536#,,,,*008047# US (San Jose)
    +19292056099,,82670709536#,,,,*008047# US (New York)

    Dial by your location
            +1 669 900 6833 US (San Jose)
            +1 929 205 6099 US (New York)
            +1 253 215 8782 US (Tacoma)
            +1 301 715 8592 US (Washington DC)
            +1 312 626 6799 US (Chicago)
            +1 346 248 7799 US (Houston)
            877 853 5257 US Toll-free
            888 475 4499 US Toll-free
    Meeting ID: 826 7070 9536
    Passcode: 008047
    Find your local number: https://lyrasis.zoom.us/u/kbEatBA0od

Slack

Attendees

(star)  Indicating note-taker

  1. Dragan Ivanovic 
  2. Michel Héon
  3. Jose Ortiz
  4. Abhishek Raval

Agenda

  1. Massive migration of data from DSpace to VIVO
    1. Further discussion about solution architecture
      1. integration of data in elements of VIVO graph (publications records, researchers, etc.)
    2. Reporting about implementation

Notes

  1. Jose reported his progress on exporting of DSpace data to middleware internal data structure defined for the needs of massive migration of data from DSpace to VIVO. Packages should be renamed to start with org.vivoweb.dspacevivo.
  2. Options for exporting data should be:
    1. All DSpace Items
    2. DSpace items modified or added in some time interval (start and end date and time)
    3. DSpace items belonging to one or to the list of certain collections (at least in one collection, but it might be that some item belongs to more collections)
    4. DSpace items belonging to one or to the list of certain communities, i.e. to collections linked with those communities
    5. DSpace items linked with a researcher or group (list) of researchers
    6. DSpace items belonging to an organization unit (affiliated researchers)
    7. DSpace items linked with a super-publication (journal), or event (conference)
    8. All DSpace communities
    9. Communities modified or added in some time interval (start and end date and time)
    10. All DSpace collections
    11. Collections modified or added in some time interval (start and end date and time)
    12. DSpace Items in accordance with a custom SPARQLQuery
    13. DSpace Communities in accordance with a custom SPARQLQuery
    14. DSpace Collections in accordance with a custom SPARQLQuery
  3. At the end of the first phase massive migration might be implemented as a console application (web interface is not mandatory, although it will be nice to have it)
  4. Michel reported his progress as well. 
  5. It is important that imported data are really integrated in the existing graph of VIVO and linked with existing entities. IDs of VIVO entities might be added in DSpace data, and vice versa.  
  6. We need a name for middleware internal data structure defined for the needs of massive migration of data from DSpace to VIVO
  7. Michel and Jose should think about time line, when the implementation of massive migration might be done. After that, the implementation of depositing files from VIVO UI is needed.
  8. We might move the weekly for one hour later (8am Eastern Time).

See:

  1. Architectural proposition
  2. About DSpace rdfizer
  3. About an Item in DSpace
  4. DSpace-VIVO GitHub Repo

Task List

  • Dragan Ivanovic to define options for harvesting data from DSpace in a wiki page 
  • All to think about the name for middleware internal data structure defined for the needs of massive migration of data from DSpace to VIVO

Michel Héon I propose the name 'dspace-vivo exchange data schema'.

  • Jose to continue working on his implementation in accordance with defined options for harvesting data
  • Jose to rename packages in his implementation,
  • Michel Héon to do the same for his contributions
  • The work is completed and the dev-heon branch has been merged by hand into the github repo
  • Michel Héon to think about stronger integration of migrated data in the VIVO graph, and how those data might be linked for synchronization in the case of modification
  • Dragan Ivanovic to make consultation with Abhishek about moving meeting for one hour, and if Abhishek agrees, Dragan Ivanovic will send new ics file. 

Previous tasks 

  • Michel Héon to prepare and share slides for discussion about architecture
  • Michel Héon and Jose to set up environment, i.e. to install DSpace 7, DSpace 6 and VIVO 1.12.x, Abhishek to help them with DSpace if necessary



  • No labels