Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3
Note

This page provides notes on the DSpace SVN to GitHub Migration taking that took place in MarchApril/April 2012.May 2012.  It has now all been completed.  The final announcement of this migrate is at: http://www.mail-archive.com/dspace-general@lists.sourceforge.net/msg01268.html

GitHub Migration Steps

Table of Contents
minLevel2
outlinetrue
stylenone

Codebases to migrate to GitHub

...

Note
titleGitHub import hints

http://help.github.com/import-from-subversion/

Main Codebases

Main Codebases

  • (tick) Primary DSpace Codebase - The Offical DSpace/DSpaceGitHub repository is now active! All DSpace development is now occurring in GitHub instead of SVN.
    1. The "DSpace/DSpace" GitHub is a BRAND NEW copy of SVN, with SVN author mapping using the #Author File below.
    2. WARNING: The old, "unofficial" "DSpace/DSpace" GitHub repository was renamed to DSpace-SVN-Deprecated
      • Anyone who had forked the "unofficial" GitHub migrate as-soon as possible to the new GitHub repo, as the old one will not be maintained any further.
      • Once everyone has migrated (no more forks exist), we will delete the "DSpace/DSpace-SVN-Deprecated" repo
    Primary DSpace Codebase - Migrate (perform final sync) Main Trunk/Branches/Tags to https://github.com/DSpace/DSpace. WARNING - WILL NOT PERFORM AUTHOR MAPPING: At this time, we do not plan to perform the "SVN author mapping" (See #Author File) for this primary codebase. The reason why is that it can only be performed by essentially "destroying" the existing GitHub repository and rebuilding/recreating it. If we were to recreate this codebase, it would be detrimental to all people/institutions who have already forked it. Since "recreating this repository with author mappings" doesn't seem to give us much benefit in GitHub, the attendees of the Developers Meeting on March 14,2 012 decided it doesn't seem worth the hassle
      • .
  • (tick)  dspace-api-lang - migrated to GitHub:DSpace/dspace-api-lang 
    • perhaps migrate to a single 'dspace-lang' GitHub project?  (Mark: I think move back into DSpace API and use minor releases to update languages.)
  • (tick)  dspace-xmlui-lang - migrated to GitHub:DSpace/dspace-xmlui-lang
    • perhaps migrate to a single 'dspace-lang' GitHub project? (Mark: I think move back into DSpace-xmlui and use minor releases to manage.)

(question) QUESTION: The following codebases are also standalone. Do we want to keep them that way? Do they all merit their own repository in GitHub?

  • (tick) dspace-services - migrate to 'migrated to GitHub:DSpace/dspace-services' GitHub Project? (Mark: yes, keep it a separate repo for now with its own release numbering.)
  • (tick)  dspace-pom (parent POM of 'dspace-parent' & other DSpace maven projects) - migrate to a 'dspace-pom' GitHub project? (Mark: separate repo) - consolidated into Offical DSpace/DSpace GitHub repository
  • (tick)  The License/License Header files (used by 'dspace-pom') - This License/License Header may just need to be moved into the same location as 'dspace-pom' (as that is where it is used) ( mark: yes) consolidated into Offical DSpace/DSpace GitHub repository
  • (tick) dspace-assembly-plugin (used by 'dspace-parent' to assembly DSpace)- migrate to ??? (Mark: We started to shift to putting these assembly files into the parent src directory, maybe that's the best place and will eliminate the need for the plugin.) consolidated into Offical DSpace/DSpace GitHub repository

(question) QUESTION: The following codebases are also standalone. Do we want to keep them that way? Do they all merit their own repository in GitHub?

Dependency Codebases

The following codebases are strictly releases of DSpace dependencies. Some of these dependencies just needed to be pushed to Maven Central (i.e. we don't "own" any of the code – all we did was release it to Maven Central because we needed it available there). Others need minor tweaks for DSpace.

(question) QUESTION: Should we just combine all of these into a single 'dspace-dependences' GitHub repository? Is there a reason to maintain these as separate repositories? It seems like most of these we could just release/update as needed from a single 'dspace-dependences' GitHub repo.

The below dependencies may no longer be needed, as it seems like we have a valid way to replace them in the future.

  • dspace-solr (dependency of 'modules/solr') - migrate to 'dspace-dependencies' GitHub project???
  • dspace-geoip (dependency of 'dspace-stats') - migrate to a general 'dspace-dependencies' GitHub project???
  • (error)
  • dspace-tm-extractors (used by dspace-api to do text extraction of Word docs?) - Can we just replace this with POI (http://poi.apache.org/), which is used for PPT text extraction (but also looks to work for Word docs)? See DS-1140
  • (error) dspace-jmockit (dependency of dspace-parent) - migrate to a general 'dspace-dependencies' GitHub project???
  • dspace-cocoon-servlet-service-impl (dependency of dspace-xmlui-wing) - migrate to a general 'dspace-dependencies' GitHub project???

...

Additional Active DSpace projects

...

Archive old SVN (for posterity)

  • (tick) Make existing SVN READ-ONLY
  • (tick) Archive the existing SVN (scm.dspace.org hosted by OSUOSL) to http://svn.duraspace.org and make it for posterity.
  • (tick) Post a notice in the archived SVN (at svn.duraspace.org) that codebase is now at GitHub

Additional Cleanup tasks

  • (tick)Turn off any syncing between SVN and GitHub
    • Notify the Committers before making this change!!
  • (tick)Ensure Maven 'License Header' check still works when running 'mvn install' (this is setup in 'dspace-pom')
    • Likely will need to move LICENSE_HEADER to GitHub, maybe into same project as 'dspace-pom'?
    • Also will likely require a new release of 'dspace-pom'
  • (tick)Ensure Maven Release Procedure still works - likely will require a POM update or two.

Mark: I've run a few releases via sonatype and github, it might be good to run a test release or two just to verify.  We just need to update the source management section of the maven parent Pom. I have examplespf these settings in my repo.

Final Steps

  • (tick) Notify community when development has moved completely to GitHub.
  • (tick)Redirect old SVN URLs:
    • (tick) Redirect scm.dspace.org to svn.duraspace.org (if possible, setup this redirect so that old URLs just go directly to that SVN archive)
  • (tick) Contact OSUOSL and let them know we no longer need to use the old SVN server.
  • (In process/ongoing) Build some "GitHub Best Practices" documentation : @ Development with Git

...

Code Block
svnuser = gituser_full_name <gituser_email>
Code Block
KevinVandeVelde = KevinVdV Kevin Van de Velde <kevin@mire.be>
ScottPhillips = scott-phillips <scott.a.phillips@gmail.Scott Phillips <scott@scottphillips.com>
StuartLewis = Stuart stuartlewisLewis <stuart@stuartlewis.com>
ahkim = NAME <USER@DOMAIN>Austin Kim <ahkim@users.sourceforge.net>
aschweer = aschweerAndrea Schweer <schweer@waikato.ac.nz>
azeckoski = Aaron Zeckoski <azeckoski@gmail.com>
benbosman = benbosmanBen Bosman <ben@atmire.com>
bollini = abolliniAndrea Bollini <bollini@cilea.it>
bradmc = Brad NAMEMcLean <USER@DOMAIN><bmclean@duraspace.org>
cjuergen = Claudia cjuergenJuergen <claudia.juergen@tu-dortmund.de>
dchud = NAME <USER@DOMAIN>Dan Chudnov <daniel.chudnov@gmail.com>
dstuve = NAME <USER@DOMAIN>David Stuve <david.stuve@hp.com>
gam = NAME <USER@DOMAIN>Greg McClellan? <gam@users.sourceforge.net>
gam5 = Greg NAME <USER@DOMAIN>McClellan <gam5@users.sourceforge.net>
gcarpent = Grace NAME <USER@DOMAIN>Carpenter <gcarpent@users.sourceforge.net>
grahamtriggs = Graham grahamtriggsTriggs <grahamtriggs@gmail.com>
jimdowning = NAME <USER@DOMAIN>Jim Downing <jim.downing@gmail.com>
jrutherford = NAME <USER@DOMAIN>James Rutherford <jrutherford@users.sourceforge.net>
jtrimble = Jeffrey NAMETrimble <USER@DOMAIN><jatrimble@ysu.edu>
keithg = NAME <USER@DOMAIN>Keith Gilbertson <keith.gilbertson@library.gatech.edu>
kshepherd = Kim kshepherdShepherd <kim.shepherd@gmail.com>
lcs = NAME <USER@DOMAIN>Larry Stone <lcs@users.sourceforge.net>
mdiggory = mdiggoryMark Diggory <mdiggory@atmire.com>
mirceag = Gabriela NAME <USER@DOMAIN>Mircea <gabriela.mircea@utoronto.ca>
mwoodiupui = Mark mwoodiupuiWood <mwood@IUPUI.Edu>
pbreton = NAME <USER@DOMAIN>Peter Breton <pbreton@users.sourceforge.net>
peterdietz = peterdietzPeter Dietz <dietz.72@osu.edu>
pottingerhj = Hardy hardyoyoPottinger <pottingerhj@umsystem.edu>
pvillega = pvillegaPere Villega <pere.villega@gmail.com>
richard-jones = Richard NAMEJones <USER@DOMAIN><richard@cottagelabs.com>
robintaylor = Robin robintaylorTaylor <robin.taylor@ed.ac.uk>
rrodgers = richardrodgersRichard Rodgers <rrodgers@mit.edu>
rtansley = Robert NAMETansley <USER@DOMAIN><roberttansley@google.com>
sands = sandsfishSands Fish <sands@mit.edu>
scottphillips = Scott scott-phillipsPhillips <scott@scottphillips.com>
stuartlewis = Stuart stuartlewisLewis <stuart@stuartlewis.com>
syeadon = NAME <USER@DOMAIN>Scott Yeadon <scott.yeadon@anu.edu.au>
tdonohue = Tim tdonohueDonohue <tdonohue@duraspace.org>