A more up-to-date version of the DSpace FAQ is now available on the DSpace.org Website.

This page addresses the most commonly asked questions about DSpace. See also the TechnicalFaq page for answers to technical questions about DSpace.

About DSpace

What is DSpace?

A groundbreaking digital repository system, DSpace captures, stores, indexes, preserves and redistributes an organization's research material in digital formats. Research institutions worldwide use DSpace for a variety of digital archiving needs - from institutional repositories (IRs) to learning object repositories or electronic records management, and more. DSpace is freely available as open source software you can customize and extend. An active community of developers, researchers and users worldwide contribute their expertise to the DSpace Community.

Who can join the DSpace Community?

Anyone who uses DSpace can get involved in a number of different ways: programming, defining feature requirements, writing documentation, testing new features, or sharing your design or marketing expertise. Get involved by joining the DSpace mailing lists, adding your projects, experiences, and comments to the DSpace Wiki, and collaborating with other DSpace community members.

Who built DSpace?

The MIT Libraries and Hewlett-Packard (HP) originally developed DSpace, but the software is now supported by DuraSpace. The system is now freely available to research institutions world-wide as an open source system that can be customized and extended.

Who manages DSpace?

DSpace is freely available as open source software. The DSpace Community manages the code base and releases new versions of the software. An active community of developers, researchers and users worldwide contribute their expertise to the DSpace Community.

How is DSpace different from other digital repositories?

DSpace is the first digital repository to address the myriad issues inherent in a multi-disciplinary archive, including:

  • Differing policies, practices and cultures established by individual disciplines
  • The variety of digital formats produced in today's multi-media research environments
  • The complexity of metadata standards needed to accommodate and maintain access to the digital formats supported by the system.

DSpace is designed with a flexible storage and retrieval architecture adaptable to a multitude of data formats and distinct research disciplines, known as "communities." Each community has its own customized user portal that can use the community's own practices and terminology.

Open Source

Is DSpace free?

Yes. The DSpace system is freely available as open-source software (see locations below), under the terms of the BSD distribution license. We have also tried to find good open-source tools to package with the DSpace application, all freely available under an open-source license (although not all the same license as the one for DSpace itself), so that you get a complete system along with the part that we created.

Who can download the software?

Open-source systems like DSpace are available for anyone to download and run at any type of institution, organization, or company (or even just an individual). Users are also allowed to modify DSpace to meet an organization's specific needs. The BSD distribution license describes its specific terms of use.

DSpace is freely available as open-source software from SourceForge. For more information on the most recent release of the software see the Current Release Notes.

Where can I download the DSpace open-source software?

DSpace is freely available as open-source software from SourceForge.

Can I change the DSpace system?

Yes, you can customize and extend the system to suit your organization's needs. DSpace was designed to make adapting it for individual organizations as easy as possible. See the section on how to contribute at the DSpace Wiki for information on submitting code changes to DSpace. Each application is different, but most organizations need to customize the authentication system, for example, to work with existing systems. Some organizations may want to substitute the open-source tools supplied with DSpace with different ones (for example, replacing PostgreSQL with MySQL or Oracle).

Where can I learn more about Open Source?

O'Reilly & Associates has a very helpful web site devoted to open source: http://opensource.oreilly.com/.

Functionality

What kind of content does DSpace support?

DSpace accepts all manner of digital formats. Some examples of items that DSpace can accommodate are: documents (such as articles, preprints, working papers, technical reports, and conference papers), books, theses, data sets, computer programs, visualizations, simulations, and other models, multimedia publications, administrative records, published books, overlay journals, bibliographic datasets, images, audio files, video files, reformatted digital library collections, learning objects, and web pages.

What are DSpace Communities and Collections?

Each DSpace service is comprised of Communities – groups that contribute content to DSpace – and Communities in turn each have Collections, which contain the content items, or files. In a university environment, for example, Communities might be departments, labs, research centers, schools, or some other administrative unit within an institution. Communities determine their own content guidelines and decide who has access to the community's contributions. An administrator on the DSpace team, usually the DSpace User Support Manager, works with the head of a community to set up workflows for content to be approved, edited, tagged with metadata, etc. Collections belong to a community or multiple communities (for example, research collaborations between two communities may result in a shared collection) and house the individual content items and files.

Can I export my digital material out of DSpace?

Yes. Currently DSpace supports exporting digital content, along with its metadata, in a simple XML-encoded file format. The DSpace developers are working on migrating this export capability to use the METS standard, but are waiting for some necessary extension schemas to emerge (such as one for qualified Dublin Core metadata, and one for minimal technical/preservation metadata for arbitrary digital objects).

Will DSpace interoperate with other systems running at my organization?

Yes, DSpace has documented Java APIs you can customize to allow interoperation with other systems an institution might be running (for example, a department's web document system auto-depositing in DSpace, or a campus data warehouse).

What sort of persistent identifiers does DSpace use?

DSpace uses the Handle System from CNRI to assign and resolve persistent identifiers for each digital item. Handles are URN-compliant identifiers. The Handle resolver is an open-source system used in conjunction with DSpace. The developers chose to use handles instead of persistent URLs to support citations to items in DSpace over very long time spans - longer than we believe the HTTP protocol will last. Handles in DSpace are currently implemented as URLs, but can also be modified to work with future protocols.
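As a rough illustration of how a handle can be expressed as a URL today, the sketch below builds a link through the public Handle System proxy at hdl.handle.net. The handle 123456789/42 and the function name are made up for the example and do not refer to a real item; this is not DSpace's own code.

```python
from urllib.parse import quote

# Public proxy resolver for the Handle System.
HANDLE_PROXY = "https://hdl.handle.net"


def handle_to_url(handle: str, proxy: str = HANDLE_PROXY) -> str:
    """Turn a handle of the form 'prefix/suffix' into a resolvable proxy URL."""
    prefix, separator, suffix = handle.partition("/")
    if not separator or not prefix or not suffix:
        raise ValueError(f"not a valid handle: {handle!r}")
    # Percent-encode characters that are unsafe in a URL path.
    return f"{proxy}/{quote(prefix)}/{quote(suffix, safe='/')}"


# Hypothetical handle, used only for illustration.
print(handle_to_url("123456789/42"))  # https://hdl.handle.net/123456789/42
```

Because the handle itself (123456789/42) is independent of the proxy address, a citation that records the handle can outlive any particular URL scheme; only the `proxy` argument would need to change.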

How does DSpace preserve digital material?

DSpace identifies two levels of digital preservation: bit preservation, and functional preservation. Bit preservation ensures that a file remains exactly the same over time - not a single bit is changed - while the physical media evolve around it. Functional preservation goes further: the file does change over time so that the material continues to be immediately usable in the same way it was originally while the digital formats (and physical media) evolve over time. Some file formats can be functionally preserved using straightforward format migration, such as TIFF images or XML documents. Other formats are proprietary, or for other reasons are much harder to preserve functionally. No one can predict the formats all users will choose for their research material. They use the best tools for their purposes, and research institutions will get whatever formats those tools produce. For this reason, there are three levels of preservation for a given format: supported, known, or unsupported.

  • Supported formats will be functionally preserved using either format migration or emulation techniques. Examples include TIFF, SGML, XML, AIFF, and PDF.
  • Known formats are those that we can't promise to preserve, such as proprietary or binary formats, but which are so popular that third party migration tools will likely emerge to help with format migration. Examples include Microsoft Word and Powerpoint, Lotus 1-2-3, and WordPerfect.
  • Unsupported formats are those that we don't know enough about to do any sort of functional preservation. This would include some proprietary formats or a one-of-a-kind software program.

For all three levels, DSpace does bit-level preservation so that "digital archaeologists" of the future will have the raw material to work with if the material proves to be worth that effort.
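Bit preservation is commonly verified by comparing a file's current checksum with one recorded when the file was deposited. The following is a minimal sketch of that idea, not DSpace's own implementation; the function names, the choice of SHA-256, and the file name report.pdf are assumptions made for the example.

```python
import hashlib
import tempfile
from pathlib import Path


def file_checksum(path: Path, algorithm: str = "sha256") -> str:
    """Return the hex digest of a file, reading it in chunks."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as stream:
        for chunk in iter(lambda: stream.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_unchanged(path: Path, recorded: str, algorithm: str = "sha256") -> bool:
    """True if the file still matches the checksum recorded at deposit time."""
    return file_checksum(path, algorithm) == recorded.lower()


# Demonstration with a throwaway file standing in for a deposited item.
with tempfile.TemporaryDirectory() as workdir:
    item = Path(workdir) / "report.pdf"
    item.write_bytes(b"original bits")
    recorded = file_checksum(item)       # stored when the item is deposited
    print(is_unchanged(item, recorded))  # True
    item.write_bytes(b"original bitz")   # simulate a single corrupted byte
    print(is_unchanged(item, recorded))  # False
```

A check like this only detects that bits have changed. Restoring them still depends on having a good copy elsewhere, such as a backup.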

Technical

Where can I find DSpace technical documentation?

You can find DSpace system documentation on the SourceForge project web site or on the DSpace Technology page.

I've installed DSpace and I have questions/problems/comments. What should I do?

The DSpace Community of developers support one another and exchange ideas and solutions on the DSpace mailing lists. Before you post a question or problem, check to see if your question has been answered already. Start by searching the DSpace mailing list archives. Then read the Technical FAQ and check the technical documentation. If you still haven't found an answer or solution, post your questions to DSpace-tech, where members of the DSpace community will offer their assistance.

I've found a bug in the software. How do I report it?

You can report bugs and suggest enhancements through the SourceForge system. Bugs will be fixed as soon as possible, within the limits of the DSpace team's technical support resources. The team considers all enhancements, and if an enhancement is accepted, adds it to the enhancement list for development as time and resources allow. Of course, any users working with the open-source code are welcome to fix a bug or make an improvement to the system. See our DSpace Community Development Guidelines to learn how.

Who supports DSpace?

DSpace has a very active community of developers who contribute expertise and support through the DSpace listserv at SourceForge, DSpace-Tech, and the project wiki. To really take advantage of the DSpace system you'll need local technical resources (hardware, technical experts, and so on). The DSpace web site offers technical documentation, and you can join the DSpace listserv, DSpace-Tech, to ask questions or post solutions.

What sort of hardware does DSpace require? What about sizing the server? How much disk space do I need?

There are no specific server requirements for DSpace. Because the application is written in Java, it will run on any Operating System (UNIX, Windows, Mac OSX), though most people tend to run it on UNIX. DSpace is built on top of free, open-source tools, such as the Apache Web server, the Tomcat Servlet engine, and the PostgreSQL relational database system. For your convenience, we package the necessary JDBC and other drivers and libraries together with DSpace. This set of tools should run on any UNIX-type OS, such as Linux, HP/UX, or Solaris, and you can substitute other libraries if you need to run on another platform. The system runs on anything from a laptop to a $500K server, but there are a few general recommendations for hardware architectures. For a research university, DSpace requires a reasonably good server (see below) and a decent amount of memory and disk storage. Some examples from the community (not necessarily endorsements):

  • HP Server rx2600, powered by dual 64-bit Intel Itanium 2 processors (900MHz), 2GB RAM, 26 GB internal disk storage.
  • HP StorageWorks Modular SAN Array 1000 (msa1000) with a single high-performance controller. Options include a second controller and, with the addition of two more drive enclosures, controls up to 42 Ultra2, Ultra 3, or Ultra320 SCSI drives. Total capacity can be six terabytes. Cost starts around $40K and goes up to around $1.8M.
  • SunFire 280R Server, two 900MHz UltraSPARC-III Cu processors, 8MB E-cache, 2GB memory, two 36GB 10,000rpm HH internal FCAL disk drives, DVD, 436-GB, or 12 x 26.4 Gbyte 10K RPM disks, Sun StorEdge A1000 rackmountable w/ 1 HW RAID controller, 24MB std cache. Around $30K.
  • Dell PowerEdge 2650 with dual Xeon processors (2.4GHz), 2GB RAM, 2x73GB scsi disks. One 2.5TB Apple XServe. A DLT tape library to back up the DB/jsps etc. Around $10K.

Of course, your mileage (and costs) will vary depending on what you plan to do with the system.

How much time does it take to set up a DSpace installation?

A person familiar with installing Java and database based open source applications should be able to complete a prototype DSpace installation in a few hours to a day. After this experience, and an additional day to a week of exploring the software, a production installation should also take about a day for the basic software in a standalone configuration. If customizations or local integrations are required (for example, for user authentication), then additional time should be allocated according to the complexity and quantity of the changes required.

How much maintenance does a DSpace instance require?

While the DSpace software itself requires very little maintenance, the usual IT overhead for data backup, etc. applies. It is also typical for expectations and requirements to evolve once the instance is operational. Initial system usage often exposes areas for reorganization, metadata correction, and the inevitable requests to remove prematurely or incorrectly deposited items. An ongoing maintenance estimate should incorporate IT time based on the expected size of the repository to backup, ongoing local development based on the amount of customization, and repository content management based on the anticipated rate of deposit.

Can anyone help me to set up or install DSpace?

Metadata

What is Metadata?

Metadata is literally "data about data." It is descriptive information used for querying. Some metadata can be generated mechanically, such as file sizes, checksums, and full-text indexes. Other metadata is a higher order of human-made description such as titles, authors, unique identifiers, and abstracts. DSpace uses a qualified version of Dublin Core metadata across all content. Some communities or collections may also have tailored metadata available (such as MARC records for book collections, or FGDC records for geographic datasets). But even where that's available for some items, we crosswalk more detailed metadata records into our Dublin Core vocabulary to ensure a common layer of descriptive specificity for browsing and searching across everything.
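To make the idea of a crosswalk concrete, here is a small sketch that maps a few MARC fields onto qualified Dublin Core names. The mapping table is illustrative only and is not DSpace's actual crosswalk; the record layout (a dictionary of MARC tags to lists of values) and the function name are likewise assumptions for the example.

```python
from typing import Dict, List

# Illustrative mapping only: a handful of MARC tags and the qualified
# Dublin Core fields they might be crosswalked to.
MARC_TO_DC = {
    "245": "dc.title",
    "100": "dc.contributor.author",
    "520": "dc.description.abstract",
    "650": "dc.subject",
}


def crosswalk(marc_record: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Map a simplified MARC record onto Dublin Core field names."""
    dc_record: Dict[str, List[str]] = {}
    for tag, values in marc_record.items():
        field = MARC_TO_DC.get(tag)
        if field is None:
            continue  # no mapping: this detail stays only in the original record
        dc_record.setdefault(field, []).extend(values)
    return dc_record


# Hypothetical record, used only for illustration.
record = {"245": ["Annual Report"], "100": ["Smith, Jane"], "999": ["local note"]}
print(crosswalk(record))
```

Fields without a mapping (tag 999 here) are simply left out of the Dublin Core view, which is why the more detailed record is worth keeping alongside it.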

What metadata standards does DSpace support? Can I create metadata using the [SCORM or VRA or FGDC or MARC or myOwnSchema]?

In this context support for a given metadata schema means that metadata can be entered into DSpace, stored in the database, indexed appropriately, and made searchable through the public user interface. This currently applies mainly to descriptive metadata, although as standards emerge it could also include technical, rights, preservation, structural, and behavioral metadata.

Currently DSpace supports only the Dublin Core metadata element set with a few qualifications conforming to the library application profile (see DSpace Metadata). HP and MIT also have a research project called SIMILE, which is investigating how to support arbitrary metadata schemas using RDF as applied by the Haystack research project in the Lab for Computer Science and some of the Semantic Web technologies being developed by the W3C.

Does DSpace support OAI?

DSpace supports the Open Archives Initiative's Protocol for Metadata Harvesting (OAI-PMH) v2.0 as a data provider. OAI support was implemented using OCLC's OAICat open-source software to make DSpace item records available for harvesting. DSpace@MIT is registered as a data provider with the Open Archives Initiative. Other institutions running DSpace may choose to turn on OAI or not, and to register as a data provider or not.
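For readers unfamiliar with OAI-PMH, a harvester talks to a data provider using plain HTTP requests with a `verb` parameter. The sketch below builds a `ListRecords` request URL using parameters defined by the OAI-PMH 2.0 specification (`metadataPrefix`, `set`, `from`). The base URL is a placeholder, since the actual address depends on how a given repository is deployed, and the function name is made up for the example.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(
    base_url: str,
    metadata_prefix: str = "oai_dc",
    set_spec: Optional[str] = None,
    from_date: Optional[str] = None,
) -> str:
    """Build an OAI-PMH ListRecords request URL for a data provider."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec      # restrict harvesting to one set
    if from_date:
        params["from"] = from_date    # only records changed since this date
    return f"{base_url}?{urlencode(params)}"


# Placeholder base URL: substitute the OAI address of a real repository.
print(list_records_url("https://repository.example.edu/oai/request"))
```

`oai_dc` (unqualified Dublin Core) is the one metadata format every OAI-PMH data provider is required to support, which is why it is used as the default here.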

Building a DSpace Service

Where can I find information on how to build a DSpace service?

For technical information, see the technical documentation and join the DSpace-Tech mailing list. For non-technical questions and concerns, see the DSpace.org web site, specifically the section on institutional repositories: Building an IR with DSpace. You'll also want to join the DSpace-General mailing list for announcements and information for DSpace teams.
Another helpful resource is the workbook, Creating an Institutional Repository, written by Mary Barton and Margaret Waters for the LEADIRS seminar series in the UK, available at http://dspace.org/implement/leadirs.pdf.

Is there a list of live DSpace services?

See the list of DspaceInstances on this site. If your institution is running DSpace and isn't listed, please add your site to the list.

Is there a list of DSpace Service Providers?

Yes, see the ServiceProviders page for a list of consultants and organizations who can help you build and run your DSpace service.

Do I have to name my service "DSpace"?

No. With the release of DSpace version 1.3, it has become much easier to create a unique name for your repository. All the language in the user interface resides in one file, to make it easier to modify and translate. You just need to replace "DSpace" with the name of your repository in the file config/language-packs/Messages.properties. Note: You must be running DSpace 1.3 or higher to use this solution.
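If you would rather script the replacement than edit the file by hand, something along these lines could work. This is a sketch under simplifying assumptions: it treats the file as simple `key = value` lines, changes only the values (so message keys stay intact), and ignores Java properties features such as continuation lines. The sample keys and the repository name are invented for the example.

```python
def rename_in_properties(text: str, new_name: str, old_name: str = "DSpace") -> str:
    """Replace old_name with new_name in the values of key=value lines."""
    result = []
    for line in text.splitlines(keepends=True):
        stripped = line.lstrip()
        # Leave comments and lines without a key/value separator untouched.
        if stripped.startswith(("#", "!")) or "=" not in line:
            result.append(line)
            continue
        key, _, value = line.partition("=")
        result.append(f"{key}={value.replace(old_name, new_name)}")
    return "".join(result)


# Hypothetical file contents, used only for illustration.
sample = (
    "# DSpace interface messages\n"
    "example.home.title = DSpace Home\n"
    "example.footer.text = Powered by DSpace\n"
)
print(rename_in_properties(sample, "My Repository"))
```

Keep a copy of the original file and review the result before deploying it, since some messages may need to keep the DSpace name (for example, credits to the software itself).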

How do I add an administrator through the admin UI?

To make someone an administrator, go into the admin interface and click 'groups'. Then click 'Edit' next to the 'Administrator' button, then click 'Select E-people', find the person you want to make an administrator, click 'Add' next to their name, then close the pop-up and click 'Update Group' on the Edit Group page.

What kinds of DSpace services are other institutions building?

Research institutions worldwide use DSpace to meet a variety of digital archiving needs:

  • Institutional Repositories (IRs)
  • Learning Object Repositories (LORs)
  • eTheses
  • Electronic Records Management (ERM)
  • Digital Preservation
  • Publishing
  • and more

Where can I find information on Digital Preservation?

There are several good resources available. Start by reading Paul Wheatley's article "A way forward for developments in the digital preservation functions of DSpace : options, issues and recommendations".

DSpace Content

What kind of content does DSpace support?

DSpace accepts all manner of digital formats. Some examples of items that DSpace can accommodate are:

  • Documents, such as articles, preprints, working papers, technical reports, conference papers
  • Books
  • Theses
  • Data sets
  • Computer programs
  • Visualizations, simulations, and other models
  • Multimedia publications
  • Administrative records
  • Published books
  • Overlay journals
  • Bibliographic datasets
  • Images
  • Audio files
  • Video files
  • Reformatted digital library collections
  • Learning objects
  • Web pages

What copyright do I own? What is a license? And what is a Creative Commons license?

If you have original works that you have created, including photos you have taken, you can overcome traditional copyright restrictions by selecting a 'license' that tells other folks what they are or are not allowed to do with the works. US Copyright restricts the use of materials by others unless the user hunts you down and explicitly asks you for permission to use your content. Creative Commons is a group founded by lawyers in academia (Stanford, Duke and other universities) that has defined alternative licenses whereby you retain the copyright but you attach the license to your content so people know what they can or cannot do with it. That way they don't have to go find you and ask permission each time. They still have to give you credit for the content whenever/however they use it or they violate the license agreement and are subject to legal recourse. But if a professor at another institution wants to use some of your photos or descriptive works in his/her course or a musician wants to include your photo on his/her album cover they can do so if your license says it's ok AND they give you attribution for the work (e.g. image by <your name goes here>). You select different options when you assign a creative commons license to a work and, based on the options you select, a license is generated. Here's a link to the license descriptions that can be generated based on your selections:
Meet the Licenses.
You can get to the full legal explanation through a link, but it's usually only lawyers who are interested in this. The most open license is the 'Attribution' license. With this, you receive the greatest exposure for your work since it can be distributed anywhere or modified to someone's specific needs while still giving you credit for its creation.
In DSpace, we assign the copyright license to the work at the time we submit it. There's a creative commons form built into DSpace that allows you to identify the license to be used with the item so people can know what they're allowed to do with it. Here's a link to the same form on the Creative Commons web site, with additional links that give you further explanations:
License selection.
So while you're traveling around the world, take LOTS of pictures! You can then make them available for others to use so people don't have to keep paying image dealers for pictures. Education will improve since these assets will be available to people who can't afford to buy these images, and people will LOVE you because they'll see that you're the person who's letting them use the image. You don't lose the copyright, you don't sign it over to anyone, but you make the work available to folks within the terms you identify for the license. You can put them in your DSpace so that they're managed in appropriate collections, have metadata assigned to them, and can be found by search engines like Google when people are looking for images of, say, the Colosseum.

Interoperability

Can I import items into DSpace in batch mode?

Yes. See overview.

Can I export my digital material out of DSpace?

Yes. Currently DSpace can export digital content, along with its metadata in a simple XML-encoded file format or METS.

Will DSpace interoperate with other systems running at my organization?

Yes, DSpace has documented Java APIs you can customize to allow interoperation with other systems an institution might be running (for example, a department's web document system auto-depositing in DSpace, or a campus data warehouse).

What sort of persistent identifiers does DSpace use?

DSpace uses the Handle System from CNRI to assign and resolve persistent identifiers for each digital item. Handles are URN-compliant identifiers. The Handle resolver is an open-source system used in conjunction with DSpace.
The developers chose to use handles instead of persistent URLs to support citations to items in DSpace over very long time spans - longer than we believe the HTTP protocol will last. Handles in DSpace are currently implemented as URLs, but can also be modified to work with future protocols.

What metadata standards does DSpace support? Can I create metadata using the SCORM/VRA/FGDC/MARC or myOwnSchema?

In this context support for a given metadata schema means that metadata can be entered into DSpace, stored in the database, indexed appropriately, and made searchable through the public user interface. This currently applies mainly to descriptive metadata, although as standards emerge it could also include technical, rights, preservation, structural, and behavioral metadata.
Currently DSpace supports only the Dublin Core metadata element set with a few qualifications conforming to the library application profile (see DSpace Metadata). The DSpace team hopes to support a subset of the IMS/SCORM element set (for describing education material) in the coming year.
HP and MIT also have a research project called SIMILE, which is investigating how to support arbitrary metadata schemas using RDF as applied by the Haystack research project in the Lab for Computer Science and some of the Semantic Web technologies being developed by the W3C.

Does DSpace support OAI?

DSpace supports the Open Archives Initiative's Protocol for Metadata Harvesting (OAI-PMH) v2.0 as a data provider. OAI support was implemented using OCLC's OAICat open-source software to make DSpace item records available for harvesting. DSpace@MIT is registered as a data provider with the Open Archives Initiative. Other institutions running DSpace may choose to turn on OAI or not, and to register as a data provider or not.

Does DSpace support JSR-170?

Currently JSR-170, which is a standard Java API, isn't implemented anywhere in DSpace. We've looked at JSR-170 from two perspectives. One is where DSpace itself acts as a JSR-170 repository, and other apps can use the JSR-170 interface to store and retrieve things in DSpace. Since JSR-170 is a Java API and not (yet) a network API (or "Web Service") we haven't implemented that; the closest is probably the Lightweight Network Interface which should be appearing in DSpace 1.5, or SRW/U for search. If you want another app to interface to DSpace, these are probably the best way forward right now.

The other way we've been looking at JSR-170 is for internal use in DSpace, i.e. DSpace itself would use a storage layer with a JSR-170 interface to store things. That possibility is still on the table, though there are no immediate plans for that right now; there are some issues to do with the DSpace data model that need resolving first.

For More Information

Still have questions? For general questions about DSpace and DuraSpace, you can search the DSpace-General mailing list archives or post a question to the DSpace-General mailing list.

For technical or software questions, see the TechnicalFaq and the DSpace system documentation. You can also search the DSpace-tech archives or post a question to the DSpace-tech mailing list.
