Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • DSpaceDirect supports two primary types of usage statistics:
    • Google Analytics (if enabled) - DSpace can use Google Analytics to track page views/accesses with the system. This is similar to using Google Analytics to tracking any other website.
    • DSpace Internal Statistics - DSpace also tracks its own basic Usage, Workflow & Search statistics.
  • DSpace Statistics are available site-wide, or on any specific Community, Collection or Item page.  Based on where you are in the system, you will receive a slightly different statistical report.
  • NOTE: By default, DSpace Internal Statistics are ONLY available to Administrators. If you desire, DuraSpace can instead make them publicly available for your site.  Unfortunately, at this time, there are only two options: private (Admin only) or public.
  • How-To: View Site-Wide DSpace Statistics
    • Browse to the homepage (logged in as an Admin)
    • Notice the "Statistics" box has three options:
      • DSpace Usage Statistics: At the homepage level, this gives a general count of "hits" for individual Items in the system.  The report is extremely basic at this time and only lists the title of the Item.
      • DSpace Search Statistics: At the homepage level, this provides a summary of recent searches performed via the DSpace search box.  There are also some historical reporting options.
      • DSpace Workflow Statistics: At the homepage level, this provides a general count of any approval workflows/reviews that have taken place.  There are also some historical reporting options.  NOTE: If you do not use Workflow Reviewer Roles on any Collections, this report will always be empty.)
  • How-To: View DSpace Community/Collection Statistics
    • Browse to a specific Community or Collection
    • The "Statistics" box has the same three options, but now the results will be specific to this Community/Collection (and provide extra details)
      • DSpace Usage Statistics: At the Community/Collection level, this report provides extra detail about accesses of this Community/Collection page.  You'll now get a summary of total hits to the Community/Collection homepage, along with monthly trends, and top countries/cities (where the hits are coming from)
      • DSpace Search Statistics: At the Community/Collection level, this provides a summary of recent searches performed via the DSpace search box (specific to this Community or Collection).
      • DSpace Workflow Statistics: At the Community/Collection level, this provides a general count of any approval workflows/reviews that have taken place (specific to this Community or Collection) (NOTE: If you do not use Workflow Reviewer Roles on any Collections, this report will always be empty.)
  • How-To: View DSpace Item Statistics
    • Browse to a specific Item
    • The "Statistics" box has just one option, but now the results will be specific to this Item (and provide extra details)
      • DSpace Usage Statistics: At the Item level, this report provides extra detail about usage of the Item.  You'll now get a summary of total hits to the Community/Collection homepage, along with monthly trends, number of file downloads and top countries/cities (where the hits are coming from)
      • NOTE: Items do not have Search/Workflow statistics as both of those statistical reports are only applicable to Communities & Collections.

Bulk/Batch Metadata Editing

  • How-To: View/Analyze Page Visits (hits) in Google Analytics
    • In Google Analytics, this info is under the "Behavior -> Site Content -> All Pages" section. It should allow you to easily see the top pages visited, and filter that list based on a page name or path.
    • For example, to analyze the number of visits/hits on a single item's homepage (i.e the number of people who viewed the item metadata) ..
      • Either filter this list by putting in the URL of the item page into the searchbox
      • OR, click on "Page Title" as the "Primary Dimension" and put the title of the Item into the searchbox
  • How-To: View/Analyze File Downloads in Google Analytics
    • File downloads are recorded as "events" in Google Analytics. So, this info is under the "Behavior -> Events -> Pages" section. By default it'll show the top URLs used to download Items, but it also provides ways to filter that information based on page name or URL.
    • For example, to analyze the number of file downloads for a single item (even if it has multiple files)
      • Either filter this list by putting in the URL of the item into the searchbox,
      • OR, click on "Page Title" as the "Primary Dimension" and put the title of the Item into the searchbox
      • In the results, clicking on the title will bring you to a page that lists downloads per file (if the item has multiple files)

Bulk/Batch Metadata Editing

  • DSpace's "Batch Metadata Editing" tool allows you to export sets of DSpace Item Metadata (all Items or just those in specific Communities/Collections) into a CSV file.  The metadata values/fields in the CSV file can then be edited using Microsoft Excel (or OpenOffice Calc or LibreOffice Calc).  Once editing is complete, you can re-import the modified CSV to apply the metadata changes into DSpace.
  • Step-by-step Tutorials:
  • How-To: Batch Metadata Editing basics
    • Batch Metadata Editing can be performed at a Community or Collection level
    • Browse to a specific Community or Collection. The "Context" menu will display an option for "Export Metadata".
    • Click "Export Metadata".  This will generate a CSV file that contains all the metadata for every Item within that Community or Collection hierarchy. 
      • WARNING: For extremely large communities or collections the export (and import) processes may take a long time (or cause your site to significantly slow down). Therefore, DSpaceDirect currently only allows you to modify 500 Items (i.e. lines in CSV) at a single time.  This 500 item limitation can be increased as needed, but it is not recommended (as it can cause performance issues with your site when using these tools).
    • Edit the CSV using either Microsoft Excel or OpenOffice Calc
      • More information on the CSV / Spreadsheet format is available in the DSpace Documentation section on Batch Metadata Editing
      • EXCEL WARNING: By default, Excel will not open a CSV in Unicode/UTF-8 encoding. This means that special characters may be improperly displayed and also can be "corrupted" during re-import of the CSV.
        • You need to tell Excel this CSV is Unicode, by importing it as follows:
          • Open Excel (and create an empty sheet, if one doesn't open by default)
          • Select "Data" tab
          • Click "From Text" button (in the "External Data" section)
          • Select your CSV file
          • Wizard Step 1
            • Choose "Delimited" option
            • In the "File origin" selectbox, select "65001 : Unicode (UTF-8)"
              • NOTE: these encoding options are sorted alphabetically, so "Unicode (UTF-8)" appears near the bottom of the list.
            • Click Next
          • Wizard Step 2
            • Select "Comma" as the only delimiter
            • Click Next
          • Wizard Step 3
            • Select "Text" as the "Column data format" (Unfortunately, this must be done for each column individually in Excel)
              • At a minimum, you MUST ensure all date columns (e.g. dc.date.issued) are treated as "Text" so that Excel doesn't autoconvert DSpace's YYYY-MM-DD format into MM/DD/YYYY
              • To avoid such autoconversion, it is safest to ensure each column is treated as "Text".  Unfortunately, this means selecting each column one-by-one and choosing "Text" as the "Column data format".
            • Click Finish
    • Perform your edits. Once finished, re-upload the changes to DSpace.
      • You can remove entire columns from the spreadsheet to make it easier to concentrate on editing just a few metadata fields. But, the 'id' (first) column MUST be kept.  
        • Removing an entire column with not delete that metadata (rather DSpace will just ignore it). However, please be careful to remove the ENTIRE column (including the column header). Metadata values are only deleted if you leave the column header in place but clear out one or more values (rows in a column)
      • Some metadata fields may appear duplicated with ISO language tags within the spreadsheet (e.g. "dc.subject" and "dc.subject[en_US]" columns). This is nothing to be concerned about, it simply means that some of your metadata fields specify a specific language and others do not.
        • For example, a "dc.subject" column would include subjects with no language specified; whereas, a "dc.subject[en_US]" column would include subjects with USA English specified as the language, and a "dc.subject[es]" column would include subjects with Spanish specified as the language.
        • You are welcome to move values between these columns.  Moving a value from "dc.subject" to "dc.subject[en_US]" and saving would update that value to include a language specifier of USA English. Similarly, moving a value from "dc.subject[en_US]" to "dc.subject" would update that value to include no language specifier.
      • Many more editing tips are available via the tutorials linked above.
    • Click "Import Metadata" (under "Administrative" menu)
    • Select the CSV
    • Review the changes and save the changes.
      • WARNING: Please make certain that the changes displayed on the Review screen look correct.  Once you save, you will be unable to "undo" the changes without either re-editing to metadata (or if you deleted something entirely it may need to
    • NOTE: A more detailed walkthrough with screenshots of this entire process (and additional hints) is available in the Batch Metadata Editing tutorials linked above

Bulk/Batch Content Uploads

  • DSpace allows you to batch upload content + metadata in a specific Zip package format that DSpace calls the Simple Archive Format.
  • How-to: Create an upload package
  • How-to: Upload the SAF package to DSpace
    • Keep in mind that large packages (over several GB in size) may prove difficult to upload via the web. They may timeout during upload or processing. Therefore, with DSpaceDirect, you may wish to consider creating several separate upload packages (and upload them individually) if you have a larger set of content to upload
    • Step by step upload instructions are available in the DSpace documentation at UI Batch Import (XMLUI). (Note: All DSpaceDirect sites use the DSpace XMLUI user interface)

  • Tips and Common Questions:
    • For any columns that require multiple values (including the filename column), separate the multiple values using a double pipe ||
    • DSpace generates thumbnails for the following formats only:  BMP, GIF, JPG, JPEG, JPEG 2000, PNG, TIFF, and PDF. Thumbnails are auto-generated via an overnight process. So, they will not appear until the next day.  

Individual Item Permission Changes

  • DSpace allows you to modify the permissions on any Item at any time.  This may be useful if an item needs to be access restricted at a later date, or was accidentally deposited as restricted but needs to be made openly available.
    • NOTE: You can also perform batch/bulk permissions changes (for all Items in a single Collection). More information on that can be found below in the section "Bulk/Batch Permissions Changes (Advanced)"
  • How-to: Change Permissions on a single Item
    • Login to your site as an Administrator
    • Locate the Item whose permissions you wish to change
    • Click "Edit this Item" in the "Context" side menu
    • On the "Item Status" tab, click "Authorizations".  This will bring you to a "Policies" page which details the current permissions on this Item.
    • Overview of the "Policies" page:
      • You'll see a listing of all current policies/permissions on the Item, along with all its associated files (called "bitstreams")
        • To modify a policy, click on its "ID".
        • To add a new policy, click the appropriate "Add" link
        • To remove an existing policy, check the checkbox next to the policy, and click the "Delete Selected" button. 
      • Types of Policies:
        • Item Policies - These policies define the permissions on the Item's metadata. In other words, they define who can locate the Item via the DSpace browse & search tools. If the "Item Policies" are limited to a specific Group, then ONLY users who belong to that Group will be able to search/browse to locate that Item.
          • Please note: restricting access to an Item's metadata also means that search engines like Google or Google Scholar will NOT be able to index the Item. That means the Item will NEVER appear in Google or Google Scholar search results.  If you want an Item to appear in Google search results, but do not want everyone to be able to download its file(s) then you should leave the Item policies as unrestricted, while restricting access to individual files (Bitstreams). See below for more info.
        • Bundle Policies ("Policies for Bundle ___") - These policies define permissions across a "Bundle" (loose grouping) of Files.  Bundles in DSpace are mostly "hidden" from users, but they include "ORIGINAL" (the originally uploaded files), "LICENSE" (a copy of any deposit licenses), "TEXT" (optionally, the extracted plain text from originally uploaded files).  Generally speaking, the Bundle permissions here should really just be identical to the permissions you wish to set on the various Files (Bitstreams).
        • Bitstream Policies ("Bitstream __") - These policies define permissions on individual Files (called Bitstreams).  The "READ" permission lets users view/download the specific files in question.
      • Action types:
        • "READ" permissions gives a single group the ability to view or download the Item
        • "WRITE" permissions gives a single group the ability to edit or modify the Item
      • Start and End Dates:
        • Optionally, individual policies can have a start or end dates.  This allows you to set temporary (or expiring) permissions on an Item (e.g. for embargoes).
        • Start Date: Policies will not take affect until the start date has passed
        • End Date: After this date, the specific policy will no longer be in effect. 
        • If there is no Start or End date, the policy is in effect at all times
      • Some Sample Policies:
        • An Item Policy that provides "READ" access for "Anonymous" group.  This policy will allow anyone in the world to access the metadata for this specific Item.  It means that anyone will be able to search/browse for this Item in DSpace (and search engines like Google or Google Scholar will be able to index this Item, so anyone will also be able to find it within Google)
        • An Item Policy that provides "READ" access for only the "On Campus Users" group.  This policy will only allow users who belong to the "On Campus Users" group to browse or search for this Item within DSpace.  This means that all other users will NOT be able to locate this Item within DSpace.  It also means that search engines like Google or Google Scholar will also NOT be able to locate the item (as they will not have access to the Item's metadata).
        • A Bitstream Policy that provides "READ" access for "Anonymous" group. This policy will allow anyone in the world to view or download that particular file (bitstream).  (Please note however that users will be UNABLE to search/browse for the link to the file unless the Item Policy also provides READ access for the Anonymous group.)
        • A Bitstream Policy that provides "READ" access for only the "On Campus Users" group. This policy will only allow users who belong to the "On Campus Users" group to view or download that particular file (bitstream).  (Please note however that users will be UNABLE to search/browse for the link to the file unless the Item Policy also provides READ access for either the "On Campus Users" group OR the "Anonymous" group.)
        • POLICY NOTES: 
          • Administrators can ALWAYS access/edit all Items in the system, no matter what policies are set on the Item or its individual Bitstreams.
          • Policies are always "additive". This means that if you have two Policies, then both permissions are in effect. In other words, a more restrictive policy is always overridden by a less restrictive one (e.g. "READ for Anonymous group" will mean that anyone in the world can view/download the Item, even if there is also a secondary policy stating "READ for 'On Campus Users' group")
      • Some Sample Use Cases:
        • You want an Item to be searchable/browseable publicly and its file(s) to be viewable/downloadable to anyone in the world
          • Set ALL policies to "READ" access for "Anonymous" group
        • You want an Item to be searchable/browseable publicly, but its file(s) should ONLY be viewable/downloadable by members of the "On Campus Users" group
          • Set the Item Policy to be "READ" access for "Anonymous" group
          • Set all Bundle/Bitstream Policies to be "READ" access ONLY for the "On Campus Users" group
        • You want an Item to ONLY be searchable/browseable/downloadable by members of the "On Campus Users" group  (WARNING: In this scenario, the Item will also NEVER appear in Google or Google Scholar as that particular Item is essentially "invisible" to all users except for those in the "On Campus Users" group)
          • Set ALL policies to "READ" access for ONLY the "On Campus Users" group

Bulk/Batch Permissions Changes (Advanced)

  • DSpace provides an "Advanced Policy Manager" (also known as the "item wildcard policy admin tool"), which allows an Administrative user to perform bulk permissions changes to all of the Items or Files (bitstreams) within a specified Collection.
    • For more information on permissions / policy settings in general, please also refer to the section "Individual Item Permissions Changes" above.
  • WARNING: This Advanced Policy Manager is a bit of a "beta-level" tool. It works, but it's not the most user friendly page in DSpace. It's also not the smartest tool, so you need to sometimes take several steps to make the changes you want to make.
  • How-To: Batch Permissions Changes basics
    • Sample Use Case: The easiest way to explain how this tool works is via a common use-case. Suppose that you have a Collection of open access (viewable/readable to anyone in the world) Items which you now want to restrict to only be viewable to a group of users called "On Campus Users". Here's the steps you would take to perform that change:
      • Login to your site as an Administrator

      • Under the "Administrative" side menu, click on "Authorizations" (under Access Control submenu)

      • Just under the box at the top of the page, click the link that says "Click here to go to the item wildcard policy admin tool"

      • Step 1: Remove existing metadata access rights for all Items in the specified Collection.   To do so, fill out the form as follows:

        • Description: (optional, usually is left blank as it's only really useful for bulk changes to embargo)

        • Group: (leave blank in this case as you will remove any existing permissions)

        • Action: READ (you want to remove "READ" access)

        • Content Type: Item (you want to remove READ access on an Item level – this controls metadata access)

        • Collection: [select the collection]

        • Start Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • End Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • CLICK the "Clear Policies" button

        • (NOTE: Even though you get no confirmation screen, the changes will be immediately applied)
      • In Step #1, essentially all we've done is remove access to the Item metadata. The metadata is now only visible (readable) by Administrators.  However, the content files within those Items are unfortunately still accessible (if someone had bookmarked the URL)
      • Step 2: Remove existing content file access rights for all Items in the specified Collection.  To do so, fill out the form as follows:

        • Description: (optional, usually is left blank as it's only really useful for bulk changes to embargo)

        • Group: (leave blank in this case as you will remove any existing permissions)

        • Action: READ (you want to remove "READ" access)

        • Content Type: bitstream (you want to remove READ access on the files, or bitstreams)

        • Collection: [select the collection]

        • Start Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • End Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • CLICK the "Clear Policies" button

        • (NOTE: Even though you get no confirmation screen, the changes will be immediately applied)
      • In Step #2, we've also removed access to the Item files. This means that only Administrators can now access/download any files associated with the Items. Now, we  need to assign NEW permissions for our "On Campus Users" group in the following two steps.
      • Step 3: Give the "On Campus Users" group access to all metadata for all Items in the specified Collection. To do so, fill out the form as follows:

        • Description: (optional, usually is left blank as it's only really useful for bulk changes to embargo)

        • Group: Select the "On Campus Users" group

        • Action: READ (you want to add "READ" access to the selected group)

        • Content Type: item (you want to add READ access on Items)

        • Collection: [select the collection]

        • Start Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • End Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • CLICK the "Add Policies" button

        • (NOTE: Even though you get no confirmation screen, the changes will be immediately applied)
      • In Step #3, we've now given the "On Campus Users" group the ability to read the metadata for all Items in this collection. So, the final step is to also give them the ability to read/download files associated with these Items.
      • Step 4: Finally, give the "On Campus Users" group access to all files for all Items in the specified Collection. To do so, fill out the form as follows:

        • Description: (optional, usually is left blank as it's only really useful for bulk changes to embargo)

        • Group: Select the "On Campus Users" group

        • Action: READ (you want to add "READ" access to the selected group)

        • Content Type: bitstream (you want to add READ access on all files, or bitstreams)

        • Collection: [select the collection]

        • Start Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • End Date: (leave blank - this is only useful for bulk-changes to embargo dates)

        • CLICK the "Add Policies" button

        • (NOTE: Even though you get no confirmation screen, the changes will be immediately applied)
      • At the end of this process, all the Items (and their Files) in the selected Collection will now only be accessible to users who belong to your "On Campus Users" group.  Other non-Administrative users will be presented with an Access Restricted message.
  • DSpace's "Batch Metadata Editing" tool allows you to export sets of DSpace Item Metadata (all Items or just those in specific Communities/Collections) into a CSV file.  The metadata values/fields in the CSV file can then be edited using Microsoft Excel (or OpenOffice Calc).  Once editing is complete, you can re-import the modified CSV to apply the metadata changes into DSpace.
  • Step-by-step Tutorials:
  • How-To: Batch Metadata Editing basics
    • Batch Metadata Editing can be performed at a Community or Collection level
    • Browse to a specific Community or Collection. The "Context" menu will display an option for "Export Metadata".
    • Click "Export Metadata".  This will generate a CSV file that contains all the metadata for every Item within that Community or Collection hierarchy. 
      • WARNING: For extremely large communities or collections the export (and import) processes may take a long time (or cause your site to significantly slow down). Therefore, it is recommended to try to edit less than 500 Items (i.e. lines in CSV) at a single time.
    • Edit the CSV using either Microsoft Excel or OpenOffice Calc
      • EXCEL WARNING: By default, Excel will not open a CSV in Unicode/UTF-8 encoding. This means that special characters may be improperly displayed and also can be "corrupted" during re-import of the CSV.
        • You need to tell Excel this CSV is Unicode, by importing it as follows:
          • Open Excel (with an empty sheet open)
          • Select "Data" tab
          • Click "From Text" button (in the "External Data" section)
          • Select your CSV file
          • Wizard Step 1
            • Choose "Delimited" option
            • In the "File origin" selectbox, select "65001 : Unicode (UTF-8)"
            • Click Next
          • Wizard Step 2
            • Select "Comma" as the only delimiter
            • Click Next
          • Wizard Step 3
            • Select "Text" as the "Column data format"
            • Click Finish
    • Perform your edits. Once finished, re-upload the changes to DSpace
    • Click "Import Metadata" (under "Administrative" menu)
    • Select the CSV
    • Review the changes and apply
    • NOTE: A more detailed walkthrough with screenshots of this entire process (and additional hints) is available in the Batch Metadata Editing tutorials linked above