Individual Institution Information

Institution

Cornell

Indiana

Ponce

Scripps

UF

Weill

WUSTL

Size (# of people)

~5000 faculty

~2300 faculty, 40,000 students (IU Bloomington only)

411 faculty, 622 students

~2996

~4500 faculty, ~51,000 students

~5000
refined

  1. PND

 

  1. of Affiliates/
  2. of additional people

~12K

 

not sure. under discussion

~7 affiliates,
300-700 people

 

~50 affiliates
fewer core

 

Scope (academic depts?)

Cross discipline, faculty, researchers, grad students (under discussion)

Under discussion

Four programs (MD, Biomedical Doctorate, Public Health, Psychology)

Faculty, ~254

15 departments | Faculty, ( other groups still under discussion), 16 colleges, 124 departments | Faculty,
Med and grad students
? research staff | |

Nature of Institution

Mixed private and state, land grant, research

Public research university

private, non-profit

Private

non-profit
research
institute | Public, land grant, research university | | |

All Data Sources

HR: LDAP
Courses: PeopleSoft
Pubs: Activity Insight, Pubmed (coming), other

Grants: OSP | HR: PeopleSoft,

Courses: Sakai,
Pubs: Faculty Annual Report system,
others under investigation/discussion

OSRPP, Human Resources, MIS, Academic Affairs

MySQL database

HR: PeopleSoft
Courses: moving
to Sakai
Pubs: EDIS,

EndNote Web

Grants: DSR
others under investigation

HR: SAP
Courses: Angel
Faculty: WOFA
Pubs: RPS/POPS
Res: Coeus
others...

HR: PeopleSoft

Technical Environment

Redhat Linux, Apache, Tomcat, Java, MySQL, CUWebAuth

Linux/Intel, Apache, Tomcat, Java, MySQL

Windows

Linux, Windows, Sun/Solaris

Linux

Java, Jira, LDAP/AD, Nexus, source safe, sub-version, cold fusion, oracle, MSSQL, windows, AIX, Solaris, ...

Linux

Authentication

Kerberos

CAS, Kerberos

Active Directory

LDAP, Active Directory, Shibboleth

Shibboleth

LDAP, Active Directory

 

System Environment by Site

Institution

    1. of Servers*

      Servers (Physical or Virtual)

      Types of Server

      Server OS

      Tomcat Version

      MySql Version

      Web Addresses

      Google Analytics included?

Cornell (Developers)

1 production, several development and test servers in department

Physical

  • Production
  • Development
  • Test | RHEL 5+ | 6+ | 5+ | http://vivo.cornell.edu | Pre-NIH grant |

    Indiana

    3

    Virtual

  • Production
  • Staging
  • Development | RHEL 5.6 | 6 | 5.0.77 | http://vivo.iu.edu | 02/11/10 |

    Ponce

    2

    Virtual

  • Production
  • Test | Windows Server 2008 R2 | 6 | 5 | http://vivo.psm.edu | 02/10/10 |

    Scripps

    2

    Virtual

Data By Site

Institution

Data Collection Update

Organizational Data

People/HR Data

Grant Data

Publications

CV's

BioSketchs

Local Public Data (local repositories)

Cornell (updated 3/1/2011)

New processes for: LDAP (HR), Registrar (courses), OSP warehouse (Grants), Activity insight Still exists
legacy faculty reporting tool, self-editing

Updated (existed before the grant)

ingested, up-to-date

Prod: 11,724
Test: 55, 927 | ingested, old data in, new ingest in test, will move to prod very soon.

Prod: 3,359
Test: 28,325 | We have the deprecated text fields and some linked data. Working on getting harvester to work with pub med. Activity Insight publications (30K) in test, will move to prod very soon.Academic articles

Prod: 2,043
Test: 29,355 | Students and administrative staff help out with CV input, but we don't have a system like UF. | no | No institutional repository but in discussion for eCommons. |

Indiana

Summary:We have 2.5K+ IUB faculty in our production instance. This includes people/HR data, course/teaching data, and organization data ingested from our data warehouse.

Next up is expansion to include faculty on IU's other campuses, with IUPUI being the next source to tackle.

Work is currently being done to import about 55 profiles with richer data from the Networks and Complex Systems Research VIVO instance at IU.

Collaboration is underway with the CTSI Hub. We have shared basic VIVO profile data for IUPUI campus faculty to seed Hub profiles. Work on collecting research interest data via the Hub and importing into VIVO is in progress.

Collaboration is underway with the Pervasive Technology Institute. We have their publication data set and are waiting for the next release of the Harvester to use its MOD XML ingest abilities on this data. | IU's data warehouse. Different areas of data are controlled by data managers, who grant access to the data in their area. Access to the data is distinct from permission to publicly publish it to VIVO.

Various other local data stores (e.g. CTSI Hub, PTI publications data, possibly the IUPUI Medical School faculty reporting system) |

Ponce

Collected data from Active Directory, HR Database, OSRPP office, internal web pages, pubmed and NIH Reporter

manually entered

Data Ingested from an exported of Active Directory

Some data entered manually

Testing harvester on test instance. Some data entered manually on production

Data Colected from OSRPP Office and faculty web pages and manually entered

Data Collected from OSRPP office and Faculty web pages and manually entered

Data entered Manually

Scripps

Collected data from HR files, existing web profiles and publication data from PubMed.

02/12/10 Data manually entered on test server and copied to production server

02/12/10 manually entered on test server and copied to production server

sample data entered

sample data entered; will ingest at the institutional level upon release of the PubMed ingest procedure

sample data entered for showcase department; educational data manually entered from faculty web profiles

no

n/a: No institutional repository

UF

Collected HR data from Enterprise Reporting (PeopleSoft),
Grant data from DSR,
and publication data from PubMed

manually curated and updated. We will be doing an ingest in the near future - to include all dept IDs.

Ingested using harvester .7. Currently in process of refreshing with 1.0

ingested using .7. currently refreshing with .7.

PubMed ingest using .7. Refresh in process with 1.0.

phase I (health science) departments completed. phase II (sciences and liberal arts) beginning.

One 'showcase' department only has biosketchs (no CVs)

not yet

Weill Cornell

Continuing with examination of data sources. Already collected some researcher and faculty data from People, OFA, POPS (internal WCMC system data sources). Pending approval to collect teaching data from Angel and people data from SAP.

Originally manually entered. Will be replaced by database driven data. Once pass the test stage, will use Harvester for data ingest in production.

Acquired. Currently in test stage to ingest name, position, biography using Harvester. Will continue with education background, research overview, service, address. Once pass the test stage, will use Harvester for data ingest in production.

Some grant data added using the Advanced Data Tools of the VIVO interface. Will be ingested using Harvester after passing the test stage. Currently examining the data from Coeus.

Will use Harvester for ingest. Tested ingest on test site.

not yet

not yet

not yet

WUSTL

will upload data into production once every 8 weeks.

manually entered

Programmatically ingested for all faculty members for which we have permission (>50% effort).

Ingested data from NIH Reporter, however there are a few issues. Our group has the WUSM grant data, but we are acquiring permission to use it as well as receive grant abstracts.

Programmatically ingested from PubMed (>32K articles)

Two departments/divisions entered, two Centers.

Requested from all departments and divisions. Will work on a script to programmatically ingest the data.

not yet