You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

SHARE Hackathon and Community Meeting

July 11-14, 2016
Charlottesville, VA

Monday, July 16 – Hackathon Day 1

Jeff Spies

SHARE version 2.  More specificity about the contents of the database

Need interfaces for SHARE.  SHARE does not want to be an interface to the scholarly work

Data needs discovery and refinement

Rick Johnson

Exciting time to be involved with SHARE

Erin Braswell

OSF work space.  Code at GitHub.

Provider  -> Harvester -> raw_data -> Normalizer -> normalized_data -> changes -> change_set -> versions -> entities

The Harvester gets the data from the provider. Uses date restrictions to get "new" data. The normalizer creates the values that can go into the SHARE data models.  

Title issues:  Unicode, LateX, MS Word, foreign languages.  Attempt to store the language provided by the provider.  Joined fields for titles with multiple titles.  Can be stored as a a list n the extra class.

Normalizers can guess title or identifier or DOI.  Usually conservative normalizers. 

Idea:  data inspectors:  Write elastic searches to get percentages of populated/vacant fields, by provider, by date range.  Would show the density of field values in the normalized data. Could be used to draw control charts of field values density.  Mirror the values.

Idea:  data inspectors:  Identifiers are a problem, often come in "random". 

Idea:  data inspectors:  feed the results back the the providers.  The providers may be able to suggestions enhancers to the harvesters and normalizers.

Documents can be updated – provider's id.  If the metadata comes in for a record that exists, COS versions the record and provides the most current unless the query asks for versions.

See https://staging-share.osf.io/api/

Tuesday – Hackathon Day 2

Wrote the share-data-inspector. Upload to GithHiub and provide link here

Wednesday – Community Meeting Day 1

Keynote Siva Vaidhyanathan, UVa – The Operating System of Our Lives: How Google, Facebook and Apple plan to manage everything

Relationships with technology and information and communication changing rapidly.  Mapping a game onto reality, engaging millions of people immediately into a game – Pokemon Go.  Facebook Live – mapping reality into the virtual world, immediately, effortlessly, in real-time.  Facebook took the video down for an hour, did not anticipate the incident of violence.  1.6B users, leading source of news for many millions of people.  Facebook matches content to people.  Facebook denying its level of power in the world.  Google has the same position – constantly underplaying its role in pointing people at information.

We are collectively dependent on Google.

"The web is dead" – flows of data are not open docs loosely joined.  Most data is moving through proprietary devices and formats. Our concept of the Internet is flawed/primitive.  We have never been comfortable with the concepts of radical openness.



 

  • No labels