Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. Ralph O'Flinn
  2. Jim Blake
  3. Huda Khan
  4. Tim Worrall
  5. Don Elsborg (star)
  6. Kitio Fofack
  7. Andrew Woods
  8. Mike Conlon (star)
  9. Christian Hauschke
  10. Alex Viggio
  11. Brian Lowe
  12. Benjamin Gross

Agenda

  1. Report from the field: ElasticSearch instead of Solr (Jim Blake )

  2. Documenting ingest approaches
  3. Sept sprint planning
  4. Active tickets:
    1. Jira
      serverDuraSpace JIRA
      serverIdc815ca92-fd23-34c2-8fe3-956808caf8c5
      keyVIVO-1501
       (pending response - Benjamin Gross )
    2. Jira
      serverDuraSpace JIRA
      serverIdc815ca92-fd23-34c2-8fe3-956808caf8c5
      keyVIVO-1524
       (Muhammad Javed - to review)
    3. Jira
      serverDuraSpace JIRA
      serverIdc815ca92-fd23-34c2-8fe3-956808caf8c5
      keyVIVO-1451
       (Kitio Fofack - where does this stand?)
  5. Planning for a demo and walk-through of: 
    Jira
    serverDuraSpace JIRA
    serverIdc815ca92-fd23-34c2-8fe3-956808caf8c5
    keyVIVO-1436
  6. Modularizing VIVO
    1. Search index
    2. Triplestore
    3. Frontend UI
  7. Jira
    serverDuraSpace JIRA
    serverIdc815ca92-fd23-34c2-8fe3-956808caf8c5
    keyVIVO-1443

...

Draft notes in Google-Doc

ElasticSearch instead of Solr

  1. Swapping in ElasticSearch 6.3 for Solr. 
  2. JIm demonstrated VIVO 1.8 running elasticSearch. 
  3. Downloaded elasticSearch last week and spent a couple of days creating a VIVO driver. 
    1. See https://github.com/j2blake/Vitro/blob/feature/elasticsearchExperiments/api/src/main/java/edu/cornell/mannlib/vitro/webapp/searchengine/elasticsearch/Elasticsearch_notes_on_the_first_draft.md
    2. Looks like VIVO. Works like VIVO. 
  4. Needs:
    1. Docs
    2. Smoke test
    3. Functional testing
    4. Improved snippets
    5. Code improvements
    6. Unit tests
    7. Automatic initialization of the index
  5. Why Elasticsearch
    1. Create options for sys admins. 
    2. Some sites are already bought in to ElasticSearch. They love it. 
    3. Certainly should not run in parallel. Should be implemented as part of VIVO.
  6. Put more data in the index
  7. Elastic has nested fields that keep their relationships publication uris on the author’s record. Publication names. 
    1. Does the current version of Solr have these features?
  8. Ownership characteristics – both seem okay? VIVO rather small by elasticSearch standards?
  9. ElasticSearch has an Apache license
  10. Both ElasticSearch and Solr are based on underlying Lucene technology
    1. See In case this helps (not sure if the article is 'good' or not but here you go): https://www.searchtechnologies.com/blog/solr-vs-elasticsearch-top-open-source-search
    2. And https://sematext.com/blog/solr-vs-elasticsearch-differences/ and https://db-engines.com/en/ranking/search+engine (rank Elasticsearch #1 based on their ranking methodology)
  11. Perhaps not a search for “best” but which is best in a particular environment and for a particular application
  12. After Solr 5, (current is Solr 7) Solr is a free-standing application, like ElasticSearch.
  13. Explore nested documents in elasticSearch. Current VIVO interface 
  14. Developed in 1.8 since the ant environment is so much more productive than the 1.9/1.10 Maven environment.
  15. Are there touch points with product evolution? 
    1. Does the nested doc capability, and the ability to have ElasticSearch in the architecture resonate with product evolution? 
    2. Product Evolution is looking at GraphQL for its API capabilities.
  16. How do we see applications related to VIVO being installed? By the installer?
    1. For try out, a jar?
    2. For try out, a VM?
    3. For production, apps must be installed? As we require MySQL and Tomcat now? Solr or ElasticSearch in the future?

Ingest Approaches

  1. Add ingest tools to the table in the apps and tools catalog

Topics for September Sprint

  1. abox/tbox topic
  2. ElasticSearch
  3. Internationalization
  4. Decoupling?

Planning for walk-through of large pull-request

  1. Graham’s pull request is a big one. May need some additional hands.

Previous Actions

Actions

  •  ..Alex Viggio  will bring news of Elasticsearch instead of Solr up with Product Evolution.  Might there be consequences for the September sprint.