DSpace Statistics

DSpace uses Apache Solr as the underlying engine for its statistics. There is no need to download any separate software; everything required is included.

...

Property Name

Default Value

Type

Description

solr.log.server

${dspace.baseUrl}/solr/statistics

String

Used by the SolrLogger client class to connect to the Solr server over HTTP and perform updates and queries. In most cases, this can (and should) be set to localhost.

solr.spiderips.urls


Code Block
http://iplists.com/google.txt, \
http://iplists.com/inktomi.txt, \
http://iplists.com/lycos.txt, \
http://iplists.com/infoseek.txt, \
http://iplists.com/altavista.txt, \
http://iplists.com/excite.txt, \
http://iplists.com/misc.txt, \
http://iplists.com/non_engines.txt

String

List of URLs from which spider files are downloaded into [dspace]/config/spiders. These files contain lists of known spider IPs and are used by the SolrLogger to flag usage events with an "isBot" field or to ignore them entirely.
The "stats-util" command can be used to force an update of spider files, regenerate "isBot" fields on indexed events, and delete spiders from the index. For usage, run:


Code Block
dspace stats-util -h

from your [dspace]/bin directory.
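As a rough illustration of how downloaded spider files could feed the "isBot" flag, the sketch below parses spider-list text into a set and checks a client address against it. It assumes one IP address per line and "#" comment lines, which may not match every published list. The names `parse_spider_ips` and `is_spider` are hypothetical and are not part of DSpace.

```python
def parse_spider_ips(text):
    """Collect IP addresses from spider-list text.

    Assumption: one address per line; blank lines and lines starting
    with '#' are skipped. Real spider files may use other conventions.
    """
    ips = set()
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        ips.add(line)
    return ips


def is_spider(ip, spider_ips):
    """Return True if the client IP appears in the spider set."""
    return ip in spider_ips


# Made-up addresses from the documentation-only ranges.
sample = """# example spider list
192.0.2.10
192.0.2.11

# another comment
198.51.100.7
"""
spiders = parse_spider_ips(sample)
print(is_spider("192.0.2.10", spiders))
print(is_spider("203.0.113.5", spiders))
```

A set keeps each membership test cheap even when the combined lists are large.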

solr.dbfile

${dspace.dir}/config/GeoLiteCity.dat

String

Path to the GeoLiteCity database file used by LocationUtils to determine the location of client requests from their IP address. During the Ant build process (both fresh_install and update), this file is downloaded from http://www.maxmind.com/app/geolitecity if a new version has been published or if it is absent from your [dspace]/config directory.

solr.resolver.timeout

200

Integer

Timeout in milliseconds for DNS resolution of origin hosts/IPs. Setting this value too high may result in Solr exhausting your connection pool.
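To show why a bounded DNS lookup matters, here is a minimal sketch of a reverse lookup that gives up after a configurable number of milliseconds. It is not how DSpace implements the timeout. The function names and the injectable `resolver` parameter are hypothetical, added so the example runs without network access.

```python
import socket
import time
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as LookupTimeout


def reverse_dns(ip):
    """Default resolver: reverse DNS lookup via the standard library."""
    return socket.gethostbyaddr(ip)[0]


def resolve_with_timeout(ip, timeout_ms=200, resolver=reverse_dns):
    """Return the host name for ip, or None if the lookup fails or is too slow."""
    pool = ThreadPoolExecutor(max_workers=1)
    future = pool.submit(resolver, ip)
    try:
        return future.result(timeout=timeout_ms / 1000.0)
    except (LookupTimeout, OSError):
        return None
    finally:
        # Do not block on the worker; note that the lookup itself is not
        # cancelled and keeps its thread busy until it finishes.
        pool.shutdown(wait=False)


def slow_resolver(ip):
    """Stand-in for an unresponsive DNS server (hypothetical)."""
    time.sleep(0.5)
    return "late.example"


# Gives up after about 50 ms and prints None.
print(resolve_with_timeout("192.0.2.1", 50, resolver=slow_resolver))
```

The longer the timeout, the longer each slow lookup holds on to a worker, which is one plausible reading of the warning about exhausting the connection pool.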

useProxies

true

boolean

If true, statistics logging looks for the X-Forwarded-For header to determine the IP address of clients that access DSpace through a proxy service. This allows the client IP, rather than the proxy IP, to be recorded. [Note: This setting is found in the DSpace Logging section of dspace.cfg]
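The effect of useProxies can be sketched as follows, assuming the header in question is the standard X-Forwarded-For header and that its first (left-most) entry identifies the original client. The function `client_ip` and its parameters are hypothetical; DSpace's own selection logic may differ.

```python
def client_ip(remote_addr, headers, use_proxies=True):
    """Pick the IP address to record for a request.

    Assumption: headers is a plain dict keyed by the exact header name,
    and the left-most X-Forwarded-For entry is the original client.
    """
    if use_proxies:
        forwarded = headers.get("X-Forwarded-For", "")
        first = forwarded.split(",")[0].strip()
        if first:
            return first
    # No usable header, or proxy handling disabled: use the socket address.
    return remote_addr


headers = {"X-Forwarded-For": "203.0.113.5, 10.0.0.2"}
print(client_ip("10.0.0.1", headers, use_proxies=True))
print(client_ip("10.0.0.1", headers, use_proxies=False))
```

Because clients can set this header themselves, it is normally trusted only when DSpace really sits behind a proxy.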

statistics.item.authorization.admin

true

boolean

Enables access control restrictions on DSpace Statistics pages. Restrictions are based on access rights to Community, Collection and Item pages. Users must sign in to see the statistics. Setting this property to "false" makes the statistics publicly available.

solr.statistics.logBots

true

boolean

If false and an IP is detected as a spider, the event is not logged.
If true, the event will be logged with the "isBot" field set to true.
(see solr.statistics.query.filter.* for query filter options)
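The two logBots outcomes described above can be sketched as a small decision function. This is an illustration only: `build_event` and the "ip" key are hypothetical, while the "isBot" field name comes from the description.

```python
def build_event(ip, spider_ips, log_bots=True):
    """Return the event to index, or None if it should not be logged."""
    is_bot = ip in spider_ips
    if is_bot and not log_bots:
        # logBots=false: spider traffic is dropped entirely.
        return None
    # logBots=true (or a non-spider IP): log the event with its flag.
    return {"ip": ip, "isBot": is_bot}


spiders = {"192.0.2.10"}  # made-up spider address
print(build_event("192.0.2.10", spiders, log_bots=True))
print(build_event("192.0.2.10", spiders, log_bots=False))
print(build_event("203.0.113.5", spiders, log_bots=False))
```

Keeping flagged events in the index means bot traffic can still be inspected later and excluded at query time.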

solr.statistics.query.filter.spiderIp

false

boolean

If true, statistics queries will filter out spider IPs -- use with caution, as this often results in extremely long query strings.

solr.statistics.query.filter.isBot

true

boolean

If true, statistics queries will filter out events flagged with the "isBot" field. This is the recommended method of filtering spiders from statistics.
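To make the difference between the two query filters concrete, the sketch below builds Solr filter queries for each option and appends them to a select URL. It is an approximation, not DSpace's query code. The function names, the "ip" field name, and the base URL are assumptions; "isBot" is the field named above.

```python
from urllib.parse import urlencode


def build_filters(filter_is_bot=True, filter_spider_ip=False, spider_ips=()):
    """Build Solr filter queries that exclude spider traffic."""
    filters = []
    if filter_is_bot:
        # One short clause, regardless of how many spiders are known.
        filters.append("-isBot:true")
    if filter_spider_ip and spider_ips:
        # One term per known spider IP: grows with the size of the list.
        clause = " OR ".join("ip:" + ip for ip in sorted(spider_ips))
        filters.append("-(" + clause + ")")
    return filters


def select_url(base, query, filters):
    """Assemble a select URL carrying the query and each filter query."""
    params = [("q", query), ("rows", "0")] + [("fq", f) for f in filters]
    return base + "/select?" + urlencode(params)


base = "http://localhost:8080/solr/statistics"  # hypothetical server URL
spiders = {"192.0.2.10", "192.0.2.11", "198.51.100.7"}  # made-up addresses

short = select_url(base, "*:*", build_filters(True, False))
long_ = select_url(base, "*:*", build_filters(False, True, spiders))
print(len(short), len(long_))
```

With thousands of spider IPs, the second form adds one term per address, which is why the isBot filter is the more practical choice.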

...