User Tools

Site Tools


scratchpad:oaitoolkit

This Page May Need Updating or Revising

The following instructions are dated Feb 4, 2009. Error handling in the OAIToolkit should have improved since then, however, fast-extract is probably still capable of producing bad MARCXML. This also doesn't tell you how to install the Toolkit, and assumes some specific file paths you'll want to change.

To extract records from EG:

PERL5LIB=/home/phasefx/svn/OpenSRF/STABLE_0_9/src/perlmods/:$PERL5LIB time ./fast-extract| \
./import/marc_add_ids -set_001_003 -f deleted -f id -f tcn_source -f tcn_value -f marc > ~/exported-records.marcxml

To start and stop Tomcat:

cd ~/apache-tomcat-6.0.18
bin/startup.sh
bin/shutdown.sh

To load or update records:

cd ~/OAIToolkit
cp some.marc.xml marcxml/
./load.sh

It seems that Tomcat needs to be restarted for an updated Lucene index to become visible.

If you delete ~/OAIToolkit/lucene_index/, it will be recreated on the next invocation of load.sh

The toolkit doesn't like records with bad leaders, or nonvalid subfield codes and indicators.

It can break completely with a "MarcException", and not report in its summary the records it does manage to create and/or update.

To view the web service:

http://hostname:8080/OAIToolkit

scratchpad/oaitoolkit.txt · Last modified: 2022/02/10 13:34 by 127.0.0.1

Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Share Alike 4.0 International
CC Attribution-Share Alike 4.0 International Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki

© 2008-2022 GPLS and others. Evergreen is open source software, freely licensed under GNU GPLv2 or later.
The Evergreen Project is a U.S. 501(c)3 non-profit organization.