About
Q
A_Report w
as written to comp
are WSL's newsp
aper input d
at
a (i.e. d
at
a cre
ated for NDNP during reel ev
alu
ation) to the NDNP output
after d
at
a conversion (i.e. the b
atch, mets issue
and mets reel files). You m
ay be
able to
ad
apt the script for use but be
aw
are th
at these instructions
are written for use with d
at
a from
an MS
Access 2007 d
at
ab
ase
and
a Windows oper
ating system.
Download
QA_Report.tar v1 or
QA_Report.zip v1Feel free to downlo
ad but ple
ase help us improve it by
emailing me if you find bugs.
Install
Extr
act Q
A_Report
and s
ave it on your server or extern
al h
ard drive. This script works best (
and is f
aster) if you s
ave the folder on the s
ame drive
as the b
atch your
are testing (e.g. C:\NDNP\OUT\b
atch_w
a_2009_s
ample
and C:\NDNP\OUT\Q
A\Q
A_Report)
data:image/s3,"s3://crabby-images/9e91e/9e91edc7da4d44a59aa7f946ea2718d2d3e297c3" alt="save to same folder as the batch" save to same folder as the batch |
data:image/s3,"s3://crabby-images/da73a/da73af00c54db43dff6e4d262e025e5657286c08" alt="FYI: the pictured batch is one reel with three titles" FYI: the pictured batch is one reel with three titles |
How to Run
- To run QA_Report you must first save your input data as xml to the QA_Report folder. You can easily convert Access tables to xml (see sample_database.mdb)
- Our input data is delivered in MS Access in three tables (titleTbl, reelTbl, issueTbl)
- From Access save each table as xml into the QA_Report folder (e.g. Export > More > XML File > C:\NDNP\OUT\QA\QA_Report\reelTbl.xml)
- Click OK
- Export XML window will open, save 'Data (XML)' only (uncheck 'Schema of the data (XSD)' - this isn't needed)
- Click OK
- Save Export Steps if you like, click Close
- You may close the database
- QA_Report should now have three xml files (titleTbl.xml, reelTbl.xml, issueTbl.xml)
NOTE: You may get errors if your data isn't structured in the same way as the sample data. If you know xsl, you're welcome to tweak the code to meet your needs. For example, if your field names are different simply change the variable names in the titleTbl.xsl, reelTbl.xsl, and issueTbl.xsl files in QA_Report data:image/s3,"s3://crabby-images/e6f88/e6f880e442fc873f88a5be858f3a82928821de5a" alt="save data to xml files" save data to xml files |
- Now run the QA_Report using the BATCH.xml file
- Open your computer's MS-DOS command
- Start > Run > type 'cmd' > click 'OK'
- Navigate to QA_Report (cd C:\NDNP\OUT\QA\QA_Report)
- At the prompt run the run_report.bat file with the proper path to the BATCH.xml file as a parameter - use single quotes as shown here (e.g. run_report '..\batch_wa_2009_sample\BATCH.xml')
data:image/s3,"s3://crabby-images/d8bc2/d8bc24ba64577fe13988875f737f2285fccd8cc5" alt="run the script from the command line" run the script from the command line |
- The script may take a few minutes depending on the size of your dataset. When the comparison is done a browser window will open to 'Test:titleTbl'
data:image/s3,"s3://crabby-images/f06b7/f06b7440a76c6ea3be62da90c8dfd3ba5c060c29" alt="when the script finishes the first test table will open in a browser" when the script finishes the first test table will open in a browser |
Understanding the Results¶
Links in
all the t
ables will open up the corresponding mets (issue or reel) or the B
ATCH.xml file in the browser so you c
an investig
ate
any found differences in the output d
at
a.
Test: titleTbl
This t
able is
a re-cre
ation of titleTbl in the d
at
ab
ase.
- Titles in red are missing from the BATCH.xml file
- The links at the bottom of the page will take you to the next test results
Test: reelTbl
This t
able is
a re-cre
ation of reelTbl in the d
at
ab
ase.
- Reels in red are those whose reelNumber is missing from the BATCH.xml file
- The data in this table is compared to the mets reel files throughout the batch (crawled via the BATCH.xml file using ndnp:reel)
- Cells in yellow are where there is a mismatch in the output
Test: issueTbl
This t
able is
a re-cre
ation of issueTbl in the d
at
ab
ase.
- Issues in red are those whose lccn is missing from the BATCH.xml file
- The data in this table is compared to the mets issues files throughout the batch (crawled via the BATCH.xml file using ndnp:issue)
- Cells in yellow are where there is a mismatch in the output
Known issues
- Need to fix test for ndnp:startDate and ndnp:endDate on reel mets test (error created when reel is split between two titles).
- Fix test for density readings
- Need to build a way to test when duplicate pages MODS:detail@type="page number"/MODS:number should not equal MODS:detail@type="pages"/MODS:start (since WSL delivers duplicates).