Page History: Batch file structure
Compare Page Revisions
Page Revision: 2009/04/07 11:27
NDNP FILE STRUCTURE
batchID/snXXXXXXX/reel_barcode_number/issue_date_edition
NAMING SCHEMES and EXAMPLES¶
- batchID = the group of reels scanned together
- BATCH.xml = Delivery batch manifest data
- BATCH_1.xml = Validated version of delivery batch manifest data
- snXXXXXXX (7 to 10 digits) = LCCN - Library of Congress serial number for newspaper title (sn87093449: Daily Republican)
- 00211100503 (11 digits) = NDNP reel barcode number
- 00211100503.xml - Reel metadata
- 00211100503_1.xml - validated version of reel metadata
- 1896022601 = YYYYMMDD01 issue date (e.g. 1896-02-26) and edition number (e.g. 01)
- 1896022601.xml = issue information (where you can find date, title, vol., iss., page correlations)
- 1896022601_1.xml = validated version of issue metadata
- 0016.xml = ocr data for the page (file name is sequence number of scan)
- 0016_1.xml = validated version of page data