The Library of Congress >> Especially for Librarians and Archivists >> Standards
HOME >> MARC Development >> Discussion Paper List
DATE: May 26, 2022
REVISED:
NAME: Defining a New Field to Record Electronic Archive Location and Access in the MARC 21 Formats
SOURCE: ISSN International Centre, Paris, and the National Library of Finland
SUMMARY: This paper describes defining a new field 857 (Electronic Archive Location and Access) to enable libraries to specify a persistent identifier or location of the resource in a digital archive repository or Web archive, and record the name, date ranges, and completeness of relevant archived content.
KEYWORDS: Field 857 (All formats); Electronic archive location and access (All formats); Field 856 (All formats); Electronic location and access (All formats); Access to online information resources (All formats)
RELATED: 2022-DP02; 2020-DP01; 2022-06; 2022-07; 2018-DP11; 93-4; 97-1; 99-06; 2019-01; DP 49; DP 54; DP 69; Guidelines for the Use of Field 856, Revised August 1999; Guidelines for the Use of Field 856, Revised March 2002
STATUS/COMMENTS:
05/26/22 – Made available to the MARC community for discussion.
06/29/22 – Results of MARC Advisory Committee discussion: Support for the new field varied somewhat but the group was reminded that a straw poll during the previous meeting demonstrated a strong preference for a new field to accommodate archival information rather than adding more information to field 856. A pre-meeting comment by Australia articulated a distinct need for a unique place for their digital preservation master URL to be mapped to, a need echoed by the sponsors of the discussion paper during the meeting. Discussion turned to the labeling and definition of subfield $e concerning archiving dynamics, namely whether it addressed the date range(s) of the content or of the harvesting action. This was clarified to specify the date range(s) of the content. This then raised the question of the date range of publication of the content or the date span of the subject matter addressed by the content. This was clarified to concern the publication date(s) of the content. A suggestion was made to deploy the “coverage/coverage dates” terminology from the continuing resources cataloging community. There were additional comments regarding what subfield $e should and would encompass. An additional subfield, possibly subfield $f, was suggested for the other use case(s) under discussion. The paper will return as a proposal, with the authors addressing the concerns expressed by MAC regarding whether all of the subfields that echo 856 subfields are needed in 857, and clarification about how subfields $e and $f are to be used.
Discussion paper 2022-DP02, written jointly by the ISSN International Centre and the National Library of Finland, suggested several improvements to existing field 856 (Electronic Location and Access).
MAC meeting response to the discussion paper was positive. New data elements were supported, but a clear majority of the MAC members preferred that a new MARC field for Electronic Archive Location and Access be defined in a separate proposal instead of adding all the new data elements to 856. Two separate papers have thus been prepared: a proposal covering the redefined subfields in field 856, and this discussion paper covering the creation of field 857.
For the purposes of this discussion paper, the terms electronic and digital are used interchangeably. This paper addresses both digital archive repositories and Web archives, which have the same purpose but are built in a different manner, using different software and communication protocols. The ISO definitions are provided below.
Digital archive repository
dedicated to the long-term preservation of the associated data
Note 1 to entry: The data in digital archives are also often available on-line. This highlights the need for reliable PIDs.
[SOURCE: ISO 24622-1:2015(en), 2.1]
Web archive
entire set of resources crawled from the Web over time, comprising one or more collections
[SOURCE: ISO/TR 14873:2013, 2.4]
Digital archive information does not currently have a specific field in MARC 21. There are at least four reasons why including information about Web archives and digital archive repositories in bibliographic records may provide added value to libraries and users and should be described in a new MARC 21 field:
New field 857 may be used when the existence of a copy of the resource has been identified in an archive and there is a need to record more information about the archive and its contents in a bibliographic record than is possible in field 856.
In the previous discussion paper, subfields for the date range of the relevant archive content and its completeness were included. In this paper, we have added subfields for the names of the archiving agency and archive.
New field 857 (Electronic Archive Location and Access) would enable information about an archived copy of a resource (persistent identifier, location, date range, completeness, provider) to be provided in a dedicated field in MARC records.
A structure parallel to that of Field 856 is proposed, with some small differences. Field 857 would accommodate not only Web archives, but also digital archive repositories built on applications such as Ex Libris’ Rosetta, which are intended to preserve original manifestations of resources in outdated file formats.
In an update to the previous discussion paper, new subfields have been defined for the name of the archiving agency and the name of the Web archive or digital archive repository.
The archived resource may be available in one or more publicly accessible Web archives, but access may also be restricted. The new terms of access and reproduction subfields introduced in MARC Proposal No. 2022-06 for Field 856 are equally applicable here.
Field 857 would make it possible for a library to specify the Web archive versions of both the current and past URLs of an electronic journal that has been published on an unstable internet domain. For example, the journal PLOS One has been published for 15 years, but its URI has not remained the same. The current address, https://journals.plos.org/plosone/, has been used since 2015. The Internet Archive has harvested it 5,577 times since January 29, 2015. The original address, https://plosone.org, was first harvested in July 2006.
The ISSN International Centre collects archive information for the Keepers Registry from over a dozen international partners. There are currently 76,267 titles ingested by 16 archiving agencies. Preservation data about electronic journals is normalized, stored in a local MARC field, and exposed on the ISSN Portal. If there were an official field for archive information, the ISSN International Centre would be able to exchange this information more readily with other partners, libraries, and the 93 ISSN national centres in a format vetted by the MARC community.
The National Library of Finland archives continuing and dynamic publications from the Finnish Web in a Web archive. This activity is supported by the legal deposit act. Until now, the National Library of Finland has used Field 856; however, there is no standard way to indicate the date range and completeness of the archived content, the rights related to it, or the name of the archive. Field 857 will make it possible for the National Library of Finland to provide links to its archive, which can only be accessed from dedicated workstations in legal deposit libraries. All the data from the Finnish Web archive and other electronic legal deposit documents are also being ingested into the national digital archive, hosted by the Finnish IT Center for Science. In the long term, this archive will become the only place hosting the original, outdated versions of electronic legal deposit documents, since the National Library relies on a migration strategy for its digital preservation. Providing links to the national digital archive will meet the needs of a customer who prefers the original version of a document to migrated versions, even if rendering the original document may require special tools.
This paper proposes that a new field 857 be created and defined in the MARC 21 formats as follows (subfields in bold defined below):
857 - Electronic Archive Location and Access (R)
FIELD DEFINITION AND SCOPE
Information needed to locate and access an electronic resource from a Web archive or a digital archive repository. This field may be used to provide additional information about archived resources beyond what is possible in Field 856.
Field 857 is repeated when an electronic resource has been stored in more than one Web archive or digital archive repository.
Indicators
First Indicator – Access method
# - No information provided
1 - FTP
4 - HTTP
7 - Method specified in subfield $2
Second Indicator – Relationship
# - No information provided
0 - Resource
1 - Version of resource
2 - Related resource
8 - No display constant generated
Subfield Codes
$c - Name of the archiving agency (NR)
$d - Name of the Web archive or digital archive repository (NR)
$e - Archive content date range (NR)
$f - Archive completeness (NR)
$g - Persistent identifier (PID) (R)
$h - Non-functioning Uniform Resource Identifier (URI) (R)
$l - Standardized information governing access (R)
$m - Contact for access assistance (R)
$n - Terms governing access (R)
$q - Electronic format type (R)
$r - Standardized information governing use and reproduction (R)
$s - File size (R)
$t - Terms governing use and reproduction (R)
$u - Uniform Resource Identifier (URI) (R)
$x - Nonpublic note (R)
$y - Link text (R)
$z - Public note (R)
$2 - Access method (NR)
$3 - Materials specified (NR)
$5 - Institution to which field applies (NR)
$6 - Linkage (NR)
$7 - Access status (NR)
$8 - Field link and sequence number (R)
Note: Subfield descriptions that are identical with 856 or are standard control subfields are not reproduced here. Subfields in italics have been approved and are expected in MARC Update No. 34 in 2022. Second indicator values proposed in MARC Proposal 2022-07 will be substituted if that paper is approved by MAC.
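The repeatability rules in the subfield list above lend themselves to a simple mechanical check. The following is a minimal Python sketch and not part of the proposal: the table is transcribed from the list above, while the function name `check_857_subfields` and the shape of its result are hypothetical choices made for illustration.

```python
from collections import Counter
from typing import Dict, List

# Repeatability of each proposed 857 subfield, transcribed from the list
# above: True = repeatable (R), False = not repeatable (NR).
REPEATABLE_857: Dict[str, bool] = {
    "c": False, "d": False, "e": False, "f": False,
    "g": True, "h": True, "l": True, "m": True, "n": True,
    "q": True, "r": True, "s": True, "t": True, "u": True,
    "x": True, "y": True, "z": True,
    "2": False, "3": False, "5": False, "6": False, "7": False,
    "8": True,
}


def check_857_subfields(codes: List[str]) -> Dict[str, List[str]]:
    """Report subfield codes that are undefined for 857 or repeated despite being NR.

    `codes` is the sequence of subfield codes found in one 857 field,
    for example ["c", "e", "n", "u"].
    """
    counts = Counter(codes)
    undefined = sorted(code for code in counts if code not in REPEATABLE_857)
    repeated_nr = sorted(
        code for code, n in counts.items()
        if n > 1 and REPEATABLE_857.get(code) is False
    )
    return {"undefined": undefined, "repeated_nr": repeated_nr}
```

A field carrying `$c`, `$e`, `$n`, and `$u` once each would pass with both lists empty, whereas a field with two `$c` subfields would be reported under `repeated_nr`.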
$c – Name of the archiving agency (NR)
Agency responsible for the Web archive or digital archive repository.
$d – Name of the Web archive or digital archive repository (NR)
The name by which the Web archive or digital archive repository is known.
$e – Archive content date range (NR)
Date range of the content of the resource in the archive, specified according to Extended Date/Time Format (EDTF). The start date of the archived content should always be mentioned; the end date only if the content described in the record is no longer being archived.
Multiple date ranges can be provided in a single 857 $e by separating them with ";". The reason for these gaps, if known, may be provided in 857 $x or 857 $z.
Note: 857 $e is intended primarily for electronic continuing resources and other dynamic resources.
$f – Archive completeness (NR)
Contains available information about the completeness of the content in the Web archive or digital archive repository, but may also include information about how often or how many times the resource has been harvested during the date range specified in $e.
Note: 857 $f is intended primarily for electronic continuing resources or other dynamic resources.
$g – Persistent identifier (PID) (R)
Persistent identifier (PID) which enables search and retrieval of a resource from a Web archive or digital archive repository using existing Internet protocols.
$h – Non-functioning Uniform Resource Identifier (URI) (R)
Uniform Resource Identifier (URI) that is no longer functional, for example due to link rot or content drift. Archives and repositories themselves may also cease to exist.
Subfield $h may be repeated if there is more than one non-functioning URI. A note on the status change (including the date) may be added either in subfield 857 $x or 857 $z, depending on the local policy.
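The $e conventions described above (a mandatory start date, an optional end date, and multiple ranges separated by ";") can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: it accepts the hyphen-separated form used in this paper's examples (such as "2006-01-16-" or "2011-2017") with dates written as YYYY, YYYY-MM, or YYYY-MM-DD, and the function name `parse_857_e` is hypothetical. It does not attempt to cover the full EDTF specification.

```python
import re
from typing import List, Optional, Tuple

# A date as written in this paper's examples: YYYY, YYYY-MM or YYYY-MM-DD.
_DATE = r"\d{4}(?:-\d{2}(?:-\d{2})?)?"
# A range is a start date, a hyphen, and an optional end date.
_RANGE = re.compile(rf"({_DATE})-({_DATE})?")


def parse_857_e(value: str) -> List[Tuple[str, Optional[str]]]:
    """Split an 857 $e value into (start, end) pairs.

    `end` is None when the range is open, i.e. the content is still
    being archived. Raises ValueError for a part that does not match
    the assumed notation.
    """
    ranges: List[Tuple[str, Optional[str]]] = []
    for part in value.split(";"):
        part = part.strip()
        if not part:
            continue
        match = _RANGE.fullmatch(part)
        if match is None:
            raise ValueError(f"unrecognized date range: {part!r}")
        ranges.append((match.group(1), match.group(2)))
    return ranges
```

Under these assumptions, an open range such as "2006-01-16-" yields a single pair whose end is `None`, and a value with gaps, such as the hypothetical "1989-1993; 1997-2008", yields one pair per range.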
leader 03922cas a2200625 i 4500
007 cr |||||||||||
008 190529d20122018enkqr|pso | a0eng
022 0 # $a 2162-4054 $2 _1 $l 2162-4046
210 1 # $a Worm $b (Austin Tex., Online)
222 # # $a Worm $b (Austin, Tex. Online)
245 1 # $a Worm.
264 # 1 $a Austin, TX $b Landes Bioscience $c [2012]-
264 3 1 $3 <2015-> $a Abingdon $b Taylor & Francis
362 1 # $a Began with v. 1, issue 1 (Jan./Feb./Mar. 2012); ceased with Volume 6, Issue 3/4 (2017).
588 # # $a Description based on: V. 1, issue 1 (January/February/March 2012); title from issue contents page (publisher's Web site, viewed January 15, 2013).
588 # # $a Latest issue consulted: Volume 6, issue 3-4 (2017) (Taylor & Francis Online, February 2, 2018).
776 0 8 $t Worm (Austin, Tex. Print) $x 2162-4046 $h ta
856 4 0 $h http://www.landesbioscience.com/journals/worm/ $u http://www.tandfonline.com/toc/kwrm20/current $u http://www.tandfonline.com/loi/kwrm20
857 4 0 $c Internet Archive $e 2011-2017 $f saved 45 times $q https://www.nationalarchives.gov.uk/PRONOM/fmt/96 $u https://web.archive.org/web/*/http://www.landesbioscience.com/journals/worm/
LDR 01574cai a2200469 i 4500
007 cr||||||||||||
008 090429c20089999fi kn w|o 0 | b0fin|
022 1# $a1798-1557$l0355-2047$2a
222 #0 $aHS.fi.
245 00 $aHS.fi.
246 13 $aHelsingin sanomat
260 ## $aHelsinki: $bSanoma Magazines, $c2008-
310 ## $acontinuously updated
338 ## $aonline resource $bcr $2rdacarrier
538 ## $aWorld Wide Web.
655 #7 $anewspapers $2slm/fin $0http://urn.fi/URN:NBN:fi:au:slm:s39
776 0# $tHelsingin sanomat $x0355-2047
856 40 $uhttps://www.hs.fi/
857 40 $c National Library of Finland $d Suomalainen verkkoarkisto $e 2006-01-16- $f saved 9431 times as of 2022-03-14 $n Access in legal deposit libraries only $q text/html $u https://verkkoarkisto.kansalliskirjasto.fi/wayback/*/www.hs.fi
857 40 $c Internet Archive $e 2003-12-18- $f saved 21238 times as of 2022-03-15 $n Free access $u https://web.archive.org/web/*/hs.fi
LDR 02329cas a2200493 i 4500
007 cr
008 161228c20169999it s||pss|||||||||b0mul
022 0 # $2 _d $a 2531-9884 $l 2531-9884
210 1 # $a Comp. cult. stud. $b (Firenze)
222 # # $a Comparative cultural studies $b (Firenze)
245 1 # $a Comparative cultural studies.
246 3 1 $a Comparative Cultural Studies. European and latin american perspectives
260 3 # $a Firenze $b Firenze University Press
362 1 # $a N. 1 (2016)
500 # # $a Peer-review (fascicolo consultato: N. 1, 2016)
710 1 # $a Università degli Studi, Firenze.
720 # # $a Università degli Studi di Firenze
856 4 0 $h http://www.fupress.net/index.php/ccselap/index $q application/pdf $u https://oajournals.fupress.net/index.php/ccselap/index
857 4 0 $c CLOCKSS $e 2016- $n Dark archive $u http://www.clockss.org/clockss/Comparative_cultural_studies
857 4 0 $c Internet Archive $d FatCat $e 2021- $f selected articles $q application/pdf $u https://fatcat.wiki/container/pv7gnxzaj5eydnvan65xus4uiy
leader 04030cas a2200577 i 4500
007 cr |||||||||||
008 100219c20139999ne f||p|s|||||||||a0eng |
022 0 # $a 2213-0624 $2 _j $l 2213-0624
222 # # $a International journal for history, culture and modernity $b (Online)
245 1 # $a International journal for history, culture and modernity.
246 3 3 $a HCM
260 # # $a Utrecht $b Utrecht University Department for History & Art History
260 3 # $a Leiden $b Brill
362 1 # $a Volume 1 - Issue 1 - 2013-
710 2 # $a Stichting International Journal for History, Culture and Modernity
776 0 # $t International journal for history, culture and modernity (Print) $x 2666-6529 $h ta
856 4 0 $h https://www.history-culture-modernity.org/ $q application/pdf $u https://brill.com/view/journals/hcm/hcm-overview.xml
857 4 0 $c Library of Congress $e 2013- $n Onsite access only $g http://hdl.loc.gov/loc.gdc/ejournal.021602 $u http://hdl.loc.gov/loc.gdc/ejournal.021602
leader 07581cas a2201261 4500
007 ta
008 220330c18929999nyumn|p 0 a0eng
044 # # $c USA
022 0 # $a 0042-8000 $l 0042-8000 $2 _1
222 # # $a Vogue $b (New York)
245 1 # $a Vogue.
260 # # $a [New York] $b [Condé Nast Publications, etc.]
336 # # $a text $b txt $2 rdacontent
337 # # $a unmediated $b n $2 rdamedia
338 # # $a volume $b nc $2 rdacarrier
362 0 # $a v. 1- Dec. 17, 1892-
588 # # $a Latest issue consulted: Vol. 210, no. 6 (June/July 2020).
856 4 0 $u https://www.vogue.com/magazine
856 4 1 $u http://www.proquest.com
857 4 1 $d HathiTrust Digital Library $e 1894-1922 $f incomplete $u http://catalog.hathitrust.org/api/volumes/oclc/1769261.html
857 4 1 $c Bibliothèque nationale de France $d Gallica $e 1917-1917 $f incomplete $g https://gallicaintramuros.bnf.fr/ark:/12148/cb34471903t/date
857 4 1 $c Internet Archive $e 1892-2014 $f complete? $u https://archive.org/details/pub_vogue
leader 06604cas a2200889 a 4500
007 cr mnu||||||||
008 191107c19899999pauqr|pso o 0 a0eng c
022 0 # $a 1547-3325 $l 1040-1237 $y 1573-9698 $z 1573-3238 $2 _1
222 # # $a Annals of clinical psychiatry $b (Online)
245 1 # $a Annals of clinical psychiatry.
246 3 0 $a Clinical psychiatry
260 # # $a [New York, NY] $b [Elsevier] $c [©1989]-
260 2 # $3 $a [New York, NY] $b [Plenum Pub. Co.] $n 1
260 2 # $3 <2006>-2008 $a [Philadelphia] $b [Taylor & Francis] $n 2
260 2 # $3 Feb. 2009- $a [Montvale, N.J.] $b [Dowden Health Media] $n 3
260 2 # $3 Feb. 2010- $a [Parsippany, NJ] $b [Quadrant HealthCom] $n 4
264 3 1 $a [Parsippany, NJ] $b [Frontline Medical Communications]
336 # # $a text $b txt $2 rdacontent
337 # # $a computer $b c $2 rdamedia
338 # # $a online resource $b cr $2 rdacarrier
362 1 # $a Began with vol. 1, no. 1 (Mar. 1989).
588 # # $a Description based on: Vol. 12, no. 1 (2000); title from contents screen (MetaPress, viewed Aug. 17, 2006).
588 # # $a Latest issue consulted: Vol. 20, no. 4 (November 2008), Portico, viewed May 21, 2015.
856 4 0 $h http://bibpurl.oclc.org/web/80079 $h http://www.aacp.com/Pastissues.asp
856 4 0 $h http://www.metapress.com/link.asp?id=710714
856 4 0 $h http://www.metapress.com/openurl.asp?genre=journal&issn=1547-3325
857 4 0 $c CLOCKSS $e 1989-2008 $f Preserved $n Dark archive $u http://www.clockss.org/clockss/Annals_of_Clinical_Psychiatry
857 4 0 $c Portico $e 1997-2008 $n Dark archive $u http://www.portico.org/Portico/browse?journal=ISSN_10401237
857 4 0 $c Internet Archive $e 1989-1993 $f Incomplete $u https://archive.org/details/pub_annals-of-clinical-psychiatry
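To show how the 857 occurrences in the examples above might be read by software, the following Python sketch splits one field, written in the display notation used in this paper (tag, two indicators, then `$`-prefixed subfields), into its parts. It is a rough illustration only: the names `Field857` and `parse_857` are hypothetical, the sketch assumes that `$` never occurs inside a subfield value, and it does not handle binary MARC or MARCXML.

```python
import re
from typing import List, NamedTuple, Tuple


class Field857(NamedTuple):
    ind1: str                          # first indicator (access method)
    ind2: str                          # second indicator (relationship)
    subfields: List[Tuple[str, str]]   # (code, value) pairs in field order


# Tag, two indicators (written "40" or "4 0" in the examples), then the rest.
_FIELD = re.compile(r"857\s+([#0-9])\s*([#0-9])\s+(.*)", re.DOTALL)
# One subfield: "$", a one-character code, then everything up to the next "$".
_SUBFIELD = re.compile(r"\$([a-z0-9])\s*([^$]*)")


def parse_857(line: str) -> Field857:
    """Parse one 857 field in the display notation used in the examples."""
    match = _FIELD.fullmatch(line.strip())
    if match is None:
        raise ValueError("not an 857 field in the expected notation")
    ind1, ind2, rest = match.groups()
    subfields = [(code, value.strip()) for code, value in _SUBFIELD.findall(rest)]
    return Field857(ind1, ind2, subfields)


# Usage with one of the example fields from this paper.
field = parse_857(
    "857 40 $c Internet Archive $e 2003-12-18- "
    "$f saved 21238 times as of 2022-03-15 $n Free access "
    "$u https://web.archive.org/web/*/hs.fi"
)
agencies = [value for code, value in field.subfields if code == "c"]
```

Keeping the subfields as an ordered list of pairs, rather than a dictionary, preserves both the order of the subfields and any repeated codes such as multiple `$u` or `$h`.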
The implications of these proposed changes on BIBFRAME will need to be considered together in order to prevent inadvertent data loss and conversion inconsistencies.
6.1. The proposed new field 857 indicators mirror some of the indicators in 856 (Electronic Location and Access). Have we covered all the relevant access modes, or are there other access methods that should have their own indicators?
6.2. The proposed new field 857 includes many of the same subfields as 856 (Electronic Location and Access); should all the subfields copied from 856 be kept?
6.3. In this new discussion paper, additional subfields have been provided for the name of the archiving agency in $c and the name of the archive in $d. Is it clear how to use these subfields, and are both needed?
6.4. Does this paper accommodate the range of digital preservation systems or tools that might exist and need to be recorded in field 857?
The Library of Congress >> Especially for Librarians and Archivists >> Standards ( 10/27/2022 )