Publikationer

 

Brügger, N; Laursen, D.; Nielsen, J. (2017): Exploring the domain names of the Danish web. I: The Web as History, p. 62-80

Skouvig, L. (2016):Web-archives and big data: managing the messiness Interview with Niels Brügger

Laursen, D.; Møldrup-Dalum, P. (2016): Netarkivet 10 år. DF Revy 2, p. 6-9.
I juli 2015 fejrede Netarkivet sin 10 års fødselsdag.

Skovgård Jensen, T.; Schostag, S.; Bønding, N. (2015): Chasing the news: report from 10 years of digital legal deposit in Denmark. IFLA International News Media Conference, 15-16 april 2015.
Article on Collection and preservation of digital news media – based on legal deposit.

Schostag, S.; Fønss-Jørgensen, E. (2012): Webarchiving: Legal Deposit of Internet in Denmark.A Curatorial Perspective. Microform and Digitization Review, Vol. 41, pp. 110–120
Non technical article on web curation in Denmark using the tool “NetarchiveSuite”.

Henriksen, B. N. (2011): Netarkivet.dk indsamler data om .dk. DF Revy 4, p. 4-5.
Internettet er også en del af den kulturarv, der skal dokumenteres for fremtidens generationer. Derfor er Statsbiblioteket og Det Kongelige Bibliotek under det fælles banner Netarkivet.dk gået i gang med at indsamle og bevare den danske del af internettet.

Jacobsen, G. (2008): Erfaringer med høstning af det danske Internet 2005-2008. DF Revy 8, p. 12-15.
I foråret 2006 kunne DF Revys abonnenter læse om de første erfaringer med nethøstning. Der er nu gået 3 år siden den lov, der gjorde nethøstningen lovlig, trådte i kraft og Netarkivet.dk synes, det kunne være passende med en opdatering.

Andersen, B. (2007): Integration of non-harvested web data into an existing web archive. Unpublished paper.
This paper describes a software prototype developed for transforming non-harvested web data into ARC files. It analyses problems connected to different kind of delivered web material and tries indexing the transformation result with the open source WayBack for testing the transformation quality.

Jacobsen, G. (2007): Interoperability in the Future. Conference paper, 73rd World Library and Information Congress.
This article summarizes the results of a questionaire about web archive interoperability between national web archives.

Jacobsen, G. (2007): La Captura de Internet en Dinamarca. Unpublished paper.
The article describes experiences with harvesting the danish part of the internet gained within the first two years of the netarchive.dk project. English version below.

Jacobsen, G. (2007): Collecting the Danish Internet. Unpublished paper.
The article describes experiences with harvesting the danish part of the internet gained within the first two years of the netarchive.dk project.

(2007): Definition of an event in terms of net archiving. Unpublished note.
The document describes the guidelines used for determining when an event is relevant for the netarchive.dk project to harvest.

Clausen, L. (2006): Overview of the Netarkivet web archiving system. Conference paper, 6th International Web Archiving Workshop (IWAW’06).
The paper presents an overview of the entire system build by Netarchive.dk to control preservation of internet material in both large scale (snapshot harvesting) and small scale (selective / thematic harvesting) – from defining harvests to perserving the bits.

Christensen, N. H. (2006): A formal analysis of recovery in a preservational data grid. Conference paper, Conference on Mass Storage Systems and Technologies (MSST06).
A data grid made for the long-term preservation of digital materials is described. The data grid’s ability to recover from data loss is analysed by developing a formal, mathematical model for the relevant, implemented software operations.

Andersen, B. (2006): The DK domain: in words and figures. Unpublished paper.
This article summarizes the experiences and statistics from the first snap shot harvest undertaken by netarchive.dk during july to september 2005.

Andersen, B. (2006): DK-domænet i ord og tal. Upubliceret papir.
Denne artikel sammenfatter netarkivets erfaringer og statistikker fra den første tværsnitshøstning af hele .dk-domænet der fandt sted fra juli til september 2005.

Christensen, N. H. (2005): Preserving the bits of the Danish internet. Conference paper, 5th International Web Archiving Workshop (IWAW05).
This paper describes simulations of bit preservation setup used by netarchive.dk

Larsen, S. og E. K. Nielsen (2005): Kulturarv bliver sikret: Kulturens danmarkskort. Jyllands-Posten 02.07.2005.
En lov, der trådte i kraft i går, sikrer en bred og enestående indsamling og registrering af vor kultur i alle afskygninger, skriver Svend Larsen og Erland Kolding Nielsen.

Hielmcrone, H. v. (2005): Vejledning til den nye pligtafleveringslov.
Den nye pligtafleveringslov trådte i kraft den 1. juli 2005. Dette dokument beskriver i korte træk, hvad loven indebærer for både de to nationalbiblioteker og netsteder/producenter.

Henriksen, B. N. (2005): Webarkivering. I: E. K. Nielsen, S. B. Larsen og N. C. Nielsen (red.): Kommunikation erstatter transport: Den digitale revolution i danske forskningsbiblioteker 1980-2005, p. 637-656.
Denne artikel beskriver webarkivering som disciplin og Netarkivet som projekt.

Clausen, L. (2004): Concerning Etags and Datestamps. Conference paper, 4th International Web Archiving Workshop (IWAW04).
In web archiving, avoiding unnecessary downloads of unchanged pages can significantly reduce the load on both the archiving system and the server being archived. However, the indicators available for determining whether a page is changed are frequently either missing or wrong, causing pages changes to missed. In this paper, we investigate the quality of the two change indicators defined in the HTTP protocol, Last-Modified and Etag. Based on downloads of front pages of Danish web sites, we compare the reliability and usefulness of the two indicators and consider if using a combination of the two can lead to better prediction of page changes. Finally, we present a systematic way to determine the best prediction scheme, and present an unexpected download scheme with better characteristics than the obvious choices.

Christensen, N. H. (2004): Towards format repositories for web archives. Conference paper, 4th International Web Archiving Workshop (IWAW04).
Web archives face a formidable challenge regarding the handling of file formats. It is the thesis of this paper that this challenge could and should be met through the development of format repositories fit for that purpose. The format challenge for web archives – and its relation to software for viewing and converting digital objects – is analyzed in detail using methods from the field of programming language implementation. As a result of the analysis, we are able to list a number of specific requirements to a format repository. A format repository that satisfies these requirements can be integrated with a web archive s software and thereby provide it with automatic support for handling formats.

Christensen, S. S. (2004): Archive Format and metadata requirements. Unpublished paper.
A discussion of this projects requirements for both archival format and metadata.

Clausen, L. (2004): Handling File Formats. Unpublished paper.
Considerations and plans for handling the problem of evolving file formats in a long-term web archive setting. Problems discussed include: Categorization of formats, preserving limited aspects of files, criteria for evaluation the long-term viability of formats, DRM issues, preservation strategies, and preservation workflow.

Christensen-Dalsgaard, B. (2004): Web Archive Activities in Denmark. RLG DigiNews 8, 3.
This paper describes our experience and some of our results.

Christensen-Dalsgaard, B., Fønss-Jørgensen, E., Hielmcrone, H. v., Finnemann, N. O., Brügger, N., Henriksen, B., Carlsen, S. V. (2003): Final Report for The Pilot Project netarkivet.dk.
The present report by the group behind the ”netarkivet.dk” project describes the experience gained from a pilot study, in which existing software was used to harvest and subsequently test out materials relating to the County and District elections of 2001. The pilot study showed that a great deal of material could be harvested in this way, but also that much of the interactive use of the net cannot be caught by ordinary methods.
The pilot project also offers an indication of the financing needed if Denmark is to safeguard an important part of its cultural heritage. Estimates are given both for the archiving of this heritage under present conditions, where the work is carried out on the basis of voluntary agreements, and on the assumption that the law on legal deposit of material may be changed, making it legal for institutions receiving statutory deliveries to acquire online materials.

Henriksen, B. (2001): Danish Legal Deposit on the Internet: Current Solutions and Approaches for the Future. Conference paper, 5th European Conference (ECDL 2001).
Proceedings from the conference ‘Preserving the present for the future’, 2001 Several of the papers that were presented at the conference about archiving of the internet are available on the website of The Danish Electronic Research Library.