Netarchive.dk collects and preserves the Danish portion of the internet.

A new legal deposit law came into force on the 1st of July 2005. As a result, The State and University Library and the Royal Library are now collecting and preserving the Danish portion of the internet.

In practical terms, this means that the two institutions collect internet-material using so-called "harvesters" (web crawlers).

Harvesters are programs of the kind used by search engines. Netarchive's harvesters scan the Web from a set of starting points (initially all domains under .dk) and follow links from them.

The collected material is stored in a joint archive in which, for added security, all data is stored in both institutions.

The archive is not publicly accessible and initially can only be used for research purposes and with prior permission from the Danish Data Protection Agency.

A set of guidelines regarding the new law has been developed. It outlines the implications of the new law for the national libraries and for websites and content-providers.

[The guidelines can be read here] (in danish only)

Any questions or comments can be sent to:

 

news

August 2009:
Newsletter august 2009:
Read more here

8th september 2008:
Newsletter august 2008:
Read more here

July 2008: "Web Archiving: Issues and Problems in Collection Building and Access". Read the article from LIBER Quarterly here

2nd november 2007:
New article on netarchive.dk:
'Integration of non-harvested data...'

Read more here

18th september 2007:
New article on netarchive.dk:
'Interoperability in the future'

Read more here

4th july 2007:
NetarchiveSuite released as Open Source.
Read more here

2nd may 2007:
New article on netarchive.dk:
'Collecting the Danish internet'

Read more here

30th march 2007:
Newsletter march 2007
Read more here

15th february 2006:
The first expeciences with harvesting the entire .dk-domain. Read the article here.