Originally OpenOffice.org pages was published in multi sub-domain structure.

Active address list:

Main page: http://www.openoffice.org
Documentation: http://documentation.openoffice.org
Development: http://development.openoffice.org
Distribution: http://distribution.openoffice.org
Download: http://download.openoffice.org
Projects list and individual addresses: http://projects.openoffice.org
About: http://about.openoffice.org
Marketing: http://marketing.openoffice.org
Native pages list: http://l10n.openoffice.org
Bugzilla: http://openoffice.org/bugzilla/
Extensions: http://extensions.services.openoffice.org
Templates: http://templates.services.openoffice.org
Wiki: http://wiki.services.openoffice.org
Forums: http://user.services.openoffice.org

Archive create

Possible:

Checkout via SVN URL.
Example:
"svn co https://svn.openoffice.org/svn/download~webcontent download" to get all website content from the download project
Do it analog with the other projects.
Wiki: database dump (Cornell can do this)
Bugzilla: I hope, ORACLE will provide a Database dump if not, we can use XML Export. Bugzilla can import this XML's
Forums: As I know we have admins of the OOo user forums in our group, they can make a dump of the database via the PHPbb admin interface.
Extensions and Templates: We realy need to backup this. Afk the servers of this services are not hosted bei ORACLE, they are hosted ad OSL.
Use wget

Note: I (rbircher) have allready a script to make a serie checkout of all projects, the only thing that I need is a .txt file who lists all project names (line break separated)

Todo plan

Create full sub-domains list
Create archive
Selecting needed content
Move contents to new page

Space shortcuts

Child pages

Active address list:

Archive create

Todo plan

Space shortcuts

Child pages

OOo-Sitemap

Active address list:

Archive create

Todo plan