Originally OpenOffice.org pages was published in multi sub-domain structure.
Active address list:
- Main page: http://www.openoffice.org
- Documentation: http://documentation.openoffice.org
- Development: http://development.openoffice.org
- Distribution: http://distribution.openoffice.org
- Download: http://download.openoffice.org
- Projects list and individual addresses: http://projects.openoffice.org
- About: http://about.openoffice.org
- Marketing: http://marketing.openoffice.org
- Native pages list: http://l10n.openoffice.org
- Bugzilla: http://openoffice.org/bugzilla/
- Extensions: http://extensions.services.openoffice.org
- Templates: http://templates.services.openoffice.org
- Wiki: http://wiki.services.openoffice.org
- Forums: http://user.services.openoffice.org
Archive create
Possible:
- Checkout via SVN URL.
Example:
"svn co https://svn.openoffice.org/svn/download~webcontent download" to get all website content from the download project
Do it analog with the other projects. - Wiki: database dump (Cornell can do this)
- Bugzilla: I hope, ORACLE will provide a Database dump if not, we can use XML Export. Bugzilla can import this XML's
- Forums: As I know we have admins of the OOo user forums in our group, they can make a dump of the database via the PHPbb admin interface.
- Extensions and Templates: We realy need to backup this. Afk the servers of this services are not hosted bei ORACLE, they are hosted ad OSL.
- Use wget
Note: I (rbircher) have allready a script to make a serie checkout of all projects, the only thing that I need is a .txt file who lists all project names (line break separated)
Todo plan
- Create full sub-domains list
- Create archive
- Selecting needed content
- Move contents to new page