Or: migrate networks to new physical network
- CLOUDSTACK-10024Getting issue details... STATUS
As a Cloud Owner
I want to migrate my existing guest networks to another physical network infrastructure
In order to leverage new network technologies, e.g. leverage SDN.
KVM
Vmware
Isolated
VPC
CloudStack supports multiple physical networks with guest traffic, but only when the guest traffic is given a tag. The network offering then also needs to specify the tag, in order to correlate it to a matching physical network.
CloudStack Core will be extended to support moving networks to another Physical Network, by adding API commands.
Migrating a network will keep the VM running on the same host, so there are no issues for multinic VM's, when you would migrate all networks one by one.
VR will be recreated. The new VR will get a new guest IP if it is no longer the gateway (when the target Network offering doesn’t specify VR as provider of source nat). In that case, the new VR will get the first free IP i.s.o. Gateway IP.
We will update the DB object with the details of the new implementation.
To be able to correctly cleanup the old implementation, we make a temporary copy of the network.
New API’s:
migrateNetwork
Parameter: networkId (UUID)
Parameter: networkOfferingId (UUID)
Parameter: resume (boolean)
migrateVpc
Parameter: vpcId (UUID)
Parameter: vpcOfferingId (UUID)
Parameter: tierNetworkOfferings[] (to defines the new network offering for each tier)
networkId
Parameter: resume (boolean)
The networkOfferingId/vpcOfferingId specifies the target network/vpc offering, which as mentioned, uniquely determines the target physical network using its associated tag.
The API implementation will be made as re-entrant as possible.
E.g. if 2 network objects are found in the db for the specified networkId (UUID), it will
By default indicate the error and fail
Unless “resume” is explicitly set to True, then try to resume as much as possible from the corrupt entry state. Read more on "resume" in below section.
During migration of a network the UUID of objects are swapped so that external system (like NuageVsp) will perceive the migration as a delete of the network on the old physical network followed by a reimplement on the new physical network. Because of this, orchestration systems can still use the same UUID's as before (the resulted objects in CloudStack will have the same UUID's except for the VR.).
Make a temporary copy of the network before migration to keep the implementation details (moving along VR and nics).
Implement original network on new Physical network using guest network guru, found by looking up the tag of the offering
Implement services (Dhcp, static nat, source nat, …)
Deploy VR if needed
Migrate vnics to new physical network (move them back from the copy to the original network that is upgraded now)
Prepare Nic on Guru (add vport, vm interface, vm)
Prepare Nic on Element (apply acl rules, …)
Send ReplugNicCommand to host
Remove nic on native element and guru
Shutdown network services on old physical network (for this the copy object is used)
Undeploy VR if needed
Shutdown network infrastructure on old physical network
Delete original network
During migration of a network the UUID of objects are swapped so that external system (like NuageVsp) will perceive the migration as a delete of the network on the old physical network followed by a reimplement on the new physical network. Because of this, orchestration systems can still use the same UUID's as before (the resulted objects in CloudStack will have the same UUID's except for the VR.).
Make copy of original VPC and implement VPC on new Physical network using guest network guru, found by looking up the tag of the network offerings
Migrate Tiers to newly created VPC in the new Physical Network (same pattern as migrate network happens for every tier.)
Make copy of original network (moving along VR and nics)
Re-implement original network on new Physical network using guest network guru
Implement services (Dhcp, static nat, source nat, …)
Deploy VR if needed
Migrate vnics to new physical network
Prepare Nic on Guru (add vport, vm interface, vm)
Prepare Nic on Element (apply acl rules, …)
Send ReplugNicCommand to host
Release nic on native element and guru
Shutdown network services on old physical network
Undeploy VR if needed
Shutdown network infrastructure on old physical network
Delete original network
Shutdown (original) vpc in old physical network
Delete original vpc
Provide Agent command to change the bridge of a nic in one call.
ReplugNicCommand
We need to allow both OVS and LinuxBridge on one host at the same time. This is not possible in the current implementation (only one VifProvider can be specified).
We changed the code to look for the correct VifDriver, by asking each one to see, if a bridge with the given name exists, and let the default create the bridge if it isn’t found.
You can load both VifDrivers by having the following lines in /etc/cloudstack/agent/agent.properties
network.bridge.type=native libvirt.vif.driver.Vpn=com.cloud.hypervisor.kvm.resource.OvsVifDriver |
Resume support is implemented for operators to have the ability to finish off a previously submitted migration command which halfway failed during its new network implementation. After correcting the cause of that failure, the operator can re-issue the same migration command, now with a resume parameter set to True.
The resume logic follows below table :
resume parameter value \ Network migration "state" | Good | Bad (but error resolved) |
False (default) | Success | Fail (again) |
True | Success | >> Success << |
Based on the resume parameter and the related field in the networks table, we “force” migration of a network even if the network offering of the network is already the same as the current network offering. In that case, we will use the network that is stored in the related field (DB table) as the already generated copy. Otherwise we follow the normal migration procedure.
The creation and migration of the network copy happens in a transaction block. The same happens for each nic that needs to be reassigned. The use of those transactions allow us to mark critical sections and never leave the DB in an unrecoverable state.
Given a VMware host setup, prepared for migration
And a physical network with VLAN encapsulation providing guest, management and storage networking
And a new physical network providing guest traffic tagged as “target”
And an isolated guest network in the VLAN physical network
And a network offering equivalent to the offering of the guest network tagged with “target”,
When I migrate the network using the target network offering
Then the network is configured on the new physical network
This can be refined to:
Note: US-1-3 and US-1-4 need persistent isolated network offering support with Nuage (upstream PR is available)
Given a VMware host setup
And a physical network with VLAN encapsulation providing guest, management and storage traffic
And a new physical network providing guest traffic tagged as “target”
And an isolated guest network in the VLAN physical network
And a network offering equivalent to the offering of the guest network tagged with “target”,
And 2 stopped vms in the guest network
When I migrate the network using the target network offering
Then the network is configured on the new physical network
And the VMs and their interfaces are configured on the new physical network
This can be refined to :
Given a VMware host setup
And a physical network with VLAN encapsulation providing guest, management and storage networking
And a new physical network with Guest Traffic tagged as “target”
And an isolated guest network in the VLAN physical network
And a network offering equivalent to the offering of the guest network with tag “target”
And 2 vms in the guest network
When I migrate the network using the target network offering
Then the network is configured on the new physical network
And the VMs and their interfaces are configured on the new physical network
And the VMs can reach each other
This can be refined to :
2 Comments
Daan
Frank this looks very good. The only reservation i have is that I want to see to and from for your obvious solution. i.e. vlan->vsp and back, vsp->vlan. Other possible scenarios are of course vlan -> vlan on a new infra, or vlan -> vxlan. I don't think I'm holding you responsible for those implementations though
good work
Kris Sterckx
Hi Daan,
the solution is generic i.e. network technology agnostic. Internally we test with vlan to vxlan and back to vlan. Other combinations are possible as well.