Bridging Datacenters for Disaster Recovery - Virtually

Published: 2014-12-19. Last Updated: 2014-12-19 22:43:40 UTC
by Rob VandenBrink (Version: 1)

It's been a while since we talked about Disaster Recovery issues. The last diary I posted on this covered using L2TPv3 to bridge your Datacenter / Server VLAN to the "same" VLAN at a DR site, over an arbitrary Layer 3 network (https://isc.sans.edu/diary/8704).

Since then, things have changed. There's a real push to move DR sites from a rack in a remote office location to recognized IaaS cloud locations. With that change come new issues. Whether you are using your own servers in a colocation facility or IaaS VM instances, rack space for a physical router may come with a price tag, or, if everything is virtual, there might be no rack space at all.

In my situation, I had two clients in this position. The first customer simply wanted to move their DR site from a branch office to a colocation facility. The second customer is a Backup-as-a-Service Cloud Service Provider, who is creating a "DR as a service" product. In the first situation, there was no rack space to be had. In the second situation, the last thing a CSP wants is to have to give up physical rack space for every customer, and then deploy CSP owned hardware to the client site - that simply does not scale. In both cases, a VM running a router instance was clearly the preferred (or only) choice.

Virtual routers with enterprise features have been around for a while. Back in the day we might have looked at quagga or zebra, but those have been folded into more mature products these days. In our case, we were looking at Vyatta (now owned by Brocade), or the open-source (free as in beer) fork of Vyatta, Vyos (vyos.net). Cisco is also in the game: their 1000V product supports IOS XE, and their "bridge L2 over L3" approach uses OTV rather than L2TPv3 or GRE. You'll find that most router vendors now have a virtual product.

Anyway, working with Vyatta/Vyos configs isn't like Cisco at all; their configs look a whole lot more like what you might see in JunOS. While Vyos supports the L2TPv3 protocol we know and love, it's a brand new feature, and it comes with a note from the developer: "if you find any bugs, send me an email" (confidence inspiring, that). Vyatta doesn't yet have that feature implemented. So I decided to use GRE tunnels, and bridge them to an ethernet interface. Since this tunnel was going to run over the public internet, I encrypted/encapsulated the whole thing using a standard site-to-site IPSEC tunnel. The final solution looks like this:

The relevant configs look like the one below (just one end is shown). Note that this is not the entire config, and all IPs have been elided.

Please - use our comment form and let us know if you've used a different method of bridging Layer 2 over Layer 3, and what pitfalls you might have had to overcome along the way!

interfaces {
    bridge br0 {
        aging 300
        hello-time 2
        max-age 20
        priority 0
        stp false
    }

First, define the bridge interface. Note that STP (Spanning Tree Protocol) is disabled. You likely want this disabled unless you’ve got a parallel second bridged link (maybe a dark fiber or something similar).

    ethernet eth0 {
        bridge-group {
            bridge br0
        }
        description BRIDGE
        duplex auto
        hw-id 00:50:56:b1:3e:4f
        mtu 1390
        smp_affinity auto
        speed auto
    }

The ETH0 interface is on the server VLAN (or port group if you are using standard ESXi vSwitches) – this is the VLAN that you are bridging to the DR site. It is added to the bridge, and has no IP address.

    ethernet eth1 {
        address 192.168.123.21/24
        duplex auto
        hw-id 00:50:56:b1:1d:a8
        smp_affinity auto
        speed auto
    }

ETH1 is the management IP of this router, and is also the terminator for the GRE tunnel and the IPSEC VPN.

You can split this up: many might prefer to terminate the tunnels on a loopback, for instance, or to add another Ethernet interface.

    tunnel tun0 {
        description BRIDGED
        encapsulation gre-bridge
        local-ip 192.168.123.21
        multicast enable
        parameters {
            ip {
                bridge-group {
                    bridge br0
                }
                tos inherit
            }
        }
        remote-ip 192.168.249.251
    }
}

The GRE tunnel is also bridged, and also doesn’t have an IP address. The “gre-bridge” encapsulation is the same as GRE (IP protocol 47), but it allows you to add this interface to a bridge.
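The reduced MTU on eth0 is presumably there to leave headroom for the encapsulation stack, since every bridged frame picks up a GRE header and then an IPsec wrapper. The following is a rough sketch of that arithmetic, not a statement of how the router itself computes anything. It assumes IPv4 everywhere, a GRE header with no key or checksum, AES256 running in CBC mode, HMAC-SHA1 truncated to 96 bits, NAT-T UDP encapsulation, and a 1500-byte path between sites. The function and constant names are made up for illustration.

```python
# Rough per-packet size for an Ethernet frame bridged over GRE and then
# carried in ESP tunnel mode with NAT-T. All sizes are in bytes.
# Every constant below is an assumption about the deployment, not a measured value.

OUTER_IPV4 = 20      # IPv4 header without options
GRE_BASE = 4         # GRE header with no key, checksum or sequence number
INNER_ETHERNET = 14  # bridged frames keep their Ethernet header (no 802.1Q tag assumed)
UDP_NAT_T = 8        # UDP encapsulation added by NAT traversal
ESP_HEADER = 8       # SPI + sequence number
AES_CBC_IV = 16      # assumes AES256 is used in CBC mode
AES_BLOCK = 16       # cipher block size that ESP pads to
ESP_TRAILER = 2      # pad-length and next-header bytes
HMAC_SHA1_96 = 12    # truncated SHA-1 integrity check value


def gre_packet_size(inner_ip_mtu: int) -> int:
    """Size of the GRE packet, outer IP header included, for a full-size inner IP packet."""
    return inner_ip_mtu + INNER_ETHERNET + GRE_BASE + OUTER_IPV4


def esp_packet_size(plaintext_len: int) -> int:
    """Size on the wire once ESP tunnel mode plus NAT-T wraps plaintext_len bytes."""
    # ESP pads (plaintext + trailer) up to a whole number of cipher blocks.
    blocks = (plaintext_len + ESP_TRAILER + AES_BLOCK - 1) // AES_BLOCK
    padded = blocks * AES_BLOCK
    return OUTER_IPV4 + UDP_NAT_T + ESP_HEADER + AES_CBC_IV + padded + HMAC_SHA1_96


def wire_size(inner_ip_mtu: int) -> int:
    """Total size on the public link for a full-size packet from a bridged host."""
    return esp_packet_size(gre_packet_size(inner_ip_mtu))


def largest_inner_mtu(path_mtu: int = 1500) -> int:
    """Largest inner IP MTU whose fully encapsulated packet still fits in path_mtu."""
    mtu = path_mtu
    while wire_size(mtu) > path_mtu:
        mtu -= 1
    return mtu


if __name__ == "__main__":
    print(wire_size(1390), largest_inner_mtu(1500))
```

Under these particular assumptions, largest_inner_mtu(1500) comes out at 1384 and a 1390-byte inner packet at 1504 bytes on the wire, so the budget lands in the same neighbourhood as the value used on eth0. The exact figure shifts with the cipher mode, whether NAT-T is actually in play, and the real path MTU, so it is worth testing on your own link.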

.....

system stuff like AAA, NTP, timezone, syslog, SSH, ACLs and so on go here

......

This stuff is all important for your security posture, but is not relevant to the tunneling or bridging, so I’ve redacted it.

vpn {
    ipsec {
        esp-group PRL-ESP {
            compression disable
            lifetime 3600
            mode tunnel
            pfs disable
            proposal 1 {
                encryption AES256
                hash sha1
            }
        }
        ike-group PRL-IKE {
            lifetime 28800
            proposal 1 {
                encryption AES256
                hash sha1
            }
        }
        ipsec-interfaces {
            interface eth1
        }
        logging {
            log-modes all
        }
        nat-traversal enable
        site-to-site {
            peer a.b.c.d {
                authentication {
                    id @CUSTOMER
                    mode pre-shared-secret
                    pre-shared-secret demo123
                    remote-id @CLOUD
                }
                connection-type initiate
                default-esp-group PRL-ESP
                ike-group PRL-IKE
                local-address 192.168.123.21
                tunnel 0 {
                    allow-nat-networks disable
                    allow-public-networks disable
                    esp-group PRL-ESP
                    local {
                        prefix 192.168.123.21/32
                    }
                    protocol gre
                    remote {
                        prefix 192.168.249.251/32
                    }
                }
            }
        }
    }
}

The relevant portions of the VPN config are:

· Note that the peer IP is the public / NAT'd IP of the other end.
· IDs have to be created for each end. These routers use XAUTH when you define a pre-shared key, so to avoid having them use the FQDN, it's safer to define usernames.
· The "traffic match" for encryption is defined by the source prefix+destination prefix+protocol. In our case, it's "the management IP of the customer router AND the matching IP on the cloud router AND GRE".
· NAT-T is enabled, as both ends are behind NAT firewalls.
· Take some care in defining the pre-shared key. If a word occurs on your corporate website, Facebook page, or LinkedIn (or in a dictionary), it's a bad choice, LEET-speak or no. Check out https://isc.sans.edu/forums/diary/16643
· We set both ends to "initiate", which enables both init and respond. This allows either end to start the tunnel.
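Two of the points above lend themselves to a short illustration: the "traffic match" triple, and choosing a pre-shared key that is not a guessable word. The sketch below is only a model of the idea, not how the router evaluates its policy. The Selector class and random_psk helper are hypothetical names, and whether your router accepts every character that secrets.token_urlsafe can emit is an assumption you should verify before pasting a generated key into a config.

```python
import ipaddress
import secrets
from dataclasses import dataclass

GRE = 47  # IP protocol number for GRE


@dataclass(frozen=True)
class Selector:
    """Toy model of an IPsec traffic selector: source prefix + destination prefix + protocol."""
    local: ipaddress.IPv4Network
    remote: ipaddress.IPv4Network
    protocol: int

    def matches(self, src: str, dst: str, protocol: int) -> bool:
        """True only when all three parts of the selector match the packet."""
        return (
            ipaddress.ip_address(src) in self.local
            and ipaddress.ip_address(dst) in self.remote
            and protocol == self.protocol
        )


# Mirrors "tunnel 0" in the config: two /32 prefixes plus "protocol gre".
tunnel0 = Selector(
    local=ipaddress.ip_network("192.168.123.21/32"),
    remote=ipaddress.ip_network("192.168.249.251/32"),
    protocol=GRE,
)


def random_psk(num_bytes: int = 32) -> str:
    """Return a random URL-safe string to use in place of a dictionary word."""
    return secrets.token_urlsafe(num_bytes)


if __name__ == "__main__":
    # GRE between the two routers is selected for encryption.
    print(tunnel0.matches("192.168.123.21", "192.168.249.251", GRE))
    # TCP (protocol 6) between the same addresses is not.
    print(tunnel0.matches("192.168.123.21", "192.168.249.251", 6))
    print(random_psk())
```

Because the selector is this narrow, only the GRE tunnel itself is encrypted; anything else sourced from the management address, such as SSH or syslog, falls outside the match.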

===============
Rob VandenBrink
Metafore

Comments

I really appreciated your article.

I just experimented a little with two Vyatta bridges and encountered the following problems:

a) It works between two single hosts, but when I attempted to connect two remote clusters with standard vSwitches, only the machines running on the same ESX host as the bridge were able to have their traffic bridged.

b) I had some problems with MTU. Since the mtu command is on an interface without an IP address, ICMP "packet too big" messages were not generated, and MTU discovery wasn't working. I had to manually lower the MTU on the virtual machines.

Did somebody else hit the same hurdles?

Internet Storm Center


Anonymous

Apr 6th 2017