Introduction and Overview
MPLS Layer 3 VPNs (L3VPNs) are used by service providers (SPs) to provide layer 3 connectivity between an enterprise and its different remote offices/locations through the service provider's network. MPLS L3VPNs are defined by RFC 4364.
Service providers are able to provide different customers with layer 3 connectivity to their remote locations using the same underlaying infrastructure. This is possible regardless of whether the different customer sites use the same overlapping RFC 1918 IP addresses or not. Traffic from different customers is differentiated using VPNv4 addresses and routed to their respective destinations using the SP's core network.
The following topology will be used to configure MPLS Layer 3 VPNs.
The routers R1, R5, and R6 in the topology above are referred to as Provider Edge (PE) routers or edge Label Switch Routers (LSR). The MPLS network is usually referred to as the P network and the customer network, the C network. All LSRs in the P network have MPLS configured. PE routers have VRFs for each customer. Customer Edge (CE) routers have no knowledge of MPLS or VRFs. CE routers connect to Provider Edge (PE) routers.
At the PE routers, VRFs are created for each customer as one PE router may connect to many CE routers. Each customer’s routes will be separated from other customers in their own routing table.
MPLS L3VPN Components
MPLS L3VPNs consist of prefix exchange between CE and PE and P routers in the service provider's core network.
Service Provider Core
In the service provider core of an MPLS L3VPN, the P routers run MPLS over an interior gateway protocol(IGP) such as IS-IS, OSPF; the result is that packets are switched using MPLS along routing paths created by the underlaying IGP. Intermediate LSRs in the provider core network do not have the customer prefixes in their routing tables. All P routers (intermediate and edge LSRs) run an IGP and are configured for MPLS. LDP is usually configured as the label exchange protocol. Other labeling methods are supported such as MP-BGP, RSVP-TE, etc.
PE Routers
For purposes of providing MPLS L3VPN services, only PE routers run BGP and they form an iBGP peering through the VPNv4 address family.
On the PE, a VRF is created for each customer. This ensures that each customer has their prefixes in a separate routing table. In the VRF configuration, a route distinguisher (RD) is configured. This is used by MP-BGP on the PE to differentiate the customer's prefixes (including overlapping IP addresses) when switching customer traffic through the provider core network. The PE interface connecting to each customer's CE is placed in its corresponding VRF. Additionally, the route-target (RT) is configured on the PE devices to determine into which VRF routing table a prefix should be placed.
Route Distinguisher (RD)
The Route Distinguisher (RD) is a 64-bit (8-byte) value usually of the format ASN:nn or IP address:nn where ASN is the Autonomous system number and nn is any random number. RD is used by the PE to distinguish and uniquely identify the different VRFs; to differentiate the multiple prefixes from different customers. This allows for overlapping IPv4 addresses between customers. It is locally significant and does not need to match on the remote PE.
The RD is very critical to the successful operation of MPLS L3VPNs particularly when clients are using RFC 1918 prefixes in their enterprise networks.
VPNv4 Addresses
The RD is used to create VPNv4 addresses. This is a 96-bit value that consists of the RD (64-bit) and the IP address/network address (32-bit). It is usually in the format RD:IP address. An example can be 65000:10000:10.10.20.0 where the value 65000:10000 is the RD and 10.10.20.0 is the prefix being shared. VPNv4 addresses make addresses from the different customers unique. This results in the ability of the SP to route traffic for different customers regardless of whether they use overlapping address space or not. VPNv4 NLRI main attributes include:
- 8-byte route distinguisher(RD)
- IPv4 prefix and length
- Next hop
- MPLS VPN label
MPLS Layer 3 VPN Label Stack
L3VPN needs at least 2 labels to forward traffic across an MPLS L3VPN. The first label attached is the VPN label and the second is the transport/Label Distribution Protocol (LDP) label. On an MPLS packet, the transport label is the top most label. It is typically derived from LDP. It is sometimes referred to as the IGP label. The LDP label(transport label) is used for label switching from PE to PE through the MPLS domain as intermediate LSRs run only MPLS and not BGP. The transport label basically tells the SP MPLS core LSRs which PE traffic is destined to i.e., which PE device is the exit point out of the L3VPN to the CE. The VPN Label is derived from VPNv4 advertisements of PE and is used by the PE router to determine which CE traffic is destined to.
The VPN label is used by the egress LSR/PE router to determine customer specifics about the packet and what to do with it. The VPN label tells the PE router which CE traffic is destined to. It is derived from VPNv4 advertisements of PEs. Other applications such as MPLS TE, FRR, etc., require even more labels. VPN labels are exchanged between PE routers over the MP-BGP peering. Labeled packets are advertised along with BGP extended communities path attributes. MPLS generates labels for VPNv4 routes and adds [V] after the prefix to indicate that the label is a VPNv4 label. In the L3VPN label stack, the VPN label is usually at the bottom of the stack (nearest to the Layer 3 header) as indicated by the 1-value is the S-section of the MPLS header.
Route Target (RT)
A route-target (RT) is a 64-bit value similar in format to the RD i.e. ASN:nn or
IP address:nn. It is used by MP-BGP to determine the VRF into which the different
customer prefixes should be installed. RT is defined by
RFC 4360 (BGP extended communities attribute).
The routes defined in the VRF of one customer are tagged for export
to the remote PE customer VRF using the command
route-target export ASN:nn
. To
import these routes into the appropriate VRF at the remote PE router, the command
route-target import ASN:nn
is used.
The new BGP extended community attribute "route-target" is used to control what
prefixes enter and exit the VRF table.
- export route target: what routes will go from VRF into BGP.
- import route target: what routes will go from BGP into VRF.
As an alternative to the use of RT, the route-maps "import map" and "export map" allow control over import and export on a per prefix basis.
RT controls the route’s VPN VRF membership(s). It is possible to have multiple RTs per VPNv4 route but one RD per VPNv4 route. VPNv4 speakers only accept VPNv4 routes with an RT matching a local VRF. Some exceptions include route reflectors. VPNv4 routes can have more than one RT. This allows for complex VPN topologies such as full mesh, hub and spoke, central services. The RT value used for export in the VRF on the ingress PE router should be the same RT value used in the RT-import of the appropriate VRF on the remote egress PE.
PE-CE
The CE forms an IGP peer relationship with the PE. Through this peering, it exchanges its prefixes with the PE. The PE isolates these prefixes received from the CE into the VRF created for this customer using the configured RT import values. For ingress traffic, the PE redistributes these customer prefixes from the IGP VRF table into MP-BGP VRF for that customer as defined by the export RT values. This configuration is implemented using the corresponding BGP VRF address-family. For egress traffic, the prefixes are redistributed from MP-BGP VRF into the IGP.
If the PE and CE routers have an eBGP peering, the prefixes are exchanged using BGP and no IGP (and redistribution) is required. MPLS is not configured on any client facing interfaces. Network traffic at the CE-PE interface uses normal IPv4 or IPv6 routing and all IPv4 protocols supported. Some IPv6 protocols supported.
Configuration
To create L3VPNs, the following steps should be taken.
- Step 1: Ensure MPLS infrastructure is in place. i.e. IP routing, MPLS.
- Step 2: Create the VRFs on the PE/edge LSR and interfaces connecting to each customer device should be configured to be in the appropriate VRF.
- Step 3: Configure iBGP peering between the PE routers using VPNv4 address-family.
- Step 4: Enable IGP peering and route exchange between the PE and CE devices.
- Step 5: Redistribute routes between MP-BGP VRF address-family and PE-CE routing protocol (this is not needed if PE-CE routing uses eBGP).
- Step 6: Test connectivity.
Step 1: Ensure MPLS Infrastructure is in Place
When configuring MP-BGP on an LSR, ensure that BGP peer address and the MPLS LDP ID match for ease of configuration. Guidance on how to configure an operational MPLS infrastructure can be found here.
Step 2: Create VRFs on PE Routers
Virtual Routing and Forwarding (VRFs) are used to create separate routing tables
for each customer on a router. These routing tables are separated from the
global routing table (RIB). The RT and RD are then configured under each VRF.
There are two ways to create a VRF: using the command
ip vrf <vrf-name>
or
vrf definition <vrf_name>
.
ip vrf <vrf-name>
(older method): creates VRFs for IPv4 address family only.R1(config)#ip vrf CUST_X_HQ
R1(config-vrf)#rd 300:100
R1(config-vrf)#route-target export 300:100
R1(config-vrf)#route-target import 300:110
R1(config-vrf)#ip vrf CUST_Z_2
R1(config-vrf)#rd 200:2
R1(config-vrf)#route-target export 200:110
R1(config-vrf)#route-target import 200:100
Two VRFs CUST_X_HQ and CUST_Z_2 have been created.
-
vrf definition <vrf_name>
: VRFs can be created for the IPv4 or IPv6 address families.R6(config)#vrf definition CUST_Z_HQ
R6(config-vrf)#rd 200:1
R6(config-vrf)#address-family ipv4
R6(config-vrf-af)#route-target export 200:100
R6(config-vrf-af)#route-target import 200:110
The VRF created is CUST_Z_HQ. VRF names do not need to match as they are locally significant.
vrf upgrade-cli
The global configuration command vrf upgrade-cli
is an important command to
use particularly when introducing VRFs in IPv6 in a network where IPv4 VRFs are
operational. It has been previously mentioned that the
vrf definition <vrf-name>
command supports creation IPv6 VRFs.
If the VRFs were created as below:
R1(config)#ip vrf CUST_X_HQ
R1(config-vrf)#rd 300:100
R1(config-vrf)#route-target export 300:100
R1(config-vrf)#route-target import 300:110
R1(config-vrf)#ip vrf CUST_Z_2
R1(config-vrf)#rd 200:2
R1(config-vrf)#route-target export 200:110
R1(config-vrf)#route-target import 200:100
The command vrf upgrade-cli multi-af-mode common-policies
will apply the same RT values configured for address-family IPv4 to IPv6
configuration automatically. This makes applying the same configuration
of IPv4 VRFs to IPv6 much easier.
When configuring RT or other settings in new format VRF, the address family
configuration takes precedence over any other similar configuration that may have been
configured under the VRF sub-mode.
Verification
To verify the creation of VRFs, use the command show vrf
.
R1(config)#vrf upgrade-cli multi-af-mode common-policies
You are about to upgrade to the multi-AF VRF syntax commands.
You will lose any IPv6 address configured on interfaces
belonging to upgraded VRFs.
Are you sure ? [yes]: yes
Number of VRFs upgraded: 2
R1(config)#
R1(config)#do show vrf detail
VRF CUST_X_HQ (VRF Id = 1); default RD 300:100; default VPNID <not set>
No interfaces
Address family ipv4 (Table ID = 1 (0x1)):
Export VPN route-target communities
RT:300:100
Import VPN route-target communities
RT:300:110
No import route-map
No export route-map
VRF label distribution protocol: not configured
VRF label allocation mode: per-prefix
Address family ipv6 not active
VRF CUST_Z_2 (VRF Id = 2); default RD 200:2; default VPNID <not set>
Interfaces:
Gi2/0
Address family ipv4 (Table ID = 2 (0x2)):
Export VPN route-target communities
RT:200:110
Import VPN route-target communities
RT:200:100
No import route-map
No export route-map
VRF label distribution protocol: not configured
VRF label allocation mode: per-prefix
Address family ipv6 not active
R1(config)#
Step 3: Configure iBGP Peering between the PE routers using VPNv4 address-family
Configure an iBGP peering between the PE routers. This iBGP peering should be formed using the VPNv4 address family. iBGP neighborships occur over a maximum of 255 hops. To configure an iBGP neigborship on R1;
R1(config)#router bgp 100
R1(config-router)#bgp log-neighbor-changes
R1(config-router)#no bgp default ipv4-unicast
R1(config-router)#neighbor 30.255.6.6 remote-as 100
R1(config-router)#neighbor 30.255.6.6 update-source Loopback255
R1(config-router)#address-family vpnv4 unicast
R1(config-router-af)#neighbor 30.255.6.6 activate
R1(config-router-af)#exit-address-family
!On R6
R6(config)#router bgp 100
R6(config-router)#bgp log-neighbor-changes
R6(config-router)#no bgp default ipv4-unicast
R6(config-router)#neighbor 30.255.1.1 remote-as 100
R6(config-router)#neighbor 30.255.1.1 update-source Loopback255
R6(config-router)#address-family vpnv4
R6(config-router-af)#neighbor 30.255.1.1 activate
R6(config-router-af)#exit-address-family
The command neighbor 30.255.1.1 send-community extended
is
automatically added by MP-BGP as it uses extended communities to route VPNv4
traffic. This can be verified by the command show run | section bgp
.
Verification
Once the iBGP peering has been completed, this neighborship can be confirmed
using the command show bgp vpnv4 unicast all summary
. The
result of this command is indicated in the following snippet.
R1#show bgp vpnv4 unicast all summary
BGP router identifier 30.255.1.1, local AS number 100
BGP table version is 56, main routing table version 56
39 network entries using 6552 bytes of memory
39 path entries using 2496 bytes of memory
6/6 BGP path/bestpath attribute entries using 864 bytes of memory
5 BGP extended community entries using 1040 bytes of memory
0 BGP route-map cache entries using 0 bytes of memory
0 BGP filter-list cache entries using 0 bytes of memory
BGP using 10952 total bytes of memory
BGP activity 47/8 prefixes, 47/8 paths, scan interval 60 secs
Neighbor V AS MsgRcvd MsgSent TblVer InQ OutQ Up/Down State/PfxRcd
30.255.5.5 4 100 87 105 56 0 0 01:28:23 5
30.255.6.6 4 100 67 103 56 0 0 01:28:04 12
This shows that the iBGP peering was created. A second iBGP peer(R5) was configured.
To confirm if extended BGP communities are being sent:
debug bgp vpnv4 unicast updates
*Apr 14 05:57:04.319: BGP(4): 30.255.6.6 rcvd UPDATE w/ attr: nexthop 30.255.6.6, origin ?, localpref 100, metric 2, extended community RT:200:100 OSPF DOMAIN ID:0x0005:0x000003E80200 OSPF RT:0.0.0.1:2:0 OSPF ROUTER ID:10.255.67.2:0
*Apr 14 05:57:04.331: BGP(4): 30.255.6.6 rcvd 200:1:10.7.7.1/32, label 44
*Apr 14 05:57:04.351: BGP(4): Revise route installing 1 of 1 routes for 10.7.7.1/32 -> 30.255.6.6(CUST_Z_2) to CUST_Z_2 IP table
Step 4: IGP Peering and Route Exchange Between the PE And CE Devices
This can be implemented through configuration of an IGP between the PE-CE routers or BGP with an eBGP relationship. Some customers may prefer running BGP at the PE-CE. This routing protocol is to share the customer routes from the CE to the PE.
When configuring the routing protocol, the physical interface that the PE connects to the CE should be placed in the appropriate VRF that was configured for the customer to which that interface points. Between CE R1 and PE R2, the configured routing protocol is EIGRP with named EIGRP being configured on R2.
R1(config)#interface g3/0
R1(config-if)#vrf forwarding CUST_Z_2
% Interface GigabitEthernet3/0 IPv4 disabled and address(es) removed due to enabling VRF CUST_Z_2
R1(config-if)#ip address 10.255.19.1 255.255.255.252
R1(config)#router eigrp EIGRP_NAMED
R1(config-router)#address-family ipv4 unicast vrf CUST_Z_2 autonomous-system 10
R1(config-router-af)#network 10.255.19.0 0.0.0.3
R1(config-router-af)#topology base
R1(config-router-af)#redistribute bgp 100 metric 100000 100 200 1 1500
On R9:
R9(config-if)#router eigrp EIGRP_Z_S2
R9(config-router)#address-family ipv4 unicast autonomous-system 10
R9(config-router-af)#network 10.255.19.0 0.0.0.3
R9(config-router-af)#network 10.90.0.0 0.0.255.255
Note that VRFs are only configured on the PE routers. On CE routers, VRFs are not required for MPLS L3VPN functionality. If they are configured, then it could be for another purpose unrelated to MPLS L3VPN.
The IGPs on the running on the PE-CE interfaces of the different PE-CE pairings does not have to be the same. To illustrate this, the PE-CE IGP on R7 is OSPF.
Step 5: Redistribute Routes Between MP-BGP VRF Address-family and PE-CE Routing Protocol
Route redistribution between the IGP and MP-BGP occurs at the PE router. Customer routes learned from the remote PE router are stored in the MP-BGP VPNv4 table. These can then be redistributed into the IGP running on the PE-CE link. Also, routes learned from the connected customer site through the IGP can be redistributed into MP-BGP to be forwarded to the remote customer site. This is accomplished using the redistribution command in the appropriate address-family.
On R2:
R1(config)#router bgp 100
R1(config-router)#no bgp default ipv4-unicast
!REDISTRIBUTE CUST_X_HQ ROUTES INTO MP-BGP (LOCAL CUSTOMER SITE TO REMOTE SITES)
R1(config-router)#address-family ipv4 vrf CUST_X_HQ
R1(config-router-af)#redistribute eigrp 10
R1(config-router-af)#exit-address-family
R1(config-router)#exit
!REDISTRIBUTE MP-BGP ROUTES INTO EIGRP (REMOTE CUSTOMER SITES TO LOCAL SITE)
R1(config)#router eigrp EIGRP_Z_S2
R1(config-router)#address-family ipv4 vrf CUST_X_HQ autonomous-system 10
R1(config-router-af)#topology base
R1(config-router-af-topology)#redistribute bgp 100 100000 100 250 10 1500
On R5
R5(config)#router bgp 100
R5(config-router)#address-family ipv4 vrf CUST_X_2
R5(config-router-af)#redistribute eigrp 10
R5(config-router-af)#exit-address-family
R5(config-router)#exit
R5(config)#router eigrp EIGRP_Z_S2
R1(config-router)#address-family ipv4 vrf CUST_X_2 autonomous-system 10
R1(config-router-af)#topology base
R1(config-router-af-topology)#redistribute bgp 100 100000 100 250 10 1500
R5(config-router-af)#exit-address-family
Note that with eBGP, no redistribution of PE-CE routes is required. These routes will simply be shared through the eBGP relationship from MP-BGP.
Verification
To verify that the appropriate routes have been redistributed;
On the PE router:
R1#show ip route vrf CUST_Z_2
Routing Table: CUST_Z_2
Codes: L - local, C - connected, S - static, R - RIP, M - mobile, B - BGP
D - EIGRP, EX - EIGRP external, O - OSPF, IA - OSPF inter area
N1 - OSPF NSSA external type 1, N2 - OSPF NSSA external type 2
E1 - OSPF external type 1, E2 - OSPF external type 2
i - IS-IS, su - IS-IS summary, L1 - IS-IS level-1, L2 - IS-IS level-2
ia - IS-IS inter area, * - candidate default, U - per-user static route
o - ODR, P - periodic downloaded static route, H - NHRP, l - LISP
+ - replicated route, % - next hop override
Gateway of last resort is not set
10.0.0.0/8 is variably subnetted, 18 subnets, 3 masks
D 10.9.1.0/24
[90/2570240] via 10.255.19.2, 01:04:15, GigabitEthernet3/0
D 10.9.2.0/24
[90/2570240] via 10.255.19.2, 01:04:15, GigabitEthernet3/0
D 10.9.3.0/24
[90/2570240] via 10.255.19.2, 01:04:15, GigabitEthernet3/0
D 10.9.4.0/24
[90/2570240] via 10.255.19.2, 01:04:15, GigabitEthernet3/0
B 10.10.1.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.2.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.3.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.4.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.5.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.6.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.7.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.8.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.9.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.10.0/24 [200/2] via 30.255.6.6, 00:30:17
B 10.10.255.1/32 [200/2] via 30.255.6.6, 00:30:17
C 10.255.19.0/30 is directly connected, GigabitEthernet3/0
L 10.255.19.1/32 is directly connected, GigabitEthernet3/0
B 10.255.67.0/30 [200/0] via 30.255.6.6, 00:39:18
R1#
R1#show bgp vpnv4 unicast vrf CUST_Z_2
BGP table version is 38, local router ID is 30.255.1.1
Status codes: s suppressed, d damped, h history, * valid, > best, i - internal,
r RIB-failure, S Stale, m multipath, b backup-path, f RT-Filter,
x best-external, a additional-path, c RIB-compressed,
Origin codes: i - IGP, e - EGP, ? - incomplete
RPKI validation codes: V valid, I invalid, N Not found
Network Next Hop Metric LocPrf Weight Path
Route Distinguisher: 200:2 (default for vrf CUST_Z_2)
*>i 10.7.1.0/24 30.255.6.6 2 100 0 ?
*>i 10.7.2.0/24 30.255.6.6 2 100 0 ?
*>i 10.7.3.0/24 30.255.6.6 2 100 0 ?
*>i 10.7.4.0/24 30.255.6.6 2 100 0 ?
*>i 10.7.5.0/24 30.255.6.6 2 100 0 ?
*>i 10.7.7.1/32 30.255.6.6 2 100 0 ?
*> 10.9.1.0/24 10.255.19.2 2570240 32768 ?
*> 10.9.2.0/24 10.255.19.2 2570240 32768 ?
*> 10.9.3.0/24 10.255.19.2 2570240 32768 ?
*> 10.9.4.0/24 10.255.19.2 2570240 32768 ?
*>i 10.10.255.1/32 30.255.6.6 2 100 0 ?
*> 10.255.19.0/30 0.0.0.0 0 32768 ?
*>i 10.255.67.0/30 30.255.6.6 0 100 0 ?
R1#
On the CE router:
R9>show ip route
Codes: L - local, C - connected, S - static, R - RIP, M - mobile, B - BGP
D - EIGRP, EX - EIGRP external, O - OSPF, IA - OSPF inter area
N1 - OSPF NSSA external type 1, N2 - OSPF NSSA external type 2
E1 - OSPF external type 1, E2 - OSPF external type 2
i - IS-IS, su - IS-IS summary, L1 - IS-IS level-1, L2 - IS-IS level-2
ia - IS-IS inter area, * - candidate default, U - per-user static route
o - ODR, P - periodic downloaded static route, H - NHRP, l - LISP
+ - replicated route, % - next hop override
Gateway of last resort is not set
10.0.0.0/8 is variably subnetted, 18 subnets, 3 masks
D EX 10.7.1.0/24 [170/51456] via 10.255.19.1, 00:26:27, GigabitEthernet0/0
D EX 10.7.2.0/24 [170/51456] via 10.255.19.1, 00:26:27, GigabitEthernet0/0
D EX 10.7.3.0/24 [170/51456] via 10.255.19.1, 00:26:27, GigabitEthernet0/0
D EX 10.7.4.0/24 [170/51456] via 10.255.19.1, 00:26:27, GigabitEthernet0/0
D EX 10.7.5.0/24 [170/51456] via 10.255.19.1, 00:26:27, GigabitEthernet0/0
D EX 10.7.7.1/32 [170/51456] via 10.255.19.1, 00:07:41, GigabitEthernet0/0
C 10.9.1.0/24 is directly connected, Loopback1
L 10.9.1.1/32 is directly connected, Loopback1
C 10.9.2.0/24 is directly connected, Loopback2
L 10.9.2.1/32 is directly connected, Loopback2
C 10.9.3.0/24 is directly connected, Loopback3
L 10.9.3.1/32 is directly connected, Loopback3
C 10.9.4.0/24 is directly connected, Loopback4
L 10.9.4.1/32 is directly connected, Loopback4
D EX 10.10.255.1/32
[170/51456] via 10.255.19.1, 00:26:27, GigabitEthernet0/0
C 10.255.19.0/30 is directly connected, GigabitEthernet0/0
L 10.255.19.2/32 is directly connected, GigabitEthernet0/0
D EX 10.255.67.0/30
[170/51456] via 10.255.19.1, 00:26:27, GigabitEthernet0/0
99.0.0.0/32 is subnetted, 1 subnets
C 99.255.9.9 is directly connected, Loopback255
R9>
R9>ping 10.7.7.1
Type escape sequence to abort.
Sending 5, 100-byte ICMP Echos to 10.7.7.1, timeout is 2 seconds:
!!!!!
Success rate is 100 percent (5/5), round-trip min/avg/max = 84/100/128 ms
R9>
show bgp vpnv4 unicast all
Remote customer site routes are exchanged through L3VPN and are installed in the
MP-BGP VRF table. This can be verified by the command;
show bgp vpnv4 unicast all
. This shows the routes in the
MP-BGP table for all VRFs.
R1#show bgp vpnv4 unicast all
BGP table version is 40, local router ID is 30.255.1.1
Status codes: s suppressed, d damped, h history, * valid, > best, i - internal,
r RIB-failure, S Stale, m multipath, b backup-path, f RT-Filter,
x best-external, a additional-path, c RIB-compressed,
Origin codes: i - IGP, e - EGP, ? - incomplete
RPKI validation codes: V valid, I invalid, N Not found
Network Next Hop Metric LocPrf Weight Path
Route Distinguisher: 200:1
*>i 10.10.1.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.2.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.3.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.4.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.5.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.6.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.7.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.8.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.9.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.10.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.255.1/32 30.255.6.6 2 100 0 ?
*>i 10.255.67.0/30 30.255.6.6 0 100 0 ?
Route Distinguisher: 200:2 (default for vrf CUST_Z_2)
Network Next Hop Metric LocPrf Weight Path
*> 10.9.1.0/24 10.255.19.2 2570240 32768 ?
*> 10.9.2.0/24 10.255.19.2 2570240 32768 ?
*> 10.9.3.0/24 10.255.19.2 2570240 32768 ?
*> 10.9.4.0/24 10.255.19.2 2570240 32768 ?
*>i 10.10.1.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.2.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.3.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.4.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.5.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.6.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.7.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.8.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.9.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.10.0/24 30.255.6.6 2 100 0 ?
*>i 10.10.255.1/32 30.255.6.6 2 100 0 ?
*> 10.255.19.0/30 0.0.0.0 0 32768 ?
*>i 10.255.67.0/30 30.255.6.6 0 100 0 ?
Route Distinguisher: 300:100 (default for vrf CUST_X_HQ)
*>i 10.80.1.0/24 30.255.5.5 10880 100 0 ?
*>i 10.80.2.0/24 30.255.5.5 10880 100 0 ?
*>i 10.80.3.0/24 30.255.5.5 10880 100 0 ?
*>i 10.80.4.0/24 30.255.5.5 10880 100 0 ?
Network Next Hop Metric LocPrf Weight Path
*>i 10.255.58.0/30 30.255.5.5 0 100 0 ?
Route Distinguisher: 300:110
*>i 10.80.1.0/24 30.255.5.5 10880 100 0 ?
*>i 10.80.2.0/24 30.255.5.5 10880 100 0 ?
*>i 10.80.3.0/24 30.255.5.5 10880 100 0 ?
*>i 10.80.4.0/24 30.255.5.5 10880 100 0 ?
*>i 10.255.58.0/30 30.255.5.5 0 100 0 ?
R1#
Note that the routes learned through MP-BGP VPNv4, have the next-hop as the iBGP remote PE peer.
R1#show bgp vpnv4 unicast all 10.80.1.0
BGP routing table entry for 300:100:10.80.1.0/24, version 15
Paths: (1 available, best #1, table CUST_X_HQ)
Not advertised to any peer
Refresh Epoch 1
Local, imported path from 300:110:10.80.1.0/24
30.255.5.5 (metric 3) from 30.255.5.5 (30.255.5.5)
Origin incomplete, metric 10880, localpref 100, valid, internal, best
Extended Community: RT:300:110 Cost:pre-bestpath:128:10880
0x8800:32768:0 0x8801:10:256 0x8802:65281:2560 0x8803:65281:1500
0x8806:0:1493108744
mpls labels in/out nolabel/5018
BGP routing table entry for 300:110:10.80.1.0/24, version 11
Paths: (1 available, best #1, no table)
Not advertised to any peer
Refresh Epoch 1
Local
30.255.5.5 (metric 3) from 30.255.5.5 (30.255.5.5)
Origin incomplete, metric 10880, localpref 100, valid, internal, best
Extended Community: RT:300:110 Cost:pre-bestpath:128:10880
0x8800:32768:0 0x8801:10:256 0x8802:65281:2560 0x8803:65281:1500
0x8806:0:1493108744
mpls labels in/out nolabel/5018
R1#
From this output, the details of EIGRP composite metrics are encoded in the BGP extended community.
EIGRP
When redistributing prefixes into EIGRP, ensure that the seed metric is configured.
Verification: show ip eigrp vrf <vrf-name> topology
.
Some prefixes will have VPNv4 as the route source.
R1>show ip eigrp vrf CUST_Z_2 topology
EIGRP-IPv4 VR(EIGRP_PE) Topology Table for AS(10)/ID(10.255.19.1)
Topology(base) TID(0) VRF(CUST_Z_2)
Codes: P - Passive, A - Active, U - Update, Q - Query, R - Reply,
r - reply Status, s - sia Status
P 10.10.8.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.9.1.0/24, 1 successors, FD is 328990720
via 10.255.19.2 (328990720/327761920), GigabitEthernet3/0
P 10.10.5.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.9.2.0/24, 1 successors, FD is 328990720
via 10.255.19.2 (328990720/327761920), GigabitEthernet3/0
P 10.255.19.0/30, 1 successors, FD is 1310720
via Connected, GigabitEthernet3/0
P 10.9.3.0/24, 1 successors, FD is 328990720
via 10.255.19.2 (328990720/327761920), GigabitEthernet3/0
P 10.255.67.0/30, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.9.4.0/24, 1 successors, FD is 328990720
via 10.255.19.2 (328990720/327761920), GigabitEthernet3/0
P 10.10.4.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.9.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.6.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.7.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.255.1/32, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.2.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.10.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.1.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
P 10.10.3.0/24, 1 successors, FD is 72089600
via Redistributed (72089600/0)
R1>
R1>show ip eigrp vrf CUST_Z_2 topology 10.10.1.0/24
EIGRP-IPv4 VR(EIGRP_PE) Topology Entry for AS(10)/ID(10.255.19.1)
Topology(base) TID(0) VRF(CUST_Z_2)
EIGRP-IPv4(10): Topology base(0) entry for 10.10.1.0/24
State is Passive, Query origin flag is 1, 1 Successor(s), FD is 72089600
Descriptor Blocks:
30.255.6.6, from Redistributed, Send flag is 0x0
Composite metric is (72089600/0), route is External
Vector metric:
Minimum bandwidth is 100000 Kbit
Total delay is 1000000000 picoseconds
Reliability is 200/255
Load is 1/255
Minimum MTU is 1500
Hop count is 0
Originating router is 10.255.19.1
External data:
AS number of route is 100
External protocol is BGP, external metric is 2
Administrator tag is 0 (0x00000000)
R1>
OSPF
Enabling OSPF on an interface that is already part of a VRF makes the OSPF process VRF-aware.
External Routes as Inter-area Routes
The OSPF behaviour when redistributing BGP routes into OSPF is for the routes to be external routes E1 or E2.
The following output shows the OSPF database of a remote PE router:
R5#show ip ospf 1001 database
OSPF Router with ID (10.255.58.1) (Process ID 1001)
Router Link States (Area 1)
Link ID ADV Router Age Seq# Checksum Link count
10.255.58.1 10.255.58.1 1196 0x80000006 0x00BD57 1
88.255.8.8 88.255.8.8 38 0x80000006 0x00E82C 5
Net Link States (Area 1)
Link ID ADV Router Age Seq# Checksum
10.255.58.2 88.255.8.8 38 0x80000004 0x00617C
Type-5 AS External Link States
Link ID ADV Router Age Seq# Checksum Tag
10.10.1.1 10.255.58.1 28 0x80000001 0x00EEB2 3489661028
10.10.2.1 10.255.58.1 28 0x80000001 0x00E3BC 3489661028
10.10.3.1 10.255.58.1 28 0x80000001 0x00D8C6 3489661028
10.10.4.1 10.255.58.1 28 0x80000001 0x00CDD0 3489661028
10.10.5.1 10.255.58.1 28 0x80000001 0x00C2DA 3489661028
10.255.110.0 10.255.58.1 28 0x80000001 0x00A1A1 3489661028
On the CE router, these prefixes are E2 routes:
R8>show ip route
Codes: L - local, C - connected, S - static, R - RIP, M - mobile, B - BGP
D - EIGRP, EX - EIGRP external, O - OSPF, IA - OSPF inter area
N1 - OSPF NSSA external type 1, N2 - OSPF NSSA external type 2
E1 - OSPF external type 1, E2 - OSPF external type 2
i - IS-IS, su - IS-IS summary, L1 - IS-IS level-1, L2 - IS-IS level-2
ia - IS-IS inter area, * - candidate default, U - per-user static route
o - ODR, P - periodic downloaded static route, H - NHRP, l - LISP
+ - replicated route, % - next hop override
Gateway of last resort is not set
10.0.0.0/8 is variably subnetted, 16 subnets, 3 masks
O E2 10.10.1.1/32 [110/2] via 10.255.58.1, 00:02:39, GigabitEthernet0/0
O E2 10.10.2.1/32 [110/2] via 10.255.58.1, 00:02:39, GigabitEthernet0/0
O E2 10.10.3.1/32 [110/2] via 10.255.58.1, 00:02:39, GigabitEthernet0/0
O E2 10.10.4.1/32 [110/2] via 10.255.58.1, 00:02:39, GigabitEthernet0/0
O E2 10.10.5.1/32 [110/2] via 10.255.58.1, 00:02:39, GigabitEthernet0/0
.....
!!!!Additional output omitted
...
With VPNv4, OSPF on the PE LSRs can be configured to consider these routes as
internal(inter-area). OSPF and BGP VPNv4 operate differently from normal redistribution.
These routes are considered internal routes for the purpose of loop prevention
and traffic engineering. Note that when redistributing prefixes into OSPF, excluding
the subnets
keyword results in classful prefixes being
advertised.
The output of show ip ospf <process-ID>
on the PE
router displays the output "Domain ID type.. value ...
".
The value is inherited from the OSPF process ID.
R1#show ip ospf 1000
Routing Process "ospf 1000" with ID 10.255.1.1
Domain ID type 0x0005, value 0.0.3.232
Start time: 00:00:36.340, Time elapsed: 01:48:49.108
Supports only single TOS(TOS0) routes
Supports opaque LSA
Supports Link-local Signaling (LLS)
Supports area transit capability
Supports NSSA (compatible with RFC 3101)
Connected to MPLS VPN Superbackbone, VRF CUST_X_HQ
Event-log disabled
It is an area border and autonomous system boundary router
Redistributing External Routes from,
bgp 100, includes subnets in redistribution
Router is not originating router-LSAs with maximum metric
Initial SPF schedule delay 5000 msecs
Minimum hold time between two consecutive SPFs 10000 msecs
Maximum wait time between two consecutive SPFs 10000 msecs
Incremental-SPF disabled
Minimum LSA interval 5 secs
Minimum LSA arrival 1000 msecs
LSA group pacing timer 240 secs
Interface flood pacing timer 33 msecs
Retransmission pacing timer 66 msecs
Number of external LSA 0. Checksum Sum 0x000000
Number of opaque AS LSA 0. Checksum Sum 0x000000
Number of DCbitless external and opaque AS LSA 0
Number of DoNotAge external and opaque AS LSA 0
Number of areas in this router is 1. 1 normal 0 stub 0 nssa
Number of areas transit capable is 0
External flood list length 0
IETF NSF helper support enabled
Cisco NSF helper support enabled
Reference bandwidth unit is 100 mbps
Area 1
Number of interfaces in this area is 1
Area has no authentication
SPF algorithm last executed 01:11:49.224 ago
SPF algorithm executed 2 times
Area ranges are
Number of LSA 9. Checksum Sum 0x02ADF2
Number of opaque link LSA 0. Checksum Sum 0x000000
Number of DCbitless LSA 0
Number of indication LSA 0
Number of DoNotAge LSA 0
Flood list length 0
On the remote PE R5:
R5(config-if)#do show ip ospf 1001
Routing Process "ospf 1001" with ID 10.255.58.1
Domain ID type 0x0005, value 0.0.3.233
Start time: 00:15:57.372, Time elapsed: 00:13:48.868
Supports only single TOS(TOS0) routes
Supports opaque LSA
Supports Link-local Signaling (LLS)
Supports area transit capability
Supports NSSA (compatible with RFC 3101)
Connected to MPLS VPN Superbackbone, VRF CUST_X_2
Event-log disabled
It is an area border and autonomous system boundary router
Redistributing External Routes from,
bgp 100, includes subnets in redistribution
Router is not originating router-LSAs with maximum metric
Initial SPF schedule delay 5000 msecs
Minimum hold time between two consecutive SPFs 10000 msecs
Maximum wait time between two consecutive SPFs 10000 msecs
Incremental-SPF disabled
Minimum LSA interval 5 secs
Minimum LSA arrival 1000 msecs
LSA group pacing timer 240 secs
Interface flood pacing timer 33 msecs
Retransmission pacing timer 66 msecs
Number of external LSA 6. Checksum Sum 0x04DD7F
Number of opaque AS LSA 0. Checksum Sum 0x000000
Number of DCbitless external and opaque AS LSA 0
Number of DoNotAge external and opaque AS LSA 0
Number of areas in this router is 1. 1 normal 0 stub 0 nssa
Number of areas transit capable is 0
External flood list length 0
IETF NSF helper support enabled
Cisco NSF helper support enabled
Reference bandwidth unit is 100 mbps
Area 1
Number of interfaces in this area is 1
Area has no authentication
SPF algorithm last executed 00:11:09.780 ago
SPF algorithm executed 4 times
Area ranges are
Number of LSA 3. Checksum Sum 0x0218F6
Number of opaque link LSA 0. Checksum Sum 0x000000
Number of DCbitless LSA 0
Number of indication LSA 0
Number of DoNotAge LSA 0
Flood list length 0
If the domain ID matches on the local and remote PE devices, prefixes that were originally external after redistribution will be considered as inter-area routes after redistribution from BGP to OSPF e.g., OSPF with process ID 1000 on both the local and remote PE devices; because the OSPF process ID is matching, then the routes can be encoded as inter-area routes.
The domain-ID can be configured to be the same on the local and remote PE by:
- Configuring the same OSPF process ID on the local and remote PE.
- A second option is to configure the same domain-id value on the local and remote PE.
The domain-ID value can be modified in OSPF using the OSPF
command domain-id <ip-address>
; the domain-ID value is
in the form of an IPv4 address.
If the domain-ID value is different in the OSPF process of the local and remote
PE LSRs, then the routes redistributed from BGP to OSPF will be flagged as
E2 external routes.
The domain-ID is encoded(in HEX) in the BGP path attributes as an extended community
value.
The domain-ID value takes precedence over the OSPF process ID value; i.e., if the
process IDs are different but the domain-IDs are the same, then the routes will
be advertised as inter-area routes.
R5(config)#router ospf 1001 vrf CUST_X_2
R5(config-router)#domain-id 0.0.3.232
R5(config-router)#do show ip ospf 1001
Routing Process "ospf 1001" with ID 10.255.58.1
Domain ID type 0x0005, value 0.0.3.232
Start time: 00:15:57.372, Time elapsed: 02:33:44.228
Supports only single TOS(TOS0) routes
Supports opaque LSA
Supports Link-local Signaling (LLS)
Supports area transit capability
Supports NSSA (compatible with RFC 3101)
Connected to MPLS VPN Superbackbone, VRF CUST_X_2
Event-log disabled
It is an area border and autonomous system boundary router
Redistributing External Routes from,
bgp 100, includes subnets in redistribution
Router is not originating router-LSAs with maximum metric
Initial SPF schedule delay 5000 msecs
Minimum hold time between two consecutive SPFs 10000 msecs
Maximum wait time between two consecutive SPFs 10000 msecs
Incremental-SPF disabled
Minimum LSA interval 5 secs
Minimum LSA arrival 1000 msecs
LSA group pacing timer 240 secs
Interface flood pacing timer 33 msecs
Retransmission pacing timer 66 msecs
Number of external LSA 0. Checksum Sum 0x000000
Number of opaque AS LSA 0. Checksum Sum 0x000000
Number of DCbitless external and opaque AS LSA 0
Number of DoNotAge external and opaque AS LSA 0
Number of areas in this router is 1. 1 normal 0 stub 0 nssa
Number of areas transit capable is 0
External flood list length 0
IETF NSF helper support enabled
Cisco NSF helper support enabled
Reference bandwidth unit is 100 mbps
Area 1
Number of interfaces in this area is 1
Area has no authentication
SPF algorithm last executed 02:31:05.160 ago
SPF algorithm executed 4 times
Area ranges are
Number of LSA 9. Checksum Sum 0x059C39
Number of opaque link LSA 0. Checksum Sum 0x000000
Number of DCbitless LSA 0
Number of indication LSA 0
Number of DoNotAge LSA 0
Flood list length 0
R5(config-router)#end
R5>show ip ospf 1001 database
OSPF Router with ID (10.255.58.1) (Process ID 1001)
Router Link States (Area 1)
Link ID ADV Router Age Seq# Checksum Link count
10.255.58.1 10.255.58.1 109 0x80000008 0x00B959 1
88.255.8.8 88.255.8.8 67 0x80000007 0x00E62D 5
Net Link States (Area 1)
Link ID ADV Router Age Seq# Checksum
10.255.58.2 88.255.8.8 67 0x80000005 0x005F7D
Summary Net Link States (Area 1)
Link ID ADV Router Age Seq# Checksum
10.10.1.1 10.255.58.1 789 0x80000001 0x000E51
10.10.2.1 10.255.58.1 789 0x80000001 0x00035B
10.10.3.1 10.255.58.1 789 0x80000001 0x00F765
10.10.4.1 10.255.58.1 789 0x80000001 0x00EC6F
10.10.5.1 10.255.58.1 789 0x80000001 0x00E179
10.255.110.0 10.255.58.1 789 0x80000001 0x00C040
The OSPF database on R5 shows that the routes are now considered inter-area.
On the remote site client CE, the routes are now considered inter-area:
R8>show ip route ospf
Codes: L - local, C - connected, S - static, R - RIP, M - mobile, B - BGP
D - EIGRP, EX - EIGRP external, O - OSPF, IA - OSPF inter area
N1 - OSPF NSSA external type 1, N2 - OSPF NSSA external type 2
E1 - OSPF external type 1, E2 - OSPF external type 2
i - IS-IS, su - IS-IS summary, L1 - IS-IS level-1, L2 - IS-IS level-2
ia - IS-IS inter area, * - candidate default, U - per-user static route
o - ODR, P - periodic downloaded static route, H - NHRP, l - LISP
+ - replicated route, % - next hop override
Gateway of last resort is not set
10.0.0.0/8 is variably subnetted, 16 subnets, 3 masks
O IA 10.10.1.1/32 [110/3] via 10.255.58.1, 00:10:22, GigabitEthernet0/0
O IA 10.10.2.1/32 [110/3] via 10.255.58.1, 00:10:22, GigabitEthernet0/0
O IA 10.10.3.1/32 [110/3] via 10.255.58.1, 00:10:22, GigabitEthernet0/0
O IA 10.10.4.1/32 [110/3] via 10.255.58.1, 00:10:22, GigabitEthernet0/0
O IA 10.10.5.1/32 [110/3] via 10.255.58.1, 00:10:22, GigabitEthernet0/0
O IA 10.255.110.0/30 [110/2] via 10.255.58.1, 00:10:22, GigabitEthernet0/0
show ip cef
displays both the transport label and VPN label.
Superbackbone
On the PE router of the provider, in the OSPF output of the command
show ip ospf <process-ID>
, the message,
Connected to the MPLS VPN superbackbone
.
R5(config-router)#do show ip ospf 1001
Routing Process "ospf 1001" with ID 10.255.58.1
Domain ID type 0x0005, value 0.0.3.232
Start time: 00:15:57.372, Time elapsed: 02:33:44.228
Supports only single TOS(TOS0) routes
Supports opaque LSA
Supports Link-local Signaling (LLS)
Supports area transit capability
Supports NSSA (compatible with RFC 3101)
Connected to MPLS VPN Superbackbone, VRF CUST_X_2
Event-log disabled
It is an area border and autonomous system boundary router
Redistributing External Routes from,
bgp 100, includes subnets in redistribution
Router is not originating router-LSAs with maximum metric
Initial SPF schedule delay 5000 msecs
Minimum hold time between two consecutive SPFs 10000 msecs
Maximum wait time between two consecutive SPFs 10000 msecs
Incremental-SPF disabled
Minimum LSA interval 5 secs
Minimum LSA arrival 1000 msecs
LSA group pacing timer 240 secs
Interface flood pacing timer 33 msecs
Retransmission pacing timer 66 msecs
Number of external LSA 0. Checksum Sum 0x000000
Number of opaque AS LSA 0. Checksum Sum 0x000000
Number of DCbitless external and opaque AS LSA 0
Number of DoNotAge external and opaque AS LSA 0
Number of areas in this router is 1. 1 normal 0 stub 0 nssa
Number of areas transit capable is 0
External flood list length 0
IETF NSF helper support enabled
Cisco NSF helper support enabled
Reference bandwidth unit is 100 mbps
Area 1
Number of interfaces in this area is 1
Area has no authentication
SPF algorithm last executed 02:31:05.160 ago
SPF algorithm executed 4 times
Area ranges are
Number of LSA 9. Checksum Sum 0x059C39
Number of opaque link LSA 0. Checksum Sum 0x000000
Number of DCbitless LSA 0
Number of indication LSA 0
Number of DoNotAge LSA 0
Flood list length 0
R5(config-router)#end
The superbackbone is a hierarchy above area 0. With a superbackbone hierarchy in the OSPF domain, permitted hierarchies include:
- Super-backbone (MPLS)→ Area 0 → Area 1
- Super-backbone (MPLS)→ Area 1 → Area 1 Not permitted is:
- Super-backbone (MPLS)→ Area 1 → Area 0. Here, a virtual link would be required to be configured to connect the Area 0 to the superbackbone as it would be considered a discontiguous area.
Traffic Enginneering: Primary and Backup Paths
Load-balancing across multiple paths can be accomplished by modifying the OSPF process ID or OSPF domain number. This results in some paths becoming external routes. OSPF always chooses internal routes over external routes.
OSPF Sham-links
OSPF sham-links are similar to virtual links going over the SP network. It does not have to be in area 0.
To configure OSPF sham-links:
- Advertise a new prefix (/32) of a loopback into the VRF and advertise this into BGP: Remember, the netmask must be /32 for this feature to work. This is done on both PE devices.
- Verify IP connectivity between the two /32 prefixes.
- Under the OSPF process, configure the sham-link
area 1 sham-link 8.8.8.8 3.3.3.3
. where 8.8.8.8 is the source (/32 of local device) and 3.3.3.3 is remote OSPF device. Do the same but in the reverse direction on the remote device. - Verify with the command
show ip ospf <process-id> neighbor
. It shows neighborship over the sham-link. Routes that were previously learned through BGP are now learned thru OSPF (because of lower AD); verify this on PE router using command:show ip route
.
Practical use case is that if you had a point-to-point VPN and you are migrating to MPLS L3VPN from legacy frame relay service, the backdoor link will be preferred from an OSPF point of view because intra-area is preferred to inter-area preferred to external routes.
Traffic engineering can be implemented based on the cost.
Load-balancing
If the SP network employs the use of BGP route reflector, and will only
advertise what it considers as the best path of the available paths and not
use the available paths simultaneously. For RR groups these updates based on the
RD and performs path selection between the two of them.
To enable the RR to use both paths, we need to configure different RDs so that the
RR can advertise both paths to the remote PE. The RR will receive these as two
separate and independent best paths and they can be advertised to the remote neighbor.
The command maximum-paths <value>
needs to be configured
to enable multipaths.
BGP as PE-CE
If prefixes are to be shared using BGP as the protocol between at the PE-CE interconnection, on the PE device, the peering should be implemented in the VRF address-family. At the CE devices, the ASN for BGP should be the same. However, because BGP will detect the same ASN, it will reject the advertised route to prevent loop formation. There are two fixes for this:
- On the CE, configure the neighbor with the command
allow-as-in
- Alternatively, on the PE,
as-override
to the CE iBGP peer on both ends of the LSP.
Loops can be introduced by the above configuration options. One solution is the
BGP site-of-origin.
Under BGP SOO, prefixes from one site are tagged with SOO code (say 1000:1) as they move to the PE.
This gets advertised as a BGP extended community. The PE devices will filter out
any other prefixes that have that SOO code set. This is configured on the PE device
for the CE eBGP peer. This is configured under the VRF address-family.
neighbor 155.1.67.7 soo 1000:1
.
SOO can be verified with command show bgp vpnv4 unicast all <prefix>
.
Lookout for SoO.
Verification
On the RR to confirm sharing of prefixes with PE:
show bgp vpnv4 unicast all neighbors <neighbor-ip-address> advertised-routes
.
Step 6: Test Connectivity
A ping test or traceroute test can be used to confirm the operation of the MPLS L3VPN.
traceroute
R7>traceroute 10.9.1.1
Type escape sequence to abort.
Tracing the route to 10.9.1.1
VRF info: (vrf in name/id, vrf out name/id)
1 10.255.67.2 20 msec 16 msec 20 msec
2 30.255.46.1 [MPLS: Labels 4003/1000 Exp 0] 124 msec 92 msec 104 msec
3 30.255.24.1 [MPLS: Labels 2006/1000 Exp 0] 144 msec 52 msec 76 msec
4 10.255.19.1 [MPLS: Label 1000 Exp 0] 28 msec 76 msec 56 msec
5 10.255.19.2 112 msec * 104 msec
In the output of the traceroute command, two (2) MPLS labels are displayed; one label is the transport (MPLS) label and the other label is the VPN label. This confirms operation of MPLS L3VPN. In this case, the transport label is 4003 and VPN label 1000. As can be noticed, the transport label is continually swapped from 4003 to 2006 as the packet is switched from hop to hop while the VPN label is maintained with the packet from the ingress PE across the LSP to the egress PE where it finally gets popped and the packet is then redistributed into the IGP at the PE-CE link and routed using IP.
ping
R7>ping 10.9.2.1
Type escape sequence to abort.
Sending 5, 100-byte ICMP Echos to 10.9.2.1, timeout is 2 seconds:
!!!!!
Success rate is 100 percent (5/5), round-trip min/avg/max = 44/65/100 ms
The ping command confirms the connectivity from customer site to remote customer site across the SP MPLS L3VPN.
To view the MPLS LFIB
R6>show mpls forwarding-table
Local Outgoing Prefix Bytes Label Outgoing Next Hop
Label Label or Tunnel Id Switched interface
16 Pop Label 30.255.114.0/30 0 Gi0/0 30.255.46.1
17 4006 30.255.112.0/30 0 Gi0/0 30.255.46.1
5024 30.255.112.0/30 0 Gi1/0 30.255.56.1
18 Pop Label 30.255.24.0/30 0 Gi0/0 30.255.46.1
19 4000 30.255.11.11/32 0 Gi0/0 30.255.46.1
5022 30.255.11.11/32 0 Gi1/0 30.255.56.1
20 4012 30.255.2.2/32 0 Gi0/0 30.255.46.1
21 4023 30.2.10.1/32 0 Gi0/0 30.255.46.1
22 Pop Label 30.255.5.5/32 0 Gi1/0 30.255.56.1
23 4003 30.255.1.1/32 0 Gi0/0 30.255.46.1
24 Pop Label 30.255.115.0/30 0 Gi1/0 30.255.56.1
25 4007 30.255.113.0/30 0 Gi0/0 30.255.46.1
5001 30.255.113.0/30 0 Gi1/0 30.255.56.1
26 4009 30.255.12.0/30 0 Gi0/0 30.255.46.1
27 4005 30.255.13.0/30 0 Gi0/0 30.255.46.1
5003 30.255.13.0/30 0 Gi1/0 30.255.56.1
28 4004 30.255.35.0/30 0 Gi0/0 30.255.46.1
5037 30.255.35.0/30 0 Gi1/0 30.255.56.1
29 4011 30.255.3.3/32 0 Gi0/0 30.255.46.1
5004 30.255.3.3/32 0 Gi1/0 30.255.56.1
Local Outgoing Prefix Bytes Label Outgoing Next Hop
Label Label or Tunnel Id Switched interface
30 4013 30.3.10.1/32 0 Gi0/0 30.255.46.1
5005 30.3.10.1/32 0 Gi1/0 30.255.56.1
31 4014 30.3.9.1/32 0 Gi0/0 30.255.46.1
5006 30.3.9.1/32 0 Gi1/0 30.255.56.1
32 4015 30.3.8.1/32 0 Gi0/0 30.255.46.1
5007 30.3.8.1/32 0 Gi1/0 30.255.56.1
33 4016 30.3.7.1/32 0 Gi0/0 30.255.46.1
5008 30.3.7.1/32 0 Gi1/0 30.255.56.1
34 4017 30.3.6.1/32 0 Gi0/0 30.255.46.1
5009 30.3.6.1/32 0 Gi1/0 30.255.56.1
35 4018 30.3.5.1/32 0 Gi0/0 30.255.46.1
5010 30.3.5.1/32 0 Gi1/0 30.255.56.1
36 4019 30.3.4.1/32 0 Gi0/0 30.255.46.1
5011 30.3.4.1/32 0 Gi1/0 30.255.56.1
37 4020 30.3.3.1/32 0 Gi0/0 30.255.46.1
5012 30.3.3.1/32 0 Gi1/0 30.255.56.1
38 4021 30.3.2.1/32 0 Gi0/0 30.255.46.1
5013 30.3.2.1/32 0 Gi1/0 30.255.56.1
39 4022 30.3.1.1/32 0 Gi0/0 30.255.46.1
5014 30.3.1.1/32 0 Gi1/0 30.255.56.1
40 No Label 10.7.3.0/24[V] 0 Gi2/0 10.255.67.1
41 Pop Label 30.255.4.4/32 0 Gi0/0 30.255.46.1
Local Outgoing Prefix Bytes Label Outgoing Next Hop
Label Label or Tunnel Id Switched interface
42 No Label 30.4.10.1/32 0 Gi0/0 30.255.46.1
43 No Label 30.4.9.1/32 0 Gi0/0 30.255.46.1
44 No Label 30.4.8.1/32 0 Gi0/0 30.255.46.1
45 No Label 30.4.7.1/32 0 Gi0/0 30.255.46.1
46 No Label 30.4.6.1/32 0 Gi0/0 30.255.46.1
47 No Label 30.4.5.1/32 0 Gi0/0 30.255.46.1
48 No Label 30.4.4.1/32 0 Gi0/0 30.255.46.1
49 No Label 30.4.3.1/32 0 Gi0/0 30.255.46.1
50 No Label 10.10.255.1/32[V] \
0 Gi2/0 10.255.67.1
51 No Label 10.255.67.0/30[V] \
2372 aggregate/CUST_Z_HQ
52 4024 30.2.9.1/32 0 Gi0/0 30.255.46.1
53 4025 30.2.8.1/32 0 Gi0/0 30.255.46.1
54 4026 30.2.7.1/32 0 Gi0/0 30.255.46.1
55 4027 30.2.6.1/32 0 Gi0/0 30.255.46.1
56 4028 30.2.5.1/32 0 Gi0/0 30.255.46.1
57 4029 30.2.4.1/32 0 Gi0/0 30.255.46.1
58 4030 30.2.3.1/32 0 Gi0/0 30.255.46.1
59 4031 30.2.2.1/32 0 Gi0/0 30.255.46.1
60 4032 30.2.1.1/32 0 Gi0/0 30.255.46.1
61 No Label 10.7.1.0/24[V] 0 Gi2/0 10.255.67.1
Local Outgoing Prefix Bytes Label Outgoing Next Hop
Label Label or Tunnel Id Switched interface
62 No Label 10.7.4.0/24[V] 0 Gi2/0 10.255.67.1
63 No Label 10.7.2.0/24[V] 0 Gi2/0 10.255.67.1
64 No Label 10.7.5.0/24[V] 0 Gi2/0 10.255.67.1
65 No Label 30.4.2.1/32 0 Gi0/0 30.255.46.1
66 No Label 30.4.1.1/32 0 Gi0/0 30.255.46.1
67 Pop Label 30.4.40.0/24 0 Gi0/0 30.255.46.1
R6>
L3VPN routes have a [V] next to the learned route.
To view the FIB for the different VRFs run the command show ip cef vrf <vrf_name>
.
R6>show ip cef vrf CUST_Z_HQ
Prefix Next Hop Interface
0.0.0.0/0 no route
0.0.0.0/8 drop
0.0.0.0/32 receive
10.7.1.0/24 10.255.67.1 GigabitEthernet2/0
10.7.2.0/24 10.255.67.1 GigabitEthernet2/0
10.7.3.0/24 10.255.67.1 GigabitEthernet2/0
10.7.4.0/24 10.255.67.1 GigabitEthernet2/0
10.7.5.0/24 10.255.67.1 GigabitEthernet2/0
10.9.1.0/24 30.255.46.1 GigabitEthernet0/0
10.9.2.0/24 30.255.46.1 GigabitEthernet0/0
10.9.3.0/24 30.255.46.1 GigabitEthernet0/0
10.9.4.0/24 30.255.46.1 GigabitEthernet0/0
10.10.255.1/32 10.255.67.1 GigabitEthernet2/0
10.255.19.0/30 30.255.46.1 GigabitEthernet0/0
10.255.67.0/30 attached GigabitEthernet2/0
10.255.67.0/32 receive GigabitEthernet2/0
10.255.67.1/32 attached GigabitEthernet2/0
10.255.67.2/32 receive GigabitEthernet2/0
10.255.67.3/32 receive GigabitEthernet2/0
127.0.0.0/8 drop
224.0.0.0/4 drop
224.0.0.0/24 receive
Prefix Next Hop Interface
240.0.0.0/4 drop
255.255.255.255/32 receive
Additionally, to view the MPLS L3VPN on a specific route in the VRF LIB
R6>show ip cef vrf CUST_Z_HQ 10.9.1.0
10.9.1.0/24
nexthop 30.255.46.1 GigabitEthernet0/0 label 4003 1000
R6>
R6>show ip cef vrf CUST_Z_HQ 10.7.1.0
10.7.1.0/24
nexthop 10.255.67.1 GigabitEthernet2/0
R6>show ip cef vrf CUST_Z_HQ 10.7.1.0 detail
10.7.1.0/24, epoch 0
local label info: other/61
nexthop 10.255.67.1 GigabitEthernet2/0
R6>show ip cef vrf CUST_Z_HQ 10.9.1.0 detail
10.9.1.0/24, epoch 0, flags rib defined all labels
recursive via 30.255.1.1 label 1000
nexthop 30.255.46.1 GigabitEthernet0/0 label 4003
R6>
Troubleshooting
LDP Adjacency
- Is LDP actually enabled:
show mpls interfaces
- Is LDP transport working:
debug mpls ldp transport events
- Is LDP session authenticated: LDP(like BGP) uses TCP auth (option 19)
LDP Advertisements
Advertise labels for all IGP learned routes.
- Are labels actually bound:
show mpls ldp binding
show mpls forwarding-table
debug mpls ldp binding
debug mpls ldp transport events
: able to view LDP messages being exchanged. If an LDP peer does not have a route to its neighbor's LDP ID, this debug command will display this appropriate message.
- Is allocation being filtered?:
- Advertise filter vs. allocate filter: labels can be advertised for prefixes that are in a specified access-list. It is important to advertise labels for all IGP learned routes.
- Typically only /32 needs an allocation.
PE-CE Routing
- Is loop prevention being violated?:
- OSPF don-bit and domain-tag
- BGP AS-Path and Site-of-Origin(SoO)
- EIGRP Site-of-Origin
- Is the VRF just hung?
clear ip route vrf *
- Forces re-import/export
VPNv4 BGP
Is the route-target being filtered
- Wrong import/export policy
- Default route-target filter
- VPNv4 Route reflection
debug bgp vpnv4 unicast updates
to view debugging messages
associated with VPNv4 failures. Critical error message is "DENIED due to: extended
community not supported". show vrf detail
.
Are VPNv4 routes being sent/received
show bgp vpnv4 unicast all
show bgp vpnv4 unicast all <prefix>
: The in() and out MPLS labels of a prefix can be viewed.show bgp vpnv4 unicast all neighbor advertised-routes
debug bgp vpnv4 unicast updates
clear bgp vpnv4 unicast * <in|out>
MPLS Data Plane
- PHP may happen one hop too soon if peering to the wrong address. This will result in a traffic blackhole.
- Problems in route summarization.