Bugless #53

Access to k0 pod network from routing fabric is borked

Added by q3k over 2 years ago. Updated almost 2 years ago.

Status: Assigned
Priority: Normal
Assignee:
Category: hscloud

Description

For example, from boston:

$ curl 10.10.25.14:9092 # matrix metrics

will sometimes work and sometimes get stuck.
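To put a number on "sometimes", a small probe along these lines could be run from boston. It is only a sketch: the `probe` helper is hypothetical (not part of hscloud), and it checks only whether the TCP handshake completes, which may or may not be where curl gets stuck.

```python
import socket


def probe(host: str, port: int, attempts: int = 20, timeout: float = 2.0) -> int:
    """Return how many of `attempts` fresh TCP connections complete."""
    ok = 0
    for _ in range(attempts):
        try:
            # Every connection gets a new ephemeral source port, so a
            # per-flow ECMP hash that includes L4 ports (an assumption)
            # may pick a different next hop on each attempt.
            with socket.create_connection((host, port), timeout=timeout):
                ok += 1
        except OSError:
            # Timeouts, refusals and unreachable errors all count as failures.
            pass
    return ok


# Example, using the target from this report (only reachable from the fabric):
#   print(probe("10.10.25.14", 9092), "of 20 connections succeeded")
```

If the success rate is close to one in four, that would be consistent with only one of the four next hops delivering the traffic.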

10.10.25.0/26 is ECMP'd across all k0 hosts:

dcsw01.hswaw.net#show ip route 10.10.25.0/26
Codes: C - connected, S - static, K - kernel, 
       O - OSPF, IA - OSPF inter area, E1 - OSPF external type 1,
       E2 - OSPF external type 2, N1 - OSPF NSSA external type 1,
       N2 - OSPF NSSA external type2, B I - iBGP, B E - eBGP,
       R - RIP, I - ISIS, A B - BGP Aggregate, A O - OSPF Summary,
       NG - Nexthop Group Static Route 

 B E    10.10.25.0/26 [200/0] via 185.236.240.35, Vlan2001
                              via 185.236.240.36, Vlan2001
                              via 185.236.240.39, Vlan2001
                              via 185.236.240.40, Vlan2001

However, 10.10.25.14 is a pod IP, so it is really only handled by one node: in this case, dcr01s24 / 185.236.240.40. And it seems like it only works when ECMP sends the traffic directly to that node, but not otherwise.
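As a rough illustration of why this would look intermittent, here is a toy model of per-flow ECMP selection across the four next hops from the route above. The hash is an assumption: the switch's real algorithm is vendor-specific and not known here, SHA-256 over the 5-tuple is just a stand-in, and the source address 192.0.2.1 is a placeholder rather than boston's real address.

```python
import hashlib
import random

# Next hops from the `show ip route` output above.
NEXT_HOPS = [
    "185.236.240.35",
    "185.236.240.36",
    "185.236.240.39",
    "185.236.240.40",
]
POD_NODE = "185.236.240.40"  # dcr01s24, the node that hosts the pod


def pick_next_hop(src_ip, src_port, dst_ip, dst_port, proto=6):
    """Pick a next hop from a hash of the flow 5-tuple (stand-in hash)."""
    key = f"{src_ip}:{src_port}->{dst_ip}:{dst_port}/{proto}".encode()
    digest = hashlib.sha256(key).digest()
    return NEXT_HOPS[int.from_bytes(digest[:4], "big") % len(NEXT_HOPS)]


def direct_hit_ratio(flows=10000, seed=0):
    """Fraction of simulated flows that land directly on the pod's node."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(flows):
        # Hypothetical client: only the ephemeral source port varies.
        sport = rng.randint(32768, 60999)
        if pick_next_hop("192.0.2.1", sport, "10.10.25.14", 9092) == POD_NODE:
            hits += 1
    return hits / flows
```

Under these assumptions, roughly a quarter of new connections hash straight to dcr01s24, and a given flow always takes the same next hop, so an attempt would either work for its whole lifetime or stay stuck.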

But even so, traffic that hits one of the other nodes should be properly bounced on to the node that hosts the pod. What's going on?
