Michael’s tech blog

Achieving BIG-IP High Availability with Azure Route Server

2024-10-25T00:00:00+00:00

Broadly speaking, there’s a few common ways to achieve High Availability in Azure, where 2 BIG-IP devices may be running Active/Active or Active/Standby. I will categorize them as follows:

Azure Load Balancer
Cloud Failover Extension (CFE)
DNS-based HA

There are further options. The Azure GWLB is an option, but I’ll consider this another load balancer option in category 1. And the CFE approach has some further sub-options too. You can move public or private IP addresses between devices (just like ARP does on-prem), or update UDR in Azure with a CIDR block that covers the entire VIP range.

I will offer an additional alternative:

BGP and Azure Route Server

This article covers the concept and how-to of using BGP for High Availability in Azure. Using BGP and Route Health Injection with F5 is nothing new, but it is almost never seen in public cloud environments. Still, some methods that have been used for many years on-prem (Route Health Injection, or Active/Active with ECMP) can be achieved in Azure, using Azure Route Server.

Azure Route Server as depicted in Microsoft documentation

HA using Azure Route Server

I’ll cover some concepts that are related but discussed separately: Route Health Injection for Virtual Servers, and more generic BGP route advertising. I’ll also talk about Active/Standby vs Active/Active options, and ECMP.

I will then outline multiple methods and a “how-to” for achieving HA in Azure using BGP routing:

Active/Standby (Routing to Active device only)
Active/Active routes for multiple BIG-IP devices, using the network command or static routes

Concepts: Active/Standby vs Active/Active

Route Health Injection and Active/Standby mode

Route Health Injection allows the BIG-IP to advertise VIPs via routing protocols. As a basic example, a VirtualServer may have an IP address of 192.168.100.100/32. BIG-IP can advertise this /32 route to peers based on the health of the Virtual Server. Another example: I could have a VIP with a destination IP of 192.168.100.0/24, and then this route would also be advertised with a next hop of BIG-IP.

In an Active/Standby device group, both devices are sharing routes via BGP with the Active device as the next hop. In a graceful failover, both devices will immediately update their neighbors with the next hop of the (newly) Active BIG-IP. If the failover is not graceful and the Active device is suddenly lost, it’s BGP peering will be broken and its neighbors will remove the routes that were learned from the device that was Active but is now offline.

When considering an Active/Standby device cluster, let’s highlight words from F5’s guidance on dynamic routing:

Note: When you configure RHI in a device group configuration, only devices with active traffic groups attempt to advertise routes to virtual addresses.

BGP configurations and Active/Active mode

In an Active/Active scenario, multiple devices can advertise a route with equal weight. To ensure traffic flow is symmetric, neighbor routers must support ECMP (this ensures that a connection from a client is persisted to a single BIG-IP only). Azure Route Server uses ECMP when multiple devices advertise the same route with the same AS path length.

There are several methods by which you can share routes via BGP from BIG-IP, whether the device is Active or Standby. Using these methods will result in the same route and AS path with multiple next hops: ie., Active/Active routes on BGP peers.

How-to: HA with BIG-IP using Azure Route Server

Demo environment showing Azure Route Server and BIG-IP

Whether we will choose an Active/Standby or Active/Active approach, we must first set up BGP peering between F5 BIG-IP and Azure Route Server. These instructions assume we have a single BIG-IP HA pair in Azure and are configuring iBGP peering between our BIG-IP devices and eBGP peering with Azure Route Server.

Follow this tutorial from Microsoft on configuring Azure Route Server. Configure only Azure Route Server. Stop when you reach this section for configuring a network virtual appliance (NVA). We’ll use F5’s instructions to configure BIG-IP devices instead.

Screenshot of successful deployment of Azure Route Server

Take note of the ASN and Peer IPs after the Route Server is created. The ASN will always be 65515. We will use this, and the Peer IP’s, when configuring BGP on BIG-IP.

Follow this tutorial from F5 on configuring BGP on BIG-IP.

allow TCP/179 on appropriate Self IPs.
enable BGP on your Route Domain: tmsh modify /net route-domain 0 routing-protocol add { BGP }

configure iBGP between your BIG-IP devices. This script will be slightly different on each device.

# run these commands on both devices. The "neighbor" commands will be unique on each device.
imish
enable
config terminal
router bgp 200
neighbor 10.0.1.12 remote-as 200
# neighbor 10.1.1.11 remote-as 200
neighbor 10.0.1.12 activate
# neighbor 10.0.1.11 activate
end
write

configure eBGP between the devices and the Azure Route Server. This script will be identical on each device.

notice that a route map is created in order to filter which routes we share via BGP (only 192.168.100.0/24)

notice the “redistribute kernel” command, without which, BGP would not share Kernel routes (and would share routes that we could configure manually using imish commands)

imish
enable
config terminal
ip prefix-list PFX_ALLOW_VIPS seq 5 permit 192.168.100.0/24
route-map RESTRICT_ADVERTISE permit 10
match ip address prefix-list PFX_ALLOW_VIPS
router bgp 200
redistribute kernel
neighbor 10.0.3.4 remote-as 65515
neighbor 10.0.3.5 remote-as 65515
neighbor 10.0.3.4 activate
neighbor 10.0.3.5 activate
neighbor 10.0.3.4 route-map RESTRICT_ADVERTISE out
neighbor 10.0.3.5 route-map RESTRICT_ADVERTISE out
end
write

Return to Microsoft’s tutorial at this location to configure Route Server peering.
- create a peering to BIG-IP device 1
- create a peering to BIG-IP device 2

Ideally, at this point the Status of the Peering will show completed. Lastly, don’t forget to check the box “enable IP Forwarding” on your NIC in Azure if you want to use a VIP range that is not native to your VNET.

Screenshot of successful configuration of BGP peers

We can check learned routes from the Azure Route Server. The Microsoft tutorial uses PowerShell and the Get-AzRouteServerPeerLearnedRoute cmdlet. I prefer az cli so I’ll use:

$ az network routeserver peering list-learned-routes -g oleary-bgp-rg --routeserver oleary-bgp-rs -n bigip1
{
  "RouteServiceRole_IN_0": [],
  "RouteServiceRole_IN_1": []
}

Important
We should ensure that we only share routes with Azure Route Server that we intend to share. Typically we do not want to advertise the 0.0.0.0/0 route from the BIG-IP, because it will cause the entire VNET’s default route to be BIG-IP (and likely make your VNET unreachable). For that reason, we’ve created a route map that will filter any shared routes and only allow our desired route, 192.168.100.0/24, to be advertised via BGP.

At this point, Azure Route Server has not received any routes from BIG-IP, but the BGP relationship is Established.

1. Active/Standby: Advertising a VIP range from the Active device only

In this example, I’ll use a range (192.168.0.0/24) for my VIPs. In the cloud, I’ll often call this an “alien range” because it will be a CIDR block that does not truly exist in the VNET or VPC, but that is routed toward the BIG-IP.

Create a “dummy” VIP where the Destination IP is 192.168.100.0/24. We won’t actually target this VIP, so it can be any type (eg IP forwarding).
Set the Virtual Address to have “Route Advertisement” set to “Enabled”.

BIG-IP GUI config setting routing of Virtual Address

This VIP will be “redistributed” by BGP because it matches our allowed route for BGP sharing. It will be advertised with a next hop of the Active BIG-IP device. Let’s check this from both our Active and Standby device, and see that

Verifying routes using imish command line

We can also check the Azure Portal (eg, the effective routes of interfaces in the VNET) or use the CLI commands from earlier to see the learned routes by Azure Route Server.

Verifying routes using Azure Route Server

2. Active/Active: using BGP to advertise routes from both devices

Let’s return to the section of our F5 tutorial titled “Configuring and verifying the BIG-IP system to exchange routing prefixes”.

You can exchange network prefixes with BGP on the BIG-IP system with the network command or through redistribution. Using these methods will likely result in both your Active and Standby devices advertising a route with their own Self IP as the next hop.

To use the network command to manually configure a route to advertise, you would use imish configuration like this:

imish
enable
configure terminal
router bgp 200
network 192.168.100.0/24
end
write

You also would not need to advertise kernel routes in this scenario, so you could add no redistribute kernel if so desired.

Use the same methods as above to verify learned routes in Azure. You can also use show ip bgp neighbor 10.0.3.4 advertised-routes to verify which routes are advertised to Azure from the perspective of BIG-IP.

Another option is to create static routes on BIG-IP itself - with GUI or tmsh create net route - and share those. However, network routes in BIG-IP cannot have a Self IP as the next hop, so sharing these routes will not result in routes in Azure’s VNET that point to BIG-IP. In this case, I don’t see a good reason to explore this option further.

Other Considerations

BGP offers some advantes. No Azure load balancer is required in this design. Unlike CFE, there are no Managed Identities to configure, no RBAC permissions, and no API calls needed from BIG-IP to Azure’s management plane.

However, peering with BGP requires the Advanced Routing module (included with Better and Best licenses) and Azure Route Server. BGP also requires planning, skills, and perhaps team cooperation beyond what may have been planned.

Some BIG-IP admins have strong network skills and feel comfortable with BGP; others may fear the potential to cause outages beyond just BIG-IP if misconfigured.

Conclusion

Failover via BGP and Azure Route Server is fast and reliable, although personally I see Azure Load Balancer and CFE implementations much more commonly. Thanks for reading and please reach out with any questions!

BIG-IP BGP Routing Protocol Configuration And Use Cases

Migrate Bitnami Wordpress between servers, Part 3

2024-10-14T00:00:00+00:00

Ugh. I thought I was done, but I have a Part 3 to write.

Part 1
Part 2

AWS Lightsail Bitnami Wordpress and sending mail

I realized that I had set up Amazon SES in the past under my own account. I needed to move it to my buddy’s AWS account. Here we go.

Short version is that using Amazon SES is documented here, but really that’s no better than generic Amazon SES documentation. We want something Wordpress-specific, and ideally something that is “Bitnami Wordpress on AWS Lightsail”-specific.

Within Wordpress, I actually had to activate the plugin called WP Mail SMTP, just like is outlined in these specific instructions from Bitnami.

I would take screenshots, but those in the linked document above were exactly what I had. After following the first link and creating SMTP creds in SES, I was able to enter them into my Wordpress installation, and configure the SMTP host email-smtp.us-east-2.amazonaws.com.

After verifying the email address I want to send my mail to, and the domain, my contact forms now work.

After doing all of this, I see that Amazon has instructions specifically for Lightsail Wordpress instances. These are the best instructions that I’ve seen regarding this procedure.

A couple more things

I changed my wp-config.php file one last time:

define( 'DOMAIN_CURRENT_SITE', 'lowellpaincenter.com' );
was changed to
define( 'DOMAIN_CURRENT_SITE', $_SERVER['HTTP_HOST'] );

I think this allows me to login to either site and get to the Network Admin Dashboard. Before, I could only do this if logged into lowellpaincenter

I removed SSH (TCP/22) from allowed FW rules in Lightsail. Now, for admin access to CLI, myself or someone else will need to log into the AWS Lightsail console and re-enable. Even after that, SSH will require authenticating with my private key.

Migrate Bitnami Wordpress between servers, Part 2

2024-10-13T00:00:00+00:00

Yesterday I wrote a blog post about migrating my multi-site Wordpress deployment between 2 servers. It ended in failure because I could not reach the Network Admin dashboard, even though my 2 sites were accessible.

This evening I blew away the new server (twice, in the end) and tried again and worked it out. Here’s a record so I remember the changes that I made.

Migrating to a new server

Set up new AWS Lightsail server.
Change it so that it has a static IP address that persists across reboots
Ensure the private key for SSH access is as desired.
Deploy, and wait a few mins for Wordpress multi-site deployment to finish.
Install only the plugin called Updraft
Restore the backup that you created on source server, using Updraft, to new server.
When restoring, do not restore plugins. I don’t know if they cause a problem, but to rule it out, I did not migrate the plugins.
After restoration, I had this situation
- www.henklecosmetics.com was accessible at the expected URL without problems
- lowellpaincenter.com was not accessible. This site was called “52.2.160.149.nip.io”

Fixing the site name

Database updates

I believe there are only a few DB tables I need to care about. They are

wp_blogs
wp_site
wp_options
wp_%_options

Now, in the end I only edited 2 tables.

wp_site had only 1 entry. I left this alone.
wp_blogs had 3 entries, each with the same site_id but different blog_id. It was straightforward to see which one needed updating from 52.2.160.149.nip.io to lowellpaincenter.com
wp_options appeared to be the options table for the “main” site. I think that’s a default site. You can’t delete that site, so I left this table alone.
wp_6_options was the table that I updated. The 6 corresponds to the blog_id from the wp_blogs table. I updated the values for home and siteurl.

Config file updates

I believe I ended up making only 2x changes to the wp-config.php file. They were

define( 'DOMAIN_CURRENT_SITE', '52.2.160.149.nip.io' ); was changed to define( 'DOMAIN_CURRENT_SITE', 'lowellpaincenter.com' );
define( 'COOKIE_DOMAIN', $_SERVER['HTTP_HOST'] ); was added. This fixed the cookie error on login if I was trying to log in to admin panel from henklecosmetics and not from lowellpaincenter.

Restarted apache and things seemed to work.

Redirect rules

Around Sept 11, 2024, I updated the “old” server to accept any of the 4 variations of domain names:

henklecosmetics.com (which should redirect to www.henklecosmetics.com)
www.henklecosmetics.com
lowellpaincenter.com
www.lowellpaincenter.com (which should redirect browsers with a 301 or 302 to just lowellpaincenter.com)

All of these should work, whether HTTP or HTTPS, because my certificate is valid for all 4 domain names. Don’t ask me why one of the sites uses “www” and the other does not. I really don’t remember that.

In any case, I googled and learned how to do it with my new server, following the official documentation from Bitnami Wordpress packaged for AWS.

I followed these steps and created some files in this location: /opt/bitnami/apache/conf/vhosts

Other fixes

I still had to copy over the old TLS certs again, as outlined in the previous blog post.
I will still run the bncert tool in the future to ensure that future Lets Encrypt certs are deployed.

Conclusion

At this point, it looks like I’ve successfully migrated my Wordpress multi-site deployment between AWS Lightsail servers. I’ve managed to do this without paying for a commercial migration plugin that supports multi-site, but it did take me 2x late nights of troubleshooting Wordpress.

At least I’ve documented a handful of things in case I need to do this again. More importantly, the AWS account is owned by the website owner, so I can get this off my credit card.

Migrate Bitnami Wordpress between servers

2024-10-12T00:00:00+00:00

As I’ve blogged about before, I have a friend with 2x websites that I host on Bitnami Wordpress using AWS Lightsail. For about $5 USD /mth, I get a small EC2 instance, public IP address, and Bitnami Wordpress Multi-site pre-installed on the instance. I am yet to find a cheaper way to host a Wordpress site, certainly multiple Wordpress sites, that is still relatively robust.

Migrating

Background

I have been paying about $5 USD /mth for about 4 years, running 2x websites on this single EC2 instance. In addition, I pay about $20 /yr, per domain name, for domain registration. That’s $40 /yr (2 domain names) + around $60 /yr (monthly Lightsail costs), or $100 /yr that I’ve been paying for my friend’s small business website.

He would pay me back in an instant if I asked him, but it’s not just the cost that’s motivating me to migrate this. I also don’t want someone else’s small business website running with my personal AWS account anymore. If I get hit by a bus or just plain-old forget how to manage this, the website is at risk. I would rather it exist in his AWS account. He now has a couple employees that are capable of managing this.

Time to migrate!

Simple plan first: migrate like-for-like. Bitnami Wordpress on Amazon Lightsail, from 1 AWS account to another.

Long-term ideal plan: migrate this website away from Lightsail, on some fancy Wordpress deployment that is container-based and can run anywhere in a serverless, almost cost-free manner. I’m yet to find this ideal solution for next-to-free with almost-zero effort for migration.

Setting up for migration

I had my friend create a personal account on aws.amazon.com. He then gave me username and password.
I set up Lightsail instance.

As you can see from the screenshots below, there’s a few different choices for deploying an instance. You can deploy an instance with an OS only (eg Ubuntu, Windows, etc) or with an application installed (eg WordPress). I am choosing WordPress Multisite. In reality what you get is a Debian 6.1.106-3 instance that gets WordPress installed upon start up. I expect the underlying OS version and WP version to increment over time. As you can see, the lowest cost for this is currently $5 USD /mth.

Lightsail EC2 Instance options

Lightsail pricing for Linux-based instance

Broadly speaking you will follow the steps outlined here: Set up WordPress Multisite on Lightsail

Username, keys and passwords

Linux username is bitnami. This is configured by Lightsail.

When deploying, I uploaded the public key that matches the private key that I use to connect to many of my demo servers when using SSH. I will set up a password on the Debian OS when I can, and share it with my friend. I will keep my private key used for authentication in case he forgets his password.

I may lock down the IP ranges that can access the SSH instance. I may also disable SSH access via TCP/22 completely; Lightsail allows a user to connect to an instance’s command line via the Amazon web console, and I could always re-open TCP/22 if I want.

Wordpress default username is user. Default password is configured in a file on the instance. Run this command to retrieve it and access http://public-ip-address/wp-login.php

cat $HOME/bitnami_application_password

Setting up for migration

I ended up using a plugin to migrate my sites, and then troubleshooting when things slightly turned south. I read this article which reviewed some of the “best” plugins for migration. I didn’t realize it was published by the maker of the favorite plugin until I’d finished reading.

The problem? Each one required payment for multi-site Wordpress, which I am running. I ended up coming across something called Updraft which looked like it might do the trick with it’s free option. I can’t remember where I read it, but it looked like even though you are supposed to buy their premium product to get multi-site support, it may be do-able with the free version.

I installed this via the plugin marketplace on the source and destination instances. The backup on the source created 5x files, I downloaded them and uploaded them to the destination, and hit restore.

Migration and issues

The plugin worked, but I could not log into the site any longer. However, as far as I could tell, the content migrated okay. When I pointed my hosts file at the new server, the sites appeared to load correctly.

But I could not log in to them. I got this message about cookies not being enabled:

Annoying and misleading errors. I do have cookies enabled.

I figured that perhaps the passwords of the users did not get migrated correctly, so I followed this link to reset the passwords.

However, that wasn’t the problem. I still could not log in. Then I found this page that gave me a few ideas:

Disable plugins

I followed the first step and renamed /wp-content/plugins/ to /wp-content/plugins.hold and then restarted apache. I think this is supposed to disable plugins. Anyway, I still got the error from the screenshot above, so I renamed it back and moved on. This particular troubleshooting step was also suggested here.

Editing wp-config.php

I then followed the suggestion of adding this line to my wp-config.php file: define('COOKIE_DOMAIN', $_SERVER['HTTP_HOST'] );

I also restarted apache. That seemed to do it! I can now log into the admin console.

Troubleshooting SSL issues

The plugin did not move my SSL certificates. I just manually grabbed them off the old server at /opt/bitnami/apache2/conf and then copied the contents of the cert and key. Then I just overwrote the files on the new server at /opt/bitnami/apache/conf/bitnami/certs/server.crt and /opt/bitnami/apache/conf/bitnami/certs/server.key

The reason I did this was because I didn’t want to cut public DNS over to the new servers just yet, so I couldn’t use the cert tool with Let’s Encrypt. (I will go back and do this later. Good thing I documented it last time).

Long story short on TLS certs: every IT person needs to know the basics of TLS, handshakes, the cert tool and LetsEncrypt.

Network Admin dashboard

Finally, I think I broke something because the Network Admin dashboard would not work. Here’s a screenshot of what it would look like when I wanted to look at the dashboard for ALL sites (not just 1 of my 2 sites).

What happens here is the dashboard is trying to load a page at http://publicIP.nip.io which I don't control.

After googling the phrase “wordpress change hostname of network admin dashboard” - because I suspected that the hostname that shows up in the screenshot would need to be updated in settings or the DB - I found these 2 links here and here.

I think I can set the values of WP_HOME and WP_SITEURL in the wp-config.php file, OR the values of home and siteurl in the wp_options table in the DB. I liked the idea of doing it in the DB. So I made these mysql commands:

$ mysql -u root -p bitnami_wordpress -e "SELECT * FROM wp_options WHERE option_name = 'siteurl';"
mysql: Deprecated program name. It will be removed in a future release, use '/opt/bitnami/mariadb/bin/mariadb' instead
Enter password:
+-----------+-------------+-----------------------------+----------+
| option_id | option_name | option_value                | autoload |
+-----------+-------------+-----------------------------+----------+
|         1 | siteurl     | https://x.x.x.x.nip.io | yes      |
+-----------+-------------+-----------------------------+----------+
$ mysql -u root -p bitnami_wordpress -e "SELECT * FROM wp_options WHERE option_name = 'home';"
mysql: Deprecated program name. It will be removed in a future release, use '/opt/bitnami/mariadb/bin/mariadb' instead
Enter password:
+-----------+-------------+-----------------------------+----------+
| option_id | option_name | option_value                | autoload |
+-----------+-------------+-----------------------------+----------+
|         2 | home        | https://x.x.x.x.nip.io | yes      |
+-----------+-------------+-----------------------------+----------+

Then I saw the URL that would not load, so I updated the values:

mysql -u root -p bitnami_wordpress -e "UPDATE wp_options SET option_value = 'https://my-preferred-fqdn.com' WHERE option_name = 'home' OR option_name = 'siteurl';

This worked and updated the DB values, but didn’t seem to allow me to log in. So I updated the wp-config.php file with this:

define('WP_HOME','https://my-preferred-fqdn.com')
define('WP_SITEURL','https://my-preferred-fqdn.com')

Restarted apache, but still no luck. I’m giving up here. I will live with no Network Admin dashboard for now.

Tips

Like every time I deal with Wordpress, I got lost in version mis-matches, poor documentaton, a wild west of plugins and gotchas, like plugins that require a monthly fee charged annually to do something you may need just one time.

So here’s some general tips I will make note of, since I can’t seem to get a smooth and reliable procedural document written:

restart apace with this command: sudo /opt/bitnami/ctlscript.sh restart apache

reset wp-admin password:

mysql -u root -p bitnami_wordpress -e "SELECT * FROM wp_users;"
mysql -u root -p bitnami_wordpress -e "UPDATE wp_users SET user_pass=MD5('NEWPASSWORD') WHERE ID='ADMIN-ID';"

location of wp-config.php file. I never seem to know where it is. On my new server, it is here: /bitnami/wordpress/wp-config.php and there is also a symlink at /opt/bitnami/wordpress/wp-config.php

Quickly test UDP on XC with NTP

2024-10-07T00:00:00+00:00

Sometimes I want to test UDP connectivity, usually through F5 XC. Here’s a quick way to set up NTP using Ubuntu.

I basically copied this link for setting up NTP on client and server.

NTP Server

deployed Ubuntu 22.04 LTS

sudo apt update -y
sudo apt install ntp -y
sudo systemctl status ntp

NTP Client

sudo apt update -y
sudo apt install ntpdate -y

Test from client to server

ntpdate -q [ip-address-of-server]

F5 XC as UDP proxy and load balancer

To proxy this through F5 XC, you cannot use HTTP or TCP Load Balancers, obviously. You must create the equivalent with the Virtual Host objects:

Create Endpoint (IP address of NTP server)
Create Cluster (group of endpoints)
Create Route (send traffic to cluster)
Create Advertise Policy (listen on a given IP address on a CE, or “Virtual Network” and “vesi-io-shared/public” if you want to advertise to public Internet)
Create Virtual Host object to link all of these together.

Screenshot of Advertise Policy that will use default tenant IP on RE's

Notes

at this time it looks like you cannot have a custom VIP for internet-facing traffic for UDP traffic.
at this time it looks like Performance and Application Dashboards do not include UDP traffic.

Windows

I am not 100% sure but I think I used this tool to easily demo with a Windows desktop as the NTP client.

Deploying OpenShift with metal nodes

2024-09-19T00:00:00+00:00

Summary

In order to use OpenShift Virtualization, your nodes must support virtualization. If you’re running OpenShift in AWS, as I like to do when I deploy in a hurry, you will need to use bare metal nodes. That is very expensive, but cost can be reduced if you build a small cluster first and then add a single metal node for a small and quick PoC.

Deploying OpenShift

I do not claim expertise in OpenShift, but as a popular enterprise tool I need to be aware and capable with OpenShift. I’m considering OpenShift certification in future for this reason.

As I’ve written before, I typically deploy OpenShift in AWS, using the installer command line tool. This is Installer Provisioned Infrastructure (IPI), where the CLI tools builds my EC2 instances (they don’t pre-exist and I don’t have to build them myself.)

As you can see from my default install-config.yaml file, I will deploy 3 master nodes and 3 worker nodes. You can change worker nodes to 2 if you want to save a little more money.

KubeVirt and OpenShift Virtualization

I’m late to the game and learning about running VM’s on OpenShift, which requires that the node supports virtualization. Since the AWS EC2 instances do not, the solution is to deploy a cluster with metal nodes. But that is very expensive.

The answer is to use machinesets in OpenShift. Specifically, I found this nice article that spoke about OpenShift Virtualization on AWS, and how to reduce the cost of a PoC: deploy a cluster with regular EC2 instances and then an additional single bare metal node. Then remember to kill that metal node as soon as you’re done!

How to PoC OpenShift Virtualization in AWS

I’ll document my steps here so I can remember for future:

Deploy OpenShift cluster. You can scale back to 2x worker nodes (+3 master = 5 nodes total in the cluster.)

.\openshift-installer create cluster --dir=cluster --log-level=debug

After cluster is built, I personally see 2x machinesets when I get all machinesets:

(I have transposed the cluster id suffix with x’s)

$ oc get machinesets -n openshift-machine-api
NAME                                 DESIRED   CURRENT   READY   AVAILABLE   AGE
ocpcluster-xxxxx-worker-us-east-1a   1         1         1       1           27m
ocpcluster-xxxxx-worker-us-east-1b   1         1         1       1           27m

I also see my VM’s in the EC2 console:

5 total VM's belonging to this cluster

Then I follow the instructions in the article to create a new machineset by editing an existing machineset.

oc get machinesets -n openshift-machine-api ocpcluster-xxxxx-worker-us-east-1a -o yaml > machineset.yaml

After editing machineset.yaml, mine looks like this file. I apply the file, and a new machineset exists in Openshift (see the last in the list, with desired=0)

$ oc apply -f machineset.yaml
machineset.machine.openshift.io/ocpcluster-xxxxx-worker-us-east-1c created
$ oc get machineset -n openshift-machine-api
NAME                                 DESIRED   CURRENT   READY   AVAILABLE   AGE
ocpcluster-xxxxx-worker-us-east-1a   1         1         1       1           32m
ocpcluster-xxxxx-worker-us-east-1b   1         1         1       1           32m
ocpcluster-xxxxx-worker-us-east-1c   0         0                             19s

I then scale this machineset to 1, and sure enough a metal VM is built in AWS.

$ oc scale machineset ocpcluster-xxxxx-worker-us-east-1c -n openshift-machine-api --replicas=1
machineset.machine.openshift.io/ocpcluster-xxxxx-worker-us-east-1c scaled

There are now 6 VM's. Notice the additional node of type m5.metal

At first, the new EC2 instance is in a Provisioning state, but is not yet a node in OpenShift. A good 15 mins or so later, that metal VM is both a machine and a node.

### After applying the file:
$ oc get machines -n openshift-machine-api
NAME                                       PHASE         TYPE         REGION      ZONE         AGE
ocpcluster-xxxxx-master-0                  Running       m5.xlarge    us-east-1   us-east-1a   41m
ocpcluster-xxxxx-master-1                  Running       m5.xlarge    us-east-1   us-east-1b   41m
ocpcluster-xxxxx-master-2                  Running       m5.xlarge    us-east-1   us-east-1a   41m
ocpcluster-xxxxx-worker-us-east-1a-wnrbq   Running       m6i.xlarge   us-east-1   us-east-1a   37m
ocpcluster-xxxxx-worker-us-east-1b-45svt   Running       m6i.xlarge   us-east-1   us-east-1b   37m
ocpcluster-xxxxx-worker-us-east-1c-6n569   Provisioned   m5.metal     us-east-1   us-east-1c   9m3s
$ oc get nodes
NAME                           STATUS   ROLES                  AGE   VERSION
ip-10-0-128-9.ec2.internal     Ready    worker                 34m   v1.29.7+4510e9c
ip-10-0-136-153.ec2.internal   Ready    control-plane,master   41m   v1.29.7+4510e9c
ip-10-0-143-152.ec2.internal   Ready    control-plane,master   41m   v1.29.7+4510e9c
ip-10-0-144-135.ec2.internal   Ready    control-plane,master   41m   v1.29.7+4510e9c
ip-10-0-155-171.ec2.internal   Ready    worker                 33m   v1.29.7+4510e9c

### Notice that there are 6x provisioned machines, but 5x nodes. 1 of the machines is still being set up.
### After 15 mins or so, this machine will also be listed as a node. The bare metal instance must power up and then join the cluster as a node.

I can now follow other documentation to insall OpenShift Virtualization, and then continue back in our original article for instructions to deploy Fedora.

You should now have a VM running in OpenShift.

Accessing the VM

You will notice that accessing the VM is possible using virtctl, which is a tool you will need to download.

Documentation links:

You could also create a Service (ClusterIP, NodePort, or LB) but since I haven’t done this yet, I’ll leave it for another blog post.

Thanks for reading!

F5 CFE, private endpoints, and custom DNS

2024-09-04T00:00:00+00:00

Summary

When using the F5 Cloud Failover Extension (CFE) for API-based failover in public cloud, some customers disallow API calls out to the public Internet. Private endpoints solve for this constraint, but rely on DNS. If a customer is using custom DNS servers, a workaround may be required.

F5 CFE

The CFE is code that runs on top of BIG-IP to perform failover in public cloud. It makes a series of API calls to the cloud provider to move IP addresses between interfaces, update public IP mapping between private IP’s, or update route tables. It is supported in Azure, AWS, and Google Cloud (GCP).

Typical case of CFE: API calls to update cloud configuration at failover. Source.

Customer problem statement

My customer was using CFE to successfully move IP addresses between interfaces in Azure. They had followed default documentation and had a storage account that was publicly accessible, but only writeable by the BIG-IP VM’s in Azure. They decided to create a private endpoint for their storage account in Azure and block Internet access to their storage account.

CFE stopped working and reported “403 Forbidden” errors in logs. The issue was their use of custom DNS settings!

CFE in isolated environments

Because the API calls are made to the cloud provider, the destination of these calls is a public API endpoint. For example:

In Azure, storage API calls may be sent to https://xxx.blob.core.windows.net
In AWS, storage might be accessed at https://s3.us-east-2.amazonaws.com
In GCP, storage calls might be made to https://storage.googleapis.com

Since the BIG-IP is already running in the cloud provider, it’s possible to access these services without traversing the Internet. Private endpoints allow a VM to access services within these cloud providers directly, and they can be set up in Azure, AWS, and GCP.

Documentation exists to help F5 customers configure private endpoints in Azure and GCP. A DevCentral article also walks through this for AWS users.

Azure, Custom DNS, and CFE in isolated environments

My recent customer was running CFE in Azure, but this should apply to other cloud providers also.

Private endpoints work by using DNS. Once a private endpoint is configured, DNS entries are added for the networks that will use the private endpoint.

For example, if I have an Azure storage account called olearystorageacct, blobs in this storage account will be reachable at olearystorageacct.blob.core.windows.net. This might resolve to a public IP of 1.1.1.1. If I disable public Internet access to my Storage Account and create a private endpoint for it, VM’s in my VNET will now resolve this name to a private IP address from their own VNET, such as 10.1.1.5 in the diagram below.

An example diagram of a private endpoint for Azure storage account access.

However, this assumes VM’s are using the default DNS settings in the Azure VNET. When using custom DNS, VM’s will resolve DNS queries against configured servers. In the diagram below, 10.0.0.100 and 101 are IaaS DNS servers.

Example flow of DNS queries when custom DNS servers are used in VNET settings

Working with custom DNS settings and private endpoints

Custom DNS represents a problem here. The IaaS DNS servers will forward DNS queries out to Internet-based DNS servers for domains like blob.core.windows.net. Despite the existence of a private endpoint, VM’s using custom DNS may still resolve this record to a public IP address. How can we overcome this?

I quickly found others with the same problem, and read related Microsoft documentation that got my mental wheels turning.

I came up with this list of options for my customer, ordered from easiest to hardest in my opinion:

Set a hosts record on your BIG-IP device, so that olearystorageacct.blob.core.windows.net is manually set to the IP of the private endpoint, e.g. 10.1.1.5
Set the DNS server of BIG-IP to Azure’s DNS server at 168.63.129.16. However, this means every DNS request from this VM will use Azure’s DNS and not custom DNS servers.
Create a forwarding rule on the custom DNS server to forward requests for *.blob.core.windows.net to 168.63.129.16.
Create an A record on the custom DNS server: xxx.blob.core.windows.net could be set to 10.1.1.5
Lastly, a customer could always move away from custom DNS, and revert to default Azure settings for DNS. They can still use private DNS zones and Azure’s Private DNS Resolver. This is a larger architectural change of course, not a simple workaround.

My customer chose Option 2. CFE started working again! Their BIG-IP VM is using Azure’s DNS server and accessing the storage account via private endpoint, but the custom DNS servers are in place for all other VM’s following the VNET settings.

Conclusion

In retrospect, this was a simple solution. When using both private endpoints and custom DNS together, plan for how DNS resolution will work for given services. I hope this article helps if you find yourself in the same situation!

Let’s Encrypt cert automation for Bitnami Wordpress

2024-08-15T00:00:00+00:00

Summary

I am no Wordpress expert. I have found a Bitnami Wordpress image, running via AWS Lightsail, to be very cheap. I pay around $6/mth for a small EC2 instance, VPC, public IP address, all as part of a Lightsail deployment.

How to update SSL certs for Bitnami Wordpres

Log into AWS with personal account.
- username is my personal email address
- password is stored in vault
Navigate to Lightsail service, and see the public IP of the EC2 VM instance that’s running.
SSH to the VM.
- Username is bitnami.
- SSH key: mikeo-keypair-personal.ppk
Run sudo /opt/bitnami/bncert-tool
I had to then fill out the prompts with the URL’s that I wanted, comma-separated.
That should be it. After this, there should be a cron job that updates the certs every 90 days or so.

You can also manually use the cert tool to generate SSL certs and then put them in the correct directory.

Logging in via SSH to public IP of my EC2 instance

Wordpress Admin

This is something I have left up to the website owner, but for my own reference:

URL: https://the-domain-of-the-pain-clinic/wp-admin
username: henklecosmetics
password: [this one has been communicated to website admin]

There is another user, too:

URL: https://the-domain-of-the-pain-clinic/wp-admin
username: my personal email address
password: [this one has been communicated to website admin]

Documentation

This worked for me: https://docs.bitnami.com/aws/how-to/generate-install-lets-encrypt-ssl/

Screenshots

Screenshots of websites. No TLS warnings!

FTP server on K8s with F5

2024-07-30T00:00:00+00:00

Background

After 5 years honing Kubernetes expertise, I was happy to undertake a challenge: expose an FTP server from within Kubernetes, and protecting with F5 BIG-IP. I’ll do this using Azure Kubernetes Service (AKS) as an example environment.

Isn’t FTP a legacy technology? Yes, FTP has been around since the early 70’s. It was designed for efficient tranfer of files. Although it’s insecure by default, it’s still commonly seen today for some file transfer types.

Advantages of FTP

As opposed to other file transfer methods, FTP does offer a few advantages:

allows applications to resume file transfers if a connection is lost
allows for a queue of files to be uploadeded or downloaded
faster / more efficient than HTTP
no file size limitations
extremely common platform offered widely for many years

Disadvantages of FTP

FTP is considered legacy because of a few limitations. There’s workaround and enhancements, so I’ll give a basic overview:

insecure by default
- Standard FTP sends username, password, and files in clear text
- Servers can be spoofed to send data to the wrong client
complexity
- by default, the control channel is over TCP/21, but is usually configurable
- by default, the data channel is over a random high port, if using Passive mode, and is usually configurable
- by default, the data channel is over TCP/20, if using Active mode, and is usually configurable
active vs passive
- active mode requires the server to establish a connection to a client. This is an outdated model disallowed by most firewalls
- passive mode requires outbound connections over random high ports, which are also disallowed by most firewalls. Therefore, firewalls or L4 network devices (like BIG-IP) must be “ftp aware”

There are many more complex advantages and disadvantages, but to summarize: running and securing FTP servers requires knowledge of the protocols, not just general network and security knowledge.

FTPS and SFTP

FTP is common, but it’s also common to see enterprises also offer FTPS and SFTP. While they sound similar, these two are different protocols. FTPS allows for encryption of the control channel and (optionally) the data channel of FTP. SFTP adds file transfers upon the SSH protocol, meaning all transfers can happen over a single TCP connection on port 22.

There is some difficulty with all of these technologies. SFTP is difficult to proxy. FTPS requires additional knowledge on top of FTP, which itself is a learning curve for most folks.

Common FTP servers

For this reason, enterprise-level FTP servers are usually well built out with a customer support team, static IP addresses, commercial software, and support. Changes are usually slow and upgrades infrequent. File transfers themselves are usually frequent, large, and core to a business process. For example, large financial customers may have longstanding practices built around FTP with partners, and so require very high confidence in the redundancy, security, and support of their FTP systems.

In my most recent case, my customer was running IBM Sterling B2B Integrator installed Red Hat OpenShift. OpenShift was running on Azure (ARO). This means we had a commercial application running on an enterprise K8s distribution, which itself was running as a managed service in Azure. In my demonstration, I’m going to use a free FTP server, vsftp, and I will start by running on Azure Kubernetes Service (AKS).

Why run FTP on Kubernetes?

Kubernetes offers the same advantages for FTP applications as it does to others: scale, platform-agnostic, heavily automated deployment and operations, etc. But Kubernetes networking is a challenge for the average network engineer, and FTP applications are an additional challenge for the average engineer. FTP and K8s are not a likely combination, but it can be done!

How to run your FTP server on Kubernetes and still get enterprise-level protection

Service type of ClusterIP, NodePort, or LoadBalancer?

Broadly speaking, there’s three major types of services in Kubernetes¹: ClusterIP, NodePort, and LoadBalancer. Whichever type you choose to expose your FTP service, you’re going to have some things to consider.

3 types of LB services. Image source.

ClusterIP

Cluster IP is possible if your pods are directly routable by your external load balancer. With my AKS deployments, my pods are routable from the VNet without additional work.

If you’re using a CNI with an overlay network (which typically use tunnels like VXLAN or GENEVE or routing like BGP), then you’ll need to get your external loadbalancer integrated with the CNI, or use another method. I won’t go into the details of CNI integration here.

NodePort

NodePort builds on top of ClusterIP, and maps a high port from the NodePort range (30000-32767) to port exposed on the pod.

However, with FTP, we cannot have random port translation. Since the data channel port is assigned by the server, it will tell the client to connect on an expected port. That port must be available to the client, and correctly mapped back to the server.

The way to do this is to manually define our NodePort values in our service, and match them with the PASV ports from your FTP server. Ie.,

apiVersion: v1
kind: Service
metadata:
  name: my-ftp-service
spec:
  type: NodePort
  selector:
    app.kubernetes.io/name: MyApp
  ports:    
    - port: 21
      targetPort: 21
      # no need for a pre-determined port for the control channel
    - port: 30100
      targetPort: 30100
      nodePort: 30100 # notice here, we have our PASV port range within the NodePort default range, and we match them manually.
    - port: 30101
      targetPort: 30101
      nodePort: 30101
    # etc...

In reality, a NodePort service is a confusing option for exposing FTP services!

LoadBalancer

Creation of a LoadBalancer service builds upon NodePort. A controller will automatically configure a corresponding load balancer in the cloud.

In this case, I’m running in Azure but I do not want to use an Azure Load Balancer, so I’ll create a service of type LoadBalancer but also define loadBalancerClass so the Azure controller ignores this object. CIS will configure BIG-IP as the load balancer instead.

LoadBalancer is a superset of NodePort, which is itself a superset of ClusterIP.

Let’s deploy our cloud environment

In this demo, I’ll use a LoadBalancer service and deploy my CIS instance in cluster mode.

Build an Azure VNet with a few subnets.

SUBSCRIPTION_ID="your-subscription-id"
LOCATION=eastus2
RESOURCEGROUP=oleary-rg
CLUSTER=mycluster
VNET_NAME=my-vnet
# create vnet
az network vnet create --resource-group $RESOURCEGROUP --name $VNET_NAME --address-prefixes 10.0.0.0/16 --location $LOCATION
az network vnet subnet create --resource-group $RESOURCEGROUP --vnet-name $VNET_NAME --name worker-subnet --address-prefixes 10.0.2.0/23
az network vnet subnet create --resource-group $RESOURCEGROUP --vnet-name $VNET_NAME --name mgmt --address-prefixes 10.0.4.0/23
az network vnet subnet create --resource-group $RESOURCEGROUP --vnet-name $VNET_NAME --name external --address-prefixes 10.0.6.0/23

Now, deploy a pair of F5 BIG-IP devices into the VNET, where the network interfaces are in the subnets of mgmt, external, and worker-subnet.²

Then, deploy an AKS cluster with the nodes in the worker-subnet subnet.

# create AKS cluster
az aks create --resource-group $RESOURCEGROUP --name $CLUSTER --node-count 1 --generate-ssh-keys --network-plugin azure --service-cidr "172.16.0.0/24" --dns-service-ip "172.16.0.10" --vnet-subnet-id /subscriptions/$SUBSCRIPTION_ID/resourceGroups/$RESOURCEGROUP/providers/Microsoft.Network/virtualNetworks/$VNET_NAME/subnets/worker-subnet
# get kubeconfig file of AKS cluster
az aks get-credentials -n $CLUSTER -g $RESOURCEGROUP -f ~/.kube/config

Now, configure CIS in the cluster so that applications can be exposed from Kubernetes via BIG-IP.³

I’m going to use Cluster mode (not NodePort mode) in this example, but either will work.⁴

At this point, you’ll have an environment that looks like this:

Typical BIG-IP integration with K8s using CIS

Now, deploy your FTP server like this:

First, a namespace like this.
Second, a PersistentVolumeClaim like this.
Then, a Deployment like this to run a FTP server. Notice that our deployment defines several environment variables within our pods, which are used to set the PASV FTP ports and the FTP server address.
On the BIG-IP, create an iRule called /Common/ftp_ports that looks like this: when SERVER_CONNECTED { FTP::port 10000 10002 }
Now, create a policy object like this.
Finally, in order to have CIS create a VirtualServer on BIG-IP, a service of type LoadBalancer like this.

Testing our FTP application

I’m going to use a simple ftp commands at the linux command prompt. Below, I will connect to our FTP server with ftp -p 4.152.28.199, and then enter username and password when prompted. Then I will type ls in order to list the directory, which contains a single file test.txt. Finally, I will type bye to disconnect.

ubuntu@ubuntu-Virtual-Machine:~$ ftp -p 4.152.28.199
Connected to 4.152.28.199.
220 (vsFTPd 3.0.2)
Name (4.152.28.199:ubuntu): vsftp
331 Please specify the password.
Password:
230 Login successful.
Remote system type is UNIX.
Using binary mode to transfer files.
ftp> ls
229 Entering Extended Passive Mode (|||32002|)
150 Here comes the directory listing.
-rw-------    1 ftp      ftp             8 Aug 29 01:51 test.txt
226 Directory send OK.
ftp> bye
221 Goodbye.

Here’s a very short clip using a graphical tool, WinSCP, to demonstrate the same thing:

OpenShift vs other Kubernetes distributions

You may notice in my example above that I have used fauria/vsftp as the container image for my FTP server. This will work in a regular K8s distro (I’ve used AKS in my PoC). OpenShift will require additional resources, such as a Security Context Constraint (SCC), so I have not documented this here. Perhaps in a future article.

Conclusion

It is possible to:

run FTP applications in Kubernetes
expose FTP services outside of the cluster
integrate external services, like F5 BIG-IP, with the FTP traffic
run this in public cloud or with enterprise services like Azure Red Hat OpenShift

Most of this requires a skillset that covers legacy and modern technologies. If you undertake something like this, ensure you have a plan for high availability and commercial support. Thanks for reading!

There are more than 3 types of services in K8s, but understanding these 3 major types is key. ↩
For this step, I typically deploy an ARM template from F5, like this one: https://github.com/F5Networks/f5-azure-arm-templates-v2/tree/main/examples/failover ↩
I won’t detail installing CIS here, except to say that I defined pool-member-type as cluster and load-balancer-class as f5cis, to match the spec in my service. ↩
I find cluster mode easiest. If using a service of type NodePort, there are several differences. The PASV ports must be between 30000-32767, must manually match each NodePort they are assigned, and the FTP server must send the IP address of the K8s Node (not Pod) in the PASV response. CIS must have pool-member-type as nodeport ↩

Transparent load balancing in Azure, Part 2

2024-07-24T00:00:00+00:00

This article is intended to be a sequel to a previous article I read, but did not author, titled Transparent Load Balancing in Azure. If you read this article, you may see I’ve left a comment that briefly describes some necessary details left out of the original article.

This article covers an advanced scenario for running an Active/Standby BIG-IP pair behind Azure LB. Read and follow these instructions all of the following conditions are met:

You want to run 2x BIG-IP devices in Active/Standby configuration
You plan to use Azure LB to provide High Availability (HA), as opposed to other methods like DNS or the Cloud Failover Extension
Your intention is to have your BIG-IP’s Virtual Server ip address (VIP) be the same as the frontend ipconfig on the Azure LB.
You have checked the “Floating IP” checkbox on your Azure LB rule that forwards traffic to BIG-IP

Options for High Availability (HA) of Network Virtual Appliances (NVA’s) in Azure

This article does not cover the advantages and disadvantages of different methods to achieve HA in Azure. There are multiple approaches:

Azure Load Balancer. This approach uses a simple, L4 load balancer in Azure to disaggregate traffic across multiple devices.
- Active/Standby, Active/Active, or multiple standalone devices are options here.
- This article focuses on a scenario using Active/Standby BIG-IP devices behind Azure LB, when the “Floating IP” checkbox is checked on the Azure LB rule.
F5 Cloud Failover Extension (CFE). This automation approach uses software on the BIG-IP device to ensure High Availability across devices that are Active/Standby, without requiring Azure LB.
- It can move IP addresses between Azure network interfaces (emulating Gratuitous ARP that happens on-prem)
- It can update a route table to ensure the default route (0.0.0.0/0) for a subnet sends response traffic to remote clients back via the active BIG-IP.
- It can update a route table to point an entire CIDR block dedicated for VIPs at the active device. This is also referred to as an “alien range” approach.
DNS Load Balancing. A simple method for High Availability between devices is using DNS and multiple standalone appliances. However DNS load balancing within a local site is not common (although GSLB is a common practice across geographic regions)
Other approaches. Azure offers a Gateway Load Balancer that F5 supports but this requires advanced knowledge, and you might even consider BGP in some cloud scenarios. These are out of scope for this article.

Common Architecture of Azure LB with BIG-IP pair

The most common way to run Active/Standby BIG-IP devices behind Azure LB looks like the following diagram.

Notice that the BIG-IP devices are in Active/Standby mode, and Azure LB is simply a Layer-4 traffic disaggregator.

Taking a closer look at IP addressing with this common architecture

Let’s take a look at where we configure IP addresses based on the most common approach:

Notice that Destination NAT occurs by default at Azure LB, and at BIG-IP.

There’s a few things here that sometimes confuse the first-time cloud admin:

this solution requires 2x IP addresses for every Virtual Server on BIG-IP.
- You could create 2 separate VS’s, each with 1 IP address.
- You could also create a single VS with 2x IP’s using Shared Address lists, or your Virtual Address could be a /30 range. Either way, it’s different than what you’re accustomed to on-prem.
this means IP addresses will get used up twice as fast as we’re accustomed to multiple destination NAT’s can sometimes confuse app owners (although normally a network admin has no problem understanding this)

When and why to use Azure LB’s “Floating IP” option.

The floating IP checkbox on your Azure LB rule could be understood as telling Azure LB: “do not perform Destination NAT for this traffic”. This is similar to F5’s nPath (aka asymmetric, or Direct Server Return) architecture.

Notice that you can disable DNAT at Azure LB (and at F5 BIG-IP if desired).

Why would you configure as above?

No destination NAT at Azure LB can make overall IP addressing easier
1x IP Address on your BIG-IP VIP is more like on-prem config we are familiar with

How to configure BIG-IP when “Floating IP” is used

The previous option is an alternative approach, but it requires a semi-advanced workaround for Azure health checks. It’s important to understand this workaround and if you don’t, just stick with the common approach outlined first in this article.

If you use Floating IP with your Azure LB rule, health probes from Azure LB will target the primary ipconfig on the Azure NIC. In BIG-IP, that’s your Self IP. And your Self IP will always respond healthy if it is probed, evn on the Standby device (of course, port lockdown settings on Self IP’s must allow health checks).

Put another way: your Standby BIG-IP will respond as healthy to Azure LB, and Azure LB will send data plane traffic to it. This will cause problems, so we must make our Standby BIG-IP “unhealthy” in Azure LB.

Enter VIP targeting and iRules. Do this:

Create LB rule on Azure LB sending traffic to the primary ipconfig on the dataplane NIC on the BIG-IP devices.
Configure a health probe for this rule. A HTTP health check with default settings is fine.

Create an iRule on BIG-IP:

when HTTP_REQUEST {
HTTP::respond 200 content "device is active"
}

Create a VIP called /Common/unroutable_vip and give it an IP address of 255.255.255.254 and attach the iRule from the previous step. This VIP will only be reachable on the Active device, and is not routable from outside of BIG-IP.

Create another iRule:

when HTTP_REQUEST {
virtual /Common/unroutable_vip
}

On BIG-IP Device 1, create a VIP with the same IP addresses as the Self IP. This is allowed. Listen on port 80, add HTTP profile, and attach the above iRule.
On BIG-IP Device 2, notice that the VIP created in the above step is not sync’d to Device 2. Repeat the above step with a VIP created on the same IP address as the Self IP.

Now, your Azure LB will health check both devices, sending HTTP health checks to both devices and hitting the VIP’s you created on the Self IP’s. However, only the active device will successfully forward traffic to this VIP called “unroutable” and the Standby device will fail to do this. This means that Azure LB will believe that the Active is health and the Standby is down.

Don’t forget you’ll need the Enable IP Forwarding checkbox checked on your BIG-IP’s interfaces in the Azure portal.

Conclusion

If all of the above makes sense, feel free to use Azure LB’s Floating IP checkbox on your Azure LB rules so that you can have the benefits of our last diagram. They are operational benefits only (fewer IP addresses used, potentially easier to understand for operators) but there is no functional or performance benefit to this method (no performance benefits or features/functionality enabled).

Thanks for reading, and please ask questions via comments or message me directly using this website.

Michael’s tech blog

Achieving BIG-IP High Availability with Azure Route Server

HA using Azure Route Server

Concepts: Active/Standby vs Active/Active

Route Health Injection and Active/Standby mode

BGP configurations and Active/Active mode

How-to: HA with BIG-IP using Azure Route Server

1. Active/Standby: Advertising a VIP range from the Active device only

2. Active/Active: using BGP to advertise routes from both devices

Other Considerations

Conclusion

Related Articles

Migrate Bitnami Wordpress between servers, Part 3

AWS Lightsail Bitnami Wordpress and sending mail

A couple more things

Migrate Bitnami Wordpress between servers, Part 2

Migrating to a new server

Fixing the site name

Database updates

Config file updates

Redirect rules

Other fixes

Conclusion

Migrate Bitnami Wordpress between servers

Migrating

Background

Setting up for migration

Username, keys and passwords

Setting up for migration

Migration and issues

Troubleshooting wp-admin login issues

Disable plugins

Editing wp-config.php

Troubleshooting SSL issues

Network Admin dashboard

Tips

Quickly test UDP on XC with NTP

NTP Server

NTP Client

Test from client to server

F5 XC as UDP proxy and load balancer

Notes

Windows

Deploying OpenShift with metal nodes

Summary

Deploying OpenShift

KubeVirt and OpenShift Virtualization

How to PoC OpenShift Virtualization in AWS

Accessing the VM

F5 CFE, private endpoints, and custom DNS

Summary

F5 CFE

Customer problem statement

CFE in isolated environments

Azure, Custom DNS, and CFE in isolated environments

Working with custom DNS settings and private endpoints

Conclusion

Let’s Encrypt cert automation for Bitnami Wordpress

Summary

How to update SSL certs for Bitnami Wordpres

Wordpress Admin

Documentation

Screenshots

FTP server on K8s with F5

Background

Advantages of FTP

Disadvantages of FTP

FTPS and SFTP

Common FTP servers

Why run FTP on Kubernetes?

How to run your FTP server on Kubernetes and still get enterprise-level protection

Service type of ClusterIP, NodePort, or LoadBalancer?

ClusterIP

NodePort

LoadBalancer

Let’s deploy our cloud environment

Testing our FTP application

OpenShift vs other Kubernetes distributions

Conclusion

Transparent load balancing in Azure, Part 2