What to do if your podman leaks an IP address

A1kmm@lemmy.amxl.com · edit-2 2 months ago

What to do if your podman leaks an IP address

Justin@lemmy.jlh.name · edit-2 2 months ago

Good debugging!

I’m thinking that it’s best for production to use dynamic IP addresses, to avoid this kind of conflict. In the kubernetes space, all containers must have dynamic IP addresses, which are then tracked by a ebpf load balancer with a (somewhat) static IP.

poVoq@slrpnk.net · 2 months ago

Yeah, inside of Pods you can just use the container name and thus avoid hard-coding any IPs.

Justin@lemmy.jlh.name · edit-2 2 months ago

All containers in a pod share an IP, so you can just use localhost: https://www.baeldung.com/ops/kubernetes-pods-sidecar-containers

Between pods, the universal pattern is to add a Service for your pod(s), and just use the name of the service to connect to the pods the Service is tracking. Internally, the Service is a load-balancer, running on top of Kube-Proxy, or Cilium eBPF, and it tracks all the pods that match the correct labels. It also takes advantage of the Kubelet’s health checks to connect/disconnect dying pods. Kubedns/coredns resolves DNS names for all of the Services in the cluster, so you never have to use raw IP addresses in Kubernetes.

poVoq@slrpnk.net · 2 months ago

I was talking about Podman Pods. Sorry for not being clear.

Justin@lemmy.jlh.name · 2 months ago

ah ok

deafboy@lemmy.world · 2 months ago

Every time I feel like I finally get kubernetes, somebody surprises me by talking about very specific, complex modern technologies being used to do basically the same thing we’ve been doing for decades by simple tools.

And I always experience the same urge to re-grow a tail and climb a tree.

Justin@lemmy.jlh.name · edit-2 2 months ago

Oh definitely, everything in kubernetes can be explained (and implemented) with decades-old technology.

The reason why Kubernetes is so special is that it automates it all in a very standardized way. All the vendors come together and support a single API for management which is very easy to write automation for.

There’s standard, well-documented “wizards” for creating databases, load-balancers, firewalls, WAFs, reverse proxies, etc. And the management for your containers is extremely robust and extensive with features like automated replicas, health checks, self-healing, 10 different kinds of storage drivers, cpu/memory/disk/gpu allocation, and declarative mountable config files. All of that on top of an extremely secure and standardized API.

With regard for eBPF being used for load-balancers, the company who writes that software, Isovalent, is one of the main maintainers of eBPF in the kernel. A lot of it was written just to support their Kubernetes Cilium CNI. It’s used, mainly, so that you can have systems with hundreds or thousands of containers on a single node, each with their own IP address and firewall, etc. IPtables was used for this before. But it started hitting a performance bottleneck for many systems. Everything is automated for you and open-source, so all the ops engineers benefit from the development work of the Isovalent team.

It definitely moves fast, though. I go to kubecon every year, and every year there’s a whole new set of technologies that were written in the last year lol

Appoxo@lemmy.dbzer0.com · 2 months ago

At least when doing something non-important like lemmy.
For essential services or a non-redundant enviromment (e.g. LDAP/DNS and homelab/small businesses) I would still assign a second permanent network with something like macvlans.

Justin@lemmy.jlh.name · edit-2 2 months ago

Go all out and register your container IPs on your router with BGP 😁

https://codeberg.org/jlh/h5b/src/branch/main/argo/custom_applications/kube-system/cilium-bgp-policy.yaml

(This comment was sent over a route my automation created with BGP)

horse_battery_staple@lemmy.world · edit-2 2 months ago

Tangentially what’s your opinion on Traefik?

https://traefik.io/

JASN_DE@lemmy.world · 2 months ago

Works well for me, although with Docker. The labeling can be a bit non-intuitive at first, but it’s really solid.

Also, I simply love autodiscovery.

Justin@lemmy.jlh.name · 2 months ago

I’ve inherited it on production systems before, automated service discovery and certificate renewal is definitely what admins should have in 2025. I thought the label/annotation system it used on Docker had some ergonomics/documentation issues, but nothing serious.

It feels like it’s more meant for Docker/Podman though. On Kubernetes I use cert-manager and Gateway API+Project Contour. It does seem like Traefik has support for Gateway API too, so it’s probably a good choice for Kubernetes too?

horse_battery_staple@lemmy.world · edit-2 2 months ago

We’re thinking of moving to it from a custom coredns and flannel inplementation in a k3s 33 node cluster.

Justin@lemmy.jlh.name · 2 months ago

Ah, interesting. What kind of customization are you using CoreDNS for? If you don’t have Ingress/Gateway API for your HTTP traffic, Traefik is likely a good option for adopting it.

horse_battery_staple@lemmy.world · edit-2 2 months ago

Coredns and an nginx reverse proxy are handling DNS, failover, and some other redirect. However it’s not ideal as it’s a custom implementation a previous engineer setup.

Justin@lemmy.jlh.name · edit-2 2 months ago

Ah, but your dns discovery and fail over isn’t using the built-in kubernetes Services? Is the nginx using Ingress-nginx or is it custom?

I would definitely look into Ingress or api-gateway, as these are two standards that the kubernetes developers are promoting for reverse proxies. Ingress is older and has more features for things like authentication, but API Gateway is more portable. Both APIs are implemented by a number of implementations, like Nginx, Traefik, Istio, and Project Contour.

It may also be worth creating a second Kubernetes cluster if you’re going to be migrating all the services. Flannel is quite old, and there are newer CNIs like Cilium that offer a lot more features like ebpf, ipv6, Wireguard, tracing, etc. (Cilium’s implementation of the Gateway API is bugger than other implementations though) Cillium is shaping up to be the new standard networking plugin for Kubernetes, and even Red Hat and AWS are starting to adopt it over their proprietary CNIs.

If you guys are in Europe and are looking for consultants, I freelance, and my employer also has a lot of Kubernetes consulting expertise.

horse_battery_staple@lemmy.world · 2 months ago

It’s a custom nginx proxy to the kube api. Too long to get into it. I was hired to move this giant cluster that started as a lab and make it production ready.

Thanks for the feedback

MangoPenguin@lemmy.blahaj.zone · 2 months ago

Neat bit of work finding the issue.

What’s the advantage of static container IPs? I’ve never thought about that in all my time using docker.

atzanteol · 2 months ago

Had you tried ‘podman rm -f containerid’?

A1kmm@lemmy.amxl.com · 2 months ago

I believe nothing in the podman rm family worked because the container was already gone - it was just the IP allocation that was left.

Possibly linux@lemmy.zip · 2 months ago

Podman really isn’t well suited for production workloads. It is nice for simple things but it has a habit of blowing up.

Ideally you should have some sort of cluster with health checking.

kylian0087@lemmy.dbzer0.com · 2 months ago

Kubernetes comes to mind for that

Possibly linux@lemmy.zip · edit-2 2 months ago

Kubernetes is a mixed bag. It is extremely powerful but its complexity tends to scare people away. The biggest issue with Kubernetes is that it can become the source of failure when done incorrectly.

I don’t really have a good alternative. I have investigated pacemaker but it has its own challenges.

For now it is probably best to just setup shared storage and then manually start containers on a host. The idea is that having multiple hosts allows for faster recovery. You still can have health checking per host so that containers get restarted as needed.

kylian0087@lemmy.dbzer0.com · edit-2 2 months ago

To simply running Kubernetes at least at home. Take a look at Talos OS. It is build for Kubernetes. But I totally agree it still isn’t for the faint of hard.

FrederikNJS@lemm.ee · 2 months ago

Agreed, Talos or k3s are great for home clusters