• tiredofsametab@fedia.io
    link
    fedilink
    arrow-up
    7
    ·
    2 months ago

    There are a few options off the top off my head

    1. the route/connection between the monitoring and you is OK; between the company and the monitoring is OK; but between you and the company is not OK – this means that, so far as the monitoring can tell, the site is up.
    2. The status checks run at some interval and you’re hitting it before that interval
    3. There’s some threshold of errors that needs to happen first so tiny hiccups don’t register as full-blown outages.
    4. the monitoring/metrics are poorly-designed

    There are probably other cases. I don’t know the architecture in this case, so I won’t speculate at any others.

    • AngryPancake
      link
      fedilink
      arrow-up
      3
      ·
      2 months ago

      Maybe it’s not automated and whoever is responsible isn’t awake yet.

      • tiredofsametab@fedia.io
        link
        fedilink
        arrow-up
        2
        ·
        2 months ago

        Yeah, I thought of that a few minutes later and was too lazy to edit. I also thought maybe it’s semi-auto and someone needs to verify it manually before allowing the UI to show