Commit graph

35 commits

Author SHA1 Message Date
6b4285a264
Let alertmanager run as its own user
It's already not-root, and can't access the filesystem anyway
2023-11-01 22:13:37 +00:00
54e2205e48
Don't bother renaming speedtest metrics
Some checks failed
/ terraform (push) Successful in 32s
/ ansible (push) Failing after 1m20s
2023-10-23 22:09:25 +01:00
c29dfb5ad2
Add hostname label for blackbox
Some checks failed
/ terraform (push) Successful in 37s
/ ansible (push) Failing after 1m22s
2023-10-23 21:06:43 +01:00
e56ffa576f
Deploy vikunja
All checks were successful
/ terraform (push) Successful in 1m2s
/ ansible (push) Successful in 2m32s
2023-09-07 20:18:32 +01:00
b07d424d87
Move remaining DBs to SSD
All checks were successful
/ ansible (push) Successful in 1m43s
/ terraform (push) Successful in 42s
2023-06-15 21:18:50 +01:00
2af9f8529d
Fix new ansible-lint errors
All checks were successful
/ terraform (push) Successful in 46s
/ ansible (push) Successful in 1m53s
Quite a few changes here, hopefully they work!
2023-06-15 15:16:19 +01:00
f14b16ec26
Monitor YOURLS 2023-04-23 20:24:55 +01:00
fcf0a373c3
Monitor gitea
Some checks failed
terraform
ansible
2023-04-06 20:31:57 +01:00
d52f0bc8b2
Remove gitlab configuration 2023-03-30 22:56:35 +01:00
2f3958b3e6
Reduce speedtest interval
I don't need it to run that often
2023-02-28 08:40:26 +00:00
11eb833f44
Add some prometheus alerts 2023-02-19 18:31:52 +00:00
f2f9e194c5
Install node-exporter on pi-hole 2023-01-31 23:10:31 +00:00
e4fba8208e
Add speedtest exporter 2023-01-31 21:45:59 +00:00
051b5b0e19
Add ICMP checks 2023-01-31 14:29:27 +00:00
ddc91059b5
Replace telegraf with prometheus exporters
Still missing SMART and ping, but those can come later.

Swapped as the polling model for prometheus doesn't play especially well with telegraf, and leads to gaps in data
2023-01-31 09:11:09 +00:00
e86ed81102
Decommission decker server
It's been replaced by prometheus running internally, and uptimerobot checking.
2022-12-09 19:04:54 +00:00
f1c9e10d9a
Use IPv4 on blackbox
The docker networks they're running on are v4 only at the moment, so the DNS resolution passes by the connection fails.
2022-12-06 20:46:43 +00:00
bb4361e25f
Use a blackbox with custom DNS rather than proxy
This makes timing information more relevant and should improve reliability
2022-12-06 10:04:23 +00:00
772cf7536a
Run prometheus stack as docker user 2022-12-05 21:17:23 +00:00
53c969f2b8
Make prometheus config files read only 2022-12-05 21:12:55 +00:00
3e5b00d176
Add alertmanager 2022-12-05 21:10:41 +00:00
60514d191a
Exclude python stats for HA 2022-12-05 20:36:57 +00:00
ed7faf4678
Scrape healthchecks.io with prometheus and use it for outbound monitoring of prometheus scrapes 2022-12-05 14:41:48 +00:00
8e3b381b8d
Remove unused prometheus network 2022-12-04 22:12:55 +00:00
75862dbaf9
Setup blackbox for some HTTP monitoring 2022-12-04 22:11:49 +00:00
c7d5056f2a
Scrape homeassistant metrics into prometheus 2022-09-06 21:29:11 +01:00
8b21b9e6f2
Monitor decker Traefik with prometheus 2022-08-14 15:04:55 +01:00
edc5c325b7
Correctly check hostname against PVE hosts
Some of the hostnames have `-` in instead, which caused issues with the SSH config detecting which users to allow
2021-08-23 19:56:04 +01:00
d59e86a8e8
Remove unnecessary private_ip var from forrest
It was redundant
2021-06-03 20:47:33 +01:00
f1f2c620b0
Replace DHCP on PVE for static IPs
So much easier to deal with!
2021-06-03 20:47:08 +01:00
407e59ec5a
Use variables in prometheus config
Also replace grimes with walker
2021-05-05 18:12:42 +01:00
1c424cb2ef
Update some IP addresses
I really need to stop using these external addresses somewhen...
2021-04-07 22:11:24 +01:00
1d997d3c33
Remove separate private and protected IP 2021-03-27 18:42:06 +00:00
6b95b75fc2
Move telegraf to host
This makes metric collection for SMART much simpler. I'll still be using the prometheus node exporter for actual system metrics, though.
2021-03-05 20:39:11 +00:00
89dbbc71e5
Move files into application directories 2021-03-05 14:40:17 +00:00