Commit graph

59 commits

Author SHA1 Message Date
f4b96afcfa
Deploy ntfy
All checks were successful
/ terraform (push) Successful in 1m15s
/ ansible (push) Successful in 2m22s
2023-12-23 16:40:53 +00:00
b33e19e152
Remove unnecessary extra variable definitions
The world could do with a bit less YAML!
2023-12-14 22:03:23 +00:00
e4b2318c82
Monitor authentik
All checks were successful
/ terraform (push) Successful in 37s
/ ansible (push) Successful in 1m42s
2023-11-12 21:25:02 +00:00
dfef31cbfa
Deploy minio
My own S3, for various things
2023-11-12 21:23:54 +00:00
b0347fc037
Remove redundant quotes 2023-11-05 21:43:02 +00:00
b1284877a3
Update blackbox configuration for not following redirects
Some checks failed
/ terraform (push) Successful in 30s
/ ansible (push) Failing after 1m23s
2023-11-01 22:14:35 +00:00
6b4285a264
Let alertmanager run as its own user
It's already not-root, and can't access the filesystem anyway
2023-11-01 22:13:37 +00:00
54e2205e48
Don't bother renaming speedtest metrics
Some checks failed
/ terraform (push) Successful in 32s
/ ansible (push) Failing after 1m20s
2023-10-23 22:09:25 +01:00
c29dfb5ad2
Add hostname label for blackbox
Some checks failed
/ terraform (push) Successful in 37s
/ ansible (push) Failing after 1m22s
2023-10-23 21:06:43 +01:00
e56ffa576f
Deploy vikunja
All checks were successful
/ terraform (push) Successful in 1m2s
/ ansible (push) Successful in 2m32s
2023-09-07 20:18:32 +01:00
b07d424d87
Move remaining DBs to SSD
All checks were successful
/ ansible (push) Successful in 1m43s
/ terraform (push) Successful in 42s
2023-06-15 21:18:50 +01:00
2af9f8529d
Fix new ansible-lint errors
Quite a few changes here, hopefully they work!
All checks were successful
/ terraform (push) Successful in 46s
/ ansible (push) Successful in 1m53s
2023-06-15 15:16:19 +01:00
f14b16ec26
Monitor YOURLS 2023-04-23 20:24:55 +01:00
fcf0a373c3
Monitor gitea
Some checks failed
terraform
ansible
2023-04-06 20:31:57 +01:00
d52f0bc8b2
Remove gitlab configuration 2023-03-30 22:56:35 +01:00
2f3958b3e6
Reduce speedtest interval
I don't need it to run that often
2023-02-28 08:40:26 +00:00
11eb833f44
Add some prometheus alerts 2023-02-19 18:31:52 +00:00
f2f9e194c5
Install node-exporter on pi-hole 2023-01-31 23:10:31 +00:00
e4fba8208e
Add speedtest exporter 2023-01-31 21:45:59 +00:00
051b5b0e19
Add ICMP checks 2023-01-31 14:29:27 +00:00
ddc91059b5
Replace telegraf with prometheus exporters
Still missing SMART and ping, but those can come later.

Swapped because the polling model for prometheus doesn't play especially well with telegraf, which leads to gaps in data
2023-01-31 09:11:09 +00:00
e86ed81102
Decommission decker server
It's been replaced by prometheus running internally, and uptimerobot checking.
2022-12-09 19:04:54 +00:00
f1c9e10d9a
Use IPv4 on blackbox
The docker networks they're running on are v4 only at the moment, so the DNS resolution passes but the connection fails.
2022-12-06 20:46:43 +00:00
bb4361e25f
Use a blackbox with custom DNS rather than proxy
This makes timing information more relevant and should improve reliability
2022-12-06 10:04:23 +00:00
772cf7536a
Run prometheus stack as docker user 2022-12-05 21:17:23 +00:00
53c969f2b8
Make prometheus config files read only 2022-12-05 21:12:55 +00:00
3e5b00d176
Add alertmanager 2022-12-05 21:10:41 +00:00
60514d191a
Exclude python stats for HA 2022-12-05 20:36:57 +00:00
ed7faf4678
Scrape healthchecks.io with prometheus and use it for outbound monitoring of prometheus scrapes 2022-12-05 14:41:48 +00:00
8e3b381b8d
Remove unused prometheus network 2022-12-04 22:12:55 +00:00
75862dbaf9
Set up blackbox for some HTTP monitoring 2022-12-04 22:11:49 +00:00
ebfe057658
Add support for public dashboards in Grafana 2022-12-01 13:48:36 +00:00
4f0ee78b7e
Update grafana to use mailgun 2022-09-18 15:27:58 +01:00
c7d5056f2a
Scrape homeassistant metrics into prometheus 2022-09-06 21:29:11 +01:00
8b21b9e6f2
Monitor decker Traefik with prometheus 2022-08-14 15:04:55 +01:00
c54ce7b209
For migrate grafana plugins 2022-06-13 21:52:40 +01:00
c5215e330b
Update yamllint to fix dependency issue
I think this still validates everything we need it to
2022-01-11 20:51:12 +00:00
da41fcd7bc
Update grafana DB to postgres 14 2021-12-22 13:10:06 +00:00
edc5c325b7
Correctly check hostname against PVE hosts
Some of the hostnames have `-` in instead, which caused issues with the SSH config detecting which users to allow
2021-08-23 19:56:04 +01:00
3c3f69a776
Remove unnecessary influxdb instance
Never used it anyway
2021-06-03 20:50:54 +01:00
d59e86a8e8
Remove unnecessary private_ip var from forrest
It was redundant
2021-06-03 20:47:33 +01:00
f1f2c620b0
Replace DHCP on PVE with static IPs
So much easier to deal with!
2021-06-03 20:47:08 +01:00
407e59ec5a
Use variables in prometheus config
Also replace grimes with walker
2021-05-05 18:12:42 +01:00
1c424cb2ef
Update some IP addresses
I really need to stop using these external addresses somewhen...
2021-04-07 22:11:24 +01:00
1d997d3c33
Remove separate private and protected IP 2021-03-27 18:42:06 +00:00
6b95b75fc2
Move telegraf to host
This makes metric collection for SMART much simpler. I'll still be using the prometheus node exporter for actual system metrics, though.
2021-03-05 20:39:11 +00:00
89dbbc71e5
Move files into application directories 2021-03-05 14:40:17 +00:00
8e977edba1
Ignore go metrics 2021-03-05 14:27:33 +00:00
b264e5cbcc
Monitor traefik with prometheus rather than influxdb 2021-03-04 16:37:53 +00:00
2e05ed08fa
Use hostname rather than fqdn 2021-03-04 16:06:43 +00:00