mirror/dsa-nagios.git
4 years agostart new release
Peter Palfrader [Mon, 20 May 2019 10:52:10 +0000 (12:52 +0200)]
start new release

4 years agorelease
Peter Palfrader [Mon, 20 May 2019 10:50:57 +0000 (12:50 +0200)]
release

4 years agoalso filter by cpu flags
Peter Palfrader [Mon, 20 May 2019 10:07:39 +0000 (12:07 +0200)]
also filter by cpu flags

4 years agoadd dsa-check-ucode-intel
Peter Palfrader [Mon, 20 May 2019 10:00:45 +0000 (12:00 +0200)]
add dsa-check-ucode-intel

4 years agodinis on buster
Peter Palfrader [Tue, 14 May 2019 15:28:11 +0000 (17:28 +0200)]
dinis on buster

4 years agogeo1 on buster
Peter Palfrader [Tue, 14 May 2019 14:31:39 +0000 (16:31 +0200)]
geo1 on buster

4 years agoarnold on stretch
Aurelien Jarno [Sat, 11 May 2019 08:50:17 +0000 (10:50 +0200)]
arnold on stretch

5 years agoswitch gw-ynic ip
Julien Cristau [Fri, 10 May 2019 08:37:17 +0000 (10:37 +0200)]
switch gw-ynic ip

The old one seems to have gone away.  Picking janetrouter2.york.ac.uk
which seems to be on the path and responds to pings.

5 years agodsa-check-timedatectl: buster adaptions
Peter Palfrader [Wed, 8 May 2019 09:06:44 +0000 (11:06 +0200)]
dsa-check-timedatectl: buster adaptions

5 years agoadd 209.87.16.45 (godard secondary ip)
Julien Cristau [Mon, 29 Apr 2019 07:38:42 +0000 (09:38 +0200)]
add 209.87.16.45 (godard secondary ip)

5 years agoReplace ganeti3 with ganeti-manda
Julien Cristau [Sun, 28 Apr 2019 12:20:34 +0000 (14:20 +0200)]
Replace ganeti3 with ganeti-manda

ganeti3 was the clementi/czerny cluster, and is gone

5 years agohartmann on buster
Aurelien Jarno [Tue, 16 Apr 2019 21:31:05 +0000 (23:31 +0200)]
hartmann on buster

5 years agobacula director is now run with -fP
Peter Palfrader [Fri, 12 Apr 2019 13:42:26 +0000 (15:42 +0200)]
bacula director is now run with -fP

5 years agoarm-arm-04 on buster
Aurelien Jarno [Mon, 8 Apr 2019 19:44:57 +0000 (21:44 +0200)]
arm-arm-04 on buster

5 years agolindsay on buster
Julien Cristau [Mon, 8 Apr 2019 16:16:40 +0000 (18:16 +0200)]
lindsay on buster

5 years agomipsel-aql-01 on buster
Aurelien Jarno [Sun, 7 Apr 2019 21:03:07 +0000 (23:03 +0200)]
mipsel-aql-01 on buster

5 years agozani on buster
Aurelien Jarno [Fri, 5 Apr 2019 21:02:33 +0000 (23:02 +0200)]
zani on buster

5 years agomips-sil-01 on buster
Aurelien Jarno [Fri, 5 Apr 2019 20:36:41 +0000 (22:36 +0200)]
mips-sil-01 on buster

5 years agoAdd manda VMs to systemd-timesyncd group
Aurelien Jarno [Wed, 3 Apr 2019 09:29:06 +0000 (11:29 +0200)]
Add manda VMs to systemd-timesyncd group

5 years agoppc64el-osuosl-01 on buster
Aurelien Jarno [Tue, 2 Apr 2019 20:19:25 +0000 (22:19 +0200)]
ppc64el-osuosl-01 on buster

5 years agoDecomission mirror-conova
Aurelien Jarno [Mon, 1 Apr 2019 09:11:59 +0000 (11:11 +0200)]
Decomission mirror-conova

5 years agoStart 118
Aurelien Jarno [Mon, 1 Apr 2019 08:09:31 +0000 (10:09 +0200)]
Start 118

5 years agoRelease 116
Aurelien Jarno [Mon, 1 Apr 2019 07:59:33 +0000 (09:59 +0200)]
Release 116

5 years agoAdd missing changelog entry
Aurelien Jarno [Mon, 1 Apr 2019 07:59:03 +0000 (09:59 +0200)]
Add missing changelog entry

5 years agoMove the SSL CA check with the other SSL checks and rename it
Aurelien Jarno [Mon, 1 Apr 2019 05:06:03 +0000 (07:06 +0200)]
Move the SSL CA check with the other SSL checks and rename it

5 years agoFix typos in previous commits
Aurelien Jarno [Sun, 31 Mar 2019 21:38:47 +0000 (23:38 +0200)]
Fix typos in previous commits

5 years agox86-csail-01 on buster
Aurelien Jarno [Sun, 31 Mar 2019 21:37:01 +0000 (23:37 +0200)]
x86-csail-01 on buster

5 years agoAdd a buster hostgroup
Aurelien Jarno [Sun, 31 Mar 2019 20:15:28 +0000 (22:15 +0200)]
Add a buster hostgroup

For now assume that anything applying to stretch also applies to buster

5 years agoRevert "Add a check for puppet client cert expiration"
Aurelien Jarno [Sun, 31 Mar 2019 20:06:43 +0000 (22:06 +0200)]
Revert "Add a check for puppet client cert expiration"

This reverts commit e88536af180f92dee0a035de9fce7e3b6ecf2bb8.

This would require a sudoers update for all hosts, not sure we really
want that.

5 years agoAdd a check for puppet client cert expiration
Aurelien Jarno [Sun, 31 Mar 2019 19:04:28 +0000 (21:04 +0200)]
Add a check for puppet client cert expiration

It has been noticed while regenerating the puppet CA certificate that a
few puppet client certificate were also about to expire. We didn't have
any check in nagios for that, but thanks to Heartbleed this has not been
an issue.

5 years agoCheck for Debian SMTP CA cert expiration
Aurelien Jarno [Sun, 31 Mar 2019 18:50:16 +0000 (20:50 +0200)]
Check for Debian SMTP CA cert expiration

Note that despite its name this CA is also used for at least SSL tunnels,
and PostgreSQL.

5 years agotrabaci has /srv
Aurelien Jarno [Sat, 23 Mar 2019 13:07:16 +0000 (14:07 +0100)]
trabaci has /srv

5 years agoAdd trabaci
Aurelien Jarno [Sat, 23 Mar 2019 12:53:40 +0000 (13:53 +0100)]
Add trabaci

5 years agoRemove drbd-hosts from czerny & clementi
Aurelien Jarno [Mon, 18 Mar 2019 09:30:32 +0000 (10:30 +0100)]
Remove drbd-hosts from czerny & clementi

They are being decomissionned and do not run VMs anymore.

5 years agoschmelzer doesn't do rsync on its main ip address
Julien Cristau [Sun, 17 Mar 2019 17:00:52 +0000 (18:00 +0100)]
schmelzer doesn't do rsync on its main ip address

5 years agomove mirror-conova's secondary IPs to schmelzer
Julien Cristau [Sun, 17 Mar 2019 16:27:46 +0000 (17:27 +0100)]
move mirror-conova's secondary IPs to schmelzer

5 years agoDecommission lully.d.o
Aurelien Jarno [Sun, 17 Mar 2019 12:45:13 +0000 (13:45 +0100)]
Decommission lully.d.o

Replaced by loghost-osuosl-01

5 years agodsa-check-packages: fix typo noticed by jwilk
Julien Cristau [Fri, 15 Mar 2019 22:06:46 +0000 (23:06 +0100)]
dsa-check-packages: fix typo noticed by jwilk

5 years agoCorrectly do the moszumanska removal from 838803e155d783697b4559012bde63eff478d8ed
Peter Palfrader [Sun, 10 Mar 2019 16:58:43 +0000 (17:58 +0100)]
Correctly do the moszumanska removal from 838803e155d783697b4559012bde63eff478d8ed

5 years agodsa-check-backuppg: Ignore lost+found directory
Peter Palfrader [Sun, 10 Mar 2019 16:57:04 +0000 (17:57 +0100)]
dsa-check-backuppg: Ignore lost+found directory

5 years agoschmelzer has apache and rsyncd running
Martin Zobel-Helas [Sun, 10 Mar 2019 11:52:13 +0000 (12:52 +0100)]
schmelzer has apache and rsyncd running

5 years agoAdd incomingmailrelayed2025 to smit
Aurelien Jarno [Fri, 8 Mar 2019 18:08:52 +0000 (19:08 +0100)]
Add incomingmailrelayed2025 to smit

5 years agoFix previous commit
Aurelien Jarno [Thu, 7 Mar 2019 21:09:30 +0000 (22:09 +0100)]
Fix previous commit

5 years agoAdd smit
Aurelien Jarno [Thu, 7 Mar 2019 20:54:51 +0000 (21:54 +0100)]
Add smit

5 years agoAdd schmelzer
Julien Cristau [Wed, 20 Feb 2019 16:01:51 +0000 (17:01 +0100)]
Add schmelzer

5 years agoAdapt timedatectl check for buster
Moritz Muehlenhoff [Wed, 13 Feb 2019 16:14:29 +0000 (17:14 +0100)]
Adapt timedatectl check for buster

In the systemd version in buster, the output format of timedatectl changed:
- "Network time on: yes" became "NTP service: active"
- "NTP synchronized: yes" became "System clock synchronized: yes"

Signed-off-by: Peter Palfrader <peter@palfrader.org>
5 years agoDecommission kantuser (RT#7583)
Julien Cristau [Sun, 17 Feb 2019 18:55:28 +0000 (19:55 +0100)]
Decommission kantuser (RT#7583)

5 years agoMove ppc64el-osuosl-01 to pijper
Aurelien Jarno [Sun, 3 Feb 2019 16:40:34 +0000 (17:40 +0100)]
Move ppc64el-osuosl-01 to pijper

5 years agoadd check for logs on loghost-osuosl-01
Julien Cristau [Mon, 28 Jan 2019 22:49:29 +0000 (23:49 +0100)]
add check for logs on loghost-osuosl-01

5 years agoadd loghost-osuosl-01
Julien Cristau [Mon, 28 Jan 2019 21:33:55 +0000 (22:33 +0100)]
add loghost-osuosl-01

5 years agoIgnore "Cache Battery 0 in controller 0 is Degraded" on wieck
Julien Cristau [Sun, 27 Jan 2019 10:04:09 +0000 (11:04 +0100)]
Ignore "Cache Battery 0 in controller 0 is Degraded" on wieck

5 years agoAdd checks for {www,wiki}.debconf.org
Julien Cristau [Thu, 17 Jan 2019 19:22:13 +0000 (20:22 +0100)]
Add checks for {www,wiki}.debconf.org

5 years agodsa-check-running-kernel: handle -unsigned packages
Peter Palfrader [Thu, 17 Jan 2019 11:55:02 +0000 (12:55 +0100)]
dsa-check-running-kernel: handle -unsigned packages

5 years agoIgnore cache battery warning on schumann
Julien Cristau [Thu, 10 Jan 2019 21:10:40 +0000 (22:10 +0100)]
Ignore cache battery warning on schumann

5 years agoRT#7513 Remove moszumanska
Tollef Fog Heen [Mon, 7 Jan 2019 20:53:17 +0000 (21:53 +0100)]
RT#7513 Remove moszumanska

5 years agodsa-check-hpssacli: ignore text after "active spare" from pd status
Julien Cristau [Sat, 15 Dec 2018 10:00:06 +0000 (11:00 +0100)]
dsa-check-hpssacli: ignore text after "active spare" from pd status

Rather than getting confused by

      physicaldrive 1I:1:4 (port 1I:box 1:bay 4, SAS, 146 GB, OK, active spare for 1I:1:1)

treat it the same as "active spare".

5 years agoCleanup dsa-update-unowned-file-status and dsa-update-unowned-file-status creation...
Peter Palfrader [Mon, 26 Nov 2018 13:25:08 +0000 (14:25 +0100)]
Cleanup dsa-update-unowned-file-status and dsa-update-unowned-file-status creation of statusdir

5 years agochangeloge entry
Peter Palfrader [Mon, 26 Nov 2018 13:23:29 +0000 (14:23 +0100)]
changeloge entry

5 years agoMerge remote-tracking branch 'waja/update-apt-statusdir'
Peter Palfrader [Mon, 26 Nov 2018 13:22:14 +0000 (14:22 +0100)]
Merge remote-tracking branch 'waja/update-apt-statusdir'

* waja/update-apt-statusdir:
  Create directory if not existing

5 years agoCreate directory if not existing
Jan Wagner [Mon, 26 Nov 2018 11:49:34 +0000 (12:49 +0100)]
Create directory if not existing

5 years agomanda-node0[34] have many processes
Julien Cristau [Thu, 22 Nov 2018 18:44:40 +0000 (19:44 +0100)]
manda-node0[34] have many processes

5 years agoAdd pijper
Julien Cristau [Mon, 19 Nov 2018 17:08:41 +0000 (18:08 +0100)]
Add pijper

5 years agoDraghi no longer has a /boot
Peter Palfrader [Sun, 18 Nov 2018 17:06:17 +0000 (18:06 +0100)]
Draghi no longer has a /boot

5 years agomanda-node0[34] run drbd
Peter Palfrader [Sun, 18 Nov 2018 12:42:23 +0000 (13:42 +0100)]
manda-node0[34] run drbd

5 years agoBlacklist openmanage battery probe on wieck and schumann
Julien Cristau [Tue, 13 Nov 2018 14:26:39 +0000 (15:26 +0100)]
Blacklist openmanage battery probe on wieck and schumann

5 years agoAdd hostgroup for new dell hosts
Julien Cristau [Wed, 7 Nov 2018 22:05:11 +0000 (23:05 +0100)]
Add hostgroup for new dell hosts

5 years agocheck if we can reach the peer on the backend network at new-manda
Peter Palfrader [Wed, 7 Nov 2018 17:16:02 +0000 (18:16 +0100)]
check if we can reach the peer on the backend network at new-manda

5 years agoadd manda-node03
Julien Cristau [Tue, 6 Nov 2018 22:26:36 +0000 (23:26 +0100)]
add manda-node03

5 years agoadd manda-node04
Julien Cristau [Tue, 6 Nov 2018 21:25:30 +0000 (22:25 +0100)]
add manda-node04

5 years agosibelius no longer runs postgresql
Julien Cristau [Thu, 1 Nov 2018 17:55:06 +0000 (18:55 +0100)]
sibelius no longer runs postgresql

5 years agodsa-check-zone-rrsig-expiration-many: fix use of uninitialized value in numeric gt (>)
Peter Palfrader [Thu, 25 Oct 2018 07:48:16 +0000 (09:48 +0200)]
dsa-check-zone-rrsig-expiration-many: fix use of uninitialized value in numeric gt (>)

We have a state count array, and we assign each state (ok, warn, etc.) a
nagios error code.  one of the states we use internally is "unsigned",
which is not an error but did not have an integer exit code.  Give it 0
now.

5 years agobendel ("heavy-postfix") also runs fail2ban
Peter Palfrader [Fri, 12 Oct 2018 09:13:11 +0000 (11:13 +0200)]
bendel ("heavy-postfix") also runs fail2ban

5 years agocheck if fail2ban is running where it should
Peter Palfrader [Wed, 10 Oct 2018 12:18:22 +0000 (14:18 +0200)]
check if fail2ban is running where it should

5 years agohandel will have an /srv soon
Peter Palfrader [Tue, 9 Oct 2018 17:38:57 +0000 (19:38 +0200)]
handel will have an /srv soon

5 years agoCheck if all unbound trust anchors are current
Peter Palfrader [Tue, 9 Oct 2018 07:45:58 +0000 (09:45 +0200)]
Check if all unbound trust anchors are current

5 years agoretire alioth hostgroup
Peter Palfrader [Tue, 9 Oct 2018 07:45:16 +0000 (09:45 +0200)]
retire alioth hostgroup

5 years agoAdd dsa-check-unbound-anchors
Peter Palfrader [Tue, 9 Oct 2018 07:42:17 +0000 (09:42 +0200)]
Add dsa-check-unbound-anchors

5 years agoconova-node*: ping our drbd/ganeti peer on the mgmt network
Peter Palfrader [Tue, 7 Aug 2018 07:14:21 +0000 (09:14 +0200)]
conova-node*: ping our drbd/ganeti peer on the mgmt network

5 years agomonitor drbd at conova
Peter Palfrader [Tue, 7 Aug 2018 06:49:13 +0000 (08:49 +0200)]
monitor drbd at conova

5 years agoDecommission powerpc-osuosl-01
Julien Cristau [Mon, 6 Aug 2018 16:29:30 +0000 (18:29 +0200)]
Decommission powerpc-osuosl-01

5 years agoRemove powerpc-unicamp-01
Julien Cristau [Mon, 6 Aug 2018 15:51:04 +0000 (17:51 +0200)]
Remove powerpc-unicamp-01

5 years agoretire hostgroup sparc
Peter Palfrader [Tue, 17 Jul 2018 12:53:53 +0000 (14:53 +0200)]
retire hostgroup sparc

5 years agoretire hostgroup wheezy
Peter Palfrader [Tue, 17 Jul 2018 12:51:22 +0000 (14:51 +0200)]
retire hostgroup wheezy

5 years agoretire hostgroup wheezy
Peter Palfrader [Tue, 17 Jul 2018 12:47:30 +0000 (14:47 +0200)]
retire hostgroup wheezy

5 years agoretire smetana
Peter Palfrader [Mon, 16 Jul 2018 12:18:18 +0000 (14:18 +0200)]
retire smetana

5 years agosw-raid on arm-arm-0[134]
Julien Cristau [Mon, 2 Jul 2018 18:15:20 +0000 (20:15 +0200)]
sw-raid on arm-arm-0[134]

5 years agounicamp renumbering
Julien Cristau [Fri, 29 Jun 2018 14:08:58 +0000 (16:08 +0200)]
unicamp renumbering

5 years agoremove parth, re: RT#7334
Peter Palfrader [Sun, 24 Jun 2018 21:21:58 +0000 (23:21 +0200)]
remove parth, re: RT#7334

5 years agodf -h checks on nfs client at lw
Peter Palfrader [Fri, 1 Jun 2018 16:51:29 +0000 (18:51 +0200)]
df -h checks on nfs client at lw

5 years agoremove most of the monitoring for moszumanska
Peter Palfrader [Thu, 31 May 2018 13:27:54 +0000 (15:27 +0200)]
remove most of the monitoring for moszumanska

5 years agonot this varnish process job on jessie
Peter Palfrader [Wed, 30 May 2018 12:21:29 +0000 (14:21 +0200)]
not this varnish process job on jessie

5 years agoboth pkgmirror-csail and sibelius run varnish
Peter Palfrader [Wed, 30 May 2018 09:18:14 +0000 (11:18 +0200)]
both pkgmirror-csail and sibelius run varnish

5 years agomonitor varnish, haproxy
Peter Palfrader [Wed, 30 May 2018 08:35:26 +0000 (10:35 +0200)]
monitor varnish, haproxy

5 years agomove lw0[78] to stretch
Peter Palfrader [Mon, 28 May 2018 22:03:56 +0000 (00:03 +0200)]
move lw0[78] to stretch

5 years agomove lw0[1234] to stretch
Peter Palfrader [Mon, 28 May 2018 20:26:11 +0000 (22:26 +0200)]
move lw0[1234] to stretch

6 years agokantuser has apache
Julien Cristau [Thu, 10 May 2018 14:02:43 +0000 (16:02 +0200)]
kantuser has apache

6 years agosallinen now runs apache
Julien Cristau [Mon, 30 Apr 2018 08:42:21 +0000 (10:42 +0200)]
sallinen now runs apache

6 years agoAdd kantuser
Julien Cristau [Tue, 24 Apr 2018 20:58:21 +0000 (23:58 +0300)]
Add kantuser

6 years agoadd grabbe
Peter Palfrader [Tue, 24 Apr 2018 20:50:10 +0000 (22:50 +0200)]
add grabbe

6 years agoRetire check for SSL certs living in puppet, they're all gone
Julien Cristau [Fri, 13 Apr 2018 11:30:51 +0000 (13:30 +0200)]
Retire check for SSL certs living in puppet, they're all gone