thecamels-zabbix/README.md

299 lines
8.4 KiB
Markdown
Raw Normal View History

2014-08-19 08:38:12 +02:00
Zabbix
2013-05-23 08:25:18 +02:00
======
2016-05-02 09:24:36 +02:00
Script and templates for Zabbix 2.2.x, 2.4.x, 3.0.x
2014-08-11 14:22:34 +02:00
2015-10-13 09:12:43 +02:00
- In bin you will find Bash/Perl/PHP scripts used by some User Parameters (need to be installed on agent)
- In sudoers.d you can find settings for sudo
2014-08-11 14:22:34 +02:00
- In Templates there are XML files ready to import using Zabbix GUI
2015-10-13 09:12:43 +02:00
- In zabbix_agentd.conf.d there are custom UserParameter (need to be installed on agent)
2013-05-23 08:25:18 +02:00
2015-10-13 09:12:43 +02:00
Templates was tested on Red Hat 5.x, 6.x and CentOS 5.x, 6.x. Common UserParameter were added to ```zabbix_agentd.conf.d/linux.conf``` file. Please add it to your own Zabbix Agent installation. Sometimes you need to use ```sudo``` for ```UserParameter```. All rules are in file ```sudoers.d/zabbix```.
Please let us know if you have any questions or concerns.
2014-08-11 14:22:34 +02:00
2014-08-19 08:38:12 +02:00
The Camels Team
http://thecamels.org
Template App APC
======
Monitoring memory usage of APC (http://pecl.php.net/package/APC) module. File ```bin/apc-stats.php``` need to be accessed via HTTP for example http://127.0.0.1/apc-stats.php
2014-08-19 08:38:12 +02:00
2015-08-12 09:14:50 +02:00
Template App Amavisd
======
Monitoring for Amavisd service.
Template App BIND
======
Monitoring for BIND (DNS server) service. Checking also version of BIND.
Template App Clamav
======
Monitoring for ClamAV.
Template App Dovecot
======
Monitoring for Dovecot.
Template App Etherpad
======
Monitoring for Etherpad (http://etherpad.org/)
Template App Exim
=====
Monitoring for mail server Exim.
2016-05-02 09:24:36 +02:00
Template App Rserve
=====
Monitoring for RServer in R.
2015-08-12 09:14:50 +02:00
Template App lm_sensors
=====
Monitoring CPU and MotherBoard temperatures by lm_senros module.
Template App mdadm
======
Monitoring mdadm arrays. Checking number of corrupted disk arrays.
Template App MySQL
=====
Monitoring for MySQL 5.5, 5.6 and 5.7. It is using PHP-cli for monitoring. You need also add zabbix user to database. Please run SQL query:
```
CREATE USER 'zabbix'@'localhost' IDENTIFIED BY 'ha7jqnlacwefrs';
GRANT REPLICATION CLIENT, SELECT, PROCESS, SHOW DATABASES ON *.* TO 'zabbix'@'localhost' IDENTIFIED BY 'ha7jqnlacwefrs';
FLUSH PRIVILEGES;
```
If you want change password, you need to edit files ```.my.cnf``` and also in file ```bin/ss_get_mysql_stats.php```.
2015-08-12 09:26:09 +02:00
Template App MySQL Slave
=====
Monitoring for replication in MySQL 5.5, 5.6 and 5.7. Please run SQL query:
```
CREATE USER 'zabbix'@'localhost' IDENTIFIED BY 'ha7jqnlacwefrs';
GRANT REPLICATION CLIENT, SELECT, PROCESS, SHOW DATABASES ON *.* TO 'zabbix'@'localhost' IDENTIFIED BY 'ha7jqnlacwefrs';
FLUSH PRIVILEGES;
```
If you want change password, you need to edit files ```.my.cnf```.
Template App Nginx
======
Monitoring for Nginx. It is using script ```nginx-check.sh``` written by Vincent Viallet.
2014-11-22 22:58:06 +01:00
Template App Brocade HBA
======
2014-11-22 23:41:38 +01:00
Monitoring for Network Adapters - Brocade. Template is using Discovery to create Items and Triggers. You need also add SUDO for zabbix user: ```zabbix ALL=(ALL) NOPASSWD: /usr/bin/bcu```
2014-11-22 22:58:06 +01:00
Template App OpenDKIM
======
Monitoring for OpenDKIM.
Template App Postfix
======
Monitoring for mail server Postfix.
Template App Pure-FTPd
======
Monitoring for Pure-FTPd.
2016-01-28 15:21:54 +01:00
Template App Redis
======
Monitoring for redis server 3.x. Support for redis cluster.
Template App Spamassassin
======
Monitoring for Spamassassin.
Template App vsftpd
======
Monitoring for vsftpd.
2014-11-22 22:58:06 +01:00
Template App Nscd
======
Nscd is a daemon that provides a cache for the most common name service requests. Nscd provides caching for accesses of the passwd(5), group(5), and hosts(5) databases through standard libc interfaces, such as getpwnam(3), getpwuid(3), getgrnam(3), getgrgid(3), gethostbyname(3), and others.
You need also add SUDO for zabbix user: ```zabbix ALL=(ALL) NOPASSWD: /usr/sbin/nscd -g```
2014-08-19 08:38:12 +02:00
Template App OPcache
======
Monitoring memory usage of OPcache (http://php.net/manual/en/book.opcache.php). File ```bin/opcache.php``` need to be accessed via HTTP for example http://127.0.0.1/opcache.php. Curl need to be installed on server.
2014-08-19 08:38:12 +02:00
Template App RabbitMQ
======
Monitoring RabbitMQ (http://www.rabbitmq.com/) basic parameters like queues, exchanges and memory usage. You need install PHP on server for monitoring RabbitMQ. You need also add SUDO for zabbix user:
```
zabbix ALL=(ALL) NOPASSWD: /usr/sbin/rabbitmqctl
zabbix ALL=(ALL) NOPASSWD: /usr/bin/php /etc/zabbix/bin/rabbit.php
```
2014-08-19 08:38:12 +02:00
Template App cPanel
======
Monitoring basic WHM/cPanel services.
Template Device BigIP F5
======
Monitoring of F5 BigIP network load balancer. It uses SNMP items to monitor basic device parameters (CPU/RAM usage, hardware failure, global traffic) and also it discovers network interfaces, storage, virtual servers and pools.
2014-08-19 08:38:12 +02:00
Template Device Cisco ASA
======
Monitoring of Cisco ASA firewall. It uses SNMP items to monitor basic device parameters (CPU/RAM usage, hardware failure, global traffic) and also it discovers network interfaces. It requires manual addition of value mappings (Administration -> General -> Value Mapping)
```
Cisco ASA Failover status
2014-11-22 22:25:32 +01:00
9 - Active
10 - Standby
Cisco Interface Status
2014-11-22 22:25:32 +01:00
1 - up
2 - down
```
2014-08-19 08:38:12 +02:00
Template Device NetApp
======
Monitoring of NetApp dedicated NAS disk arrays. It uses SNMP items to monitor basic device parameters (CPU/RAM usage, disk status, hardware failure, global traffic) and also it discovers snap mirrors volumens (more discovery rules in the future). It requires manual addition of value mappings (Administration -> General -> Value Mapping)
```
NetAppcfInterconnectStatus
1 - not present
2 - down
3 - not present
4 - up
NetAppcfPartnerStatus
1 - maybe down
2 - ok
3 - dead
NetAppCfSetting
1 - not configured
2 - enabled
3 - disabled
4 - takeover by partner disabled
5 - dead
NetAppCfState
1 - dead
2 - can takeover
3 - cannot takeover
4 - takeover
NetAppFcpTgtStatus
1 - startup
2 - uninitialized
3 - initializingFW
4 - linkNotConnected
5 - waitingForLinkUp
6 - online
7 - linkDisconnected
8 - resetting
9 - offline
10 - offlinedByUserSystem
11 - unknown
NetAppFsStatus
1 - ok
2 - nearlyFull
3 - full
NetAppGlobalStatus
1 - other
2 - unknown
3 - ok
4 - non critical
5 - critical
6 - non recoverable
NetAppLunOnline
1 - false
2 - true
NetAppNvramBatteryStatus
1 - ok
2 - partially discharged
3 - fully discharged
4 - not present
5 - near end of life
6 - at end of life
7 - unknown
8 - overcharged
NetAppOverTemp
1 - no
2 - yes
NetAppRaidStatus
1 - active
2 - reconstructionInProgress
3 - parityReconstructionInProgress
4 - parityVerificationInProgress
5 - scrubbingInProgress
6 - failed
7 - addingSpare
8 - spare
9 - prefailed
10 - offline
NetAppSpareStatus
1 - spare
2 - addingspare
3 - bypassed
4 - unknown
5 - offline
NetAppVolStatus
1 - unmounted
2 - mounted
3 - frozen
4 - destroying
5 - creating
6 - mounting
7 - unmounting
8 - nofsinfo
9 - replaying
```
2014-08-19 08:38:12 +02:00
Template Device SNMP Generic
======
This is a generic template for SNMP (and any other in fact) devices that do not provide any information via SNMP/any other protocol. It only checks for host availability
2014-08-19 08:38:12 +02:00
Template Inventory
======
This template is used to collect data for automatic host inventory (architecture, operating system, hardware etc). Requires facter (puppetlabs)
2014-08-19 08:38:12 +02:00
Template OMSA
======
Collects data from OpenManage Server Administrator (OMSA) tool for Dell Servers. It monitors hardware components such as chassis, disks, power supplies, fans and other. It requires manual addition of value mappings (Administration -> General -> Value Mapping)
```
Dell Open Manage System Status
1 - Other
2 - Unknown
3 - OK
4 - NonCritical
5 - Critical
6 - NonRecoverable
```
2014-08-19 08:38:12 +02:00
2014-11-22 22:58:06 +01:00
You need also add SUDO for zabbix user: ```zabbix ALL=(ALL) NOPASSWD: /opt/dell/srvadmin/bin/omreport```
2015-06-01 22:11:18 +02:00
Template App S.M.A.R.T.
======
Monitoring for S.M.A.R.T. enabled storage devices (HDD's, SSD's and other). Uses discovery script to populate disks. INFO: For non present S.M.A.R.T. values disable items on per-host level.
2015-09-20 23:41:54 +02:00
Template and scripts created by:
Michał Macioszek, Taras Baran, Michal Gębora, Marcin Wilk, Maks Bednarek, Anna Fałek, Mikołaj Szczuraszek
Template App PowerPath
======
2014-11-26 10:02:33 +01:00
Monitoring for EMC PowerPath: Host-based software for automated data path management, failover and recovery, and optimized load balancing. PowerPath automates, standardizes, and optimizes data paths in physical and virtual environments as well as cloud deployments to deliver high availability and performance.
2014-08-19 08:38:12 +02:00
Template Security
======
2014-11-22 22:58:06 +01:00
Contains items related with basic linux security (iptables status, selinux status) and Fail2ban application. You need also add SUDO for zabbix user: ```zabbix ALL=(ALL) NOPASSWD: /sbin/iptables -L INPUT -n```
Template Device Back-UPS ES 700G
======
Monitoring of APC UPS dedicated battery power. Requires package apcupsd to be installed.