Ganglia的安装与部署

1)三台机器安装epel源

  1. [atguigu@hadoop102 flume]$ sudo yum install -y epel-release

2) 在102安装web,meta和monitor

  1. [atguigu@hadoop102 flume]$ sudo yum -y install ganglia-gmetad ganglia-web ganglia-gmond

3) 在103、104安装monitor

  1. [atguigu@hadoop103 flume]$ sudo yum -y install ganglia-gmond
  2. [atguigu@hadoop104 flume]$ sudo yum -y install ganglia-gmond

Ganglia由gmond、gmetad和gweb三部分组成。
gmond(Ganglia Monitoring Daemon)是一种轻量级服务,安装在每台需要收集指标数据的节点主机上。使用gmond,你可以很容易收集很多系统指标数据,如CPU、内存、磁盘、网络和活跃进程的数据等。
gmetad(Ganglia Meta Daemon)整合所有信息,并将其以RRD格式存储至磁盘的服务。
gweb(Ganglia Web)Ganglia可视化工具,gweb是一种利用浏览器显示gmetad所存储数据的PHP前端。在Web界面中以图表方式展现集群的运行状态下收集的多种不同指标数据。
4)修改配置文件/etc/httpd/conf.d/ganglia.conf
修改为红颜色的配置:

  1. #
  2. # Ganglia monitoring system php web frontend
  3. #
  4. Alias /ganglia /usr/share/ganglia
  5. <Location /ganglia>
  6. #Require local
  7. Require ip 192.168.5.1//红色
  8. Require all granted//红色
  9. # Require ip 10.1.2.3
  10. # Require host example.org
  11. </Location>

5)修改配置文件/etc/ganglia/gmetad.conf

  1. [atguigu@hadoop102 flume]$ sudo vim /etc/ganglia/gmetad.conf

修改为:

  1. data_source "hadoop102" hadoop102

6)修改配置文件/etc/ganglia/gmond.conf

  1. [atguigu@hadoop102 flume]$ sudo vim /etc/ganglia/gmond.conf
  2. 修改为:
  3. cluster {
  4. name = "hadoop102"//红色
  5. owner = "unspecified"
  6. latlong = "unspecified"
  7. url = "unspecified"
  8. }
  9. udp_send_channel {
  10. #bind_hostname = yes # Highly recommended, soon to be default.
  11. # This option tells gmond to use a source address
  12. # that resolves to the machine's hostname. Without
  13. # this, the metrics may appear to come from any
  14. # interface and the DNS names associated with
  15. # those IPs will be used to create the RRDs.
  16. # mcast_join = 239.2.11.71//红色
  17. host = hadoop102//红色
  18. port = 8649
  19. ttl = 1
  20. }
  21. udp_recv_channel {
  22. # mcast_join = 239.2.11.71//红色
  23. port = 8649
  24. bind = 0.0.0.0//红色
  25. retry_bind = true
  26. # Size of the UDP buffer. If you are handling lots of metrics you really
  27. # should bump it up to e.g. 10MB or even higher.
  28. # buffer = 10485760
  29. }
  1. 将修改后的文件同步到103104。<br />**7)修改配置文件/etc/selinux/config**
  1. [atguigu@hadoop102 flume]$ sudo vim /etc/selinux/config

修改为:

  1. # This file controls the state of SELinux on the system.
  2. # SELINUX= can take one of these three values:
  3. # enforcing - SELinux security policy is enforced.
  4. # permissive - SELinux prints warnings instead of enforcing.
  5. # disabled - No SELinux policy is loaded.
  6. SELINUX=disabled
  7. # SELINUXTYPE= can take one of these two values:
  8. # targeted - Targeted processes are protected,
  9. # mls - Multi Level Security protection.
  10. SELINUXTYPE=targeted

尖叫提示:selinux本次生效关闭必须重启,如果此时不想重启,可以临时生效之:

  1. [atguigu@hadoop102 flume]$ sudo setenforce 0

8)102启动ganglia三个后台,103,104启动gmond

  1. [atguigu@hadoop102 flume]$ sudo systemctl start httpd
  2. [atguigu@hadoop102 flume]$ sudo systemctl start gmetad
  3. [atguigu@hadoop102 flume]$ sudo systemctl start gmond
  4. [atguigu@hadoop103 flume]$ sudo systemctl start gmond
  5. [atguigu@hadoop104 flume]$ sudo systemctl start gmond

9)打开网页浏览ganglia页面
http://192.168.1.102/ganglia
尖叫提示:如果完成以上操作依然出现权限不足错误,请修改/var/lib/ganglia目录的权限:
image.png

  1. [atguigu@hadoop102 flume]$ sudo chmod -R 777 /var/lib/ganglia

3.8.2 操作Flume测试监控

1)启动Flume任务

  1. [atguigu@hadoop102 flume]$ bin/flume-ng agent \
  2. --conf conf/ \
  3. --name a1 \
  4. --conf-file jobs/file-flume-logger.conf \
  5. -Dflume.root.logger=INFO,console \
  6. -Dflume.monitoring.type=ganglia \
  7. -Dflume.monitoring.hosts=hadoop102:8649

2)发送数据观察ganglia监测图

  1. [atguigu@hadoop102 flume]$ nc localhost 44444

样式如图:

图例说明:

字段(图表名称) 字段含义
EventPutAttemptCount source尝试写入channel的事件总数量
EventPutSuccessCount 成功写入channel且提交的事件总数量
EventTakeAttemptCount sink尝试从channel拉取事件的总数量。
EventTakeSuccessCount sink成功读取的事件的总数量
StartTime channel启动的时间(毫秒)
StopTime channel停止的时间(毫秒)
ChannelSize 目前channel中事件的总数量
ChannelFillPercentage channel占用百分比
ChannelCapacity channel的容量