Kafka monitoring tips
StarsL.cn recently released what it calls the most powerful Kafka dashboard ever.
# 1. Monitoring Kafka outside the cluster from Kubernetes, with automatic discovery of the kafka-exporter pod
[root@prod-manage exporter]# cat cloud-kafka-exporter.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: cloud-kafka-exporter
  namespace: kube-mon
  labels:
    app: cloud-kafka-exporter
spec:
  selector:
    matchLabels:
      app: cloud-kafka-exporter
  template:
    metadata:
      labels:
        app: cloud-kafka-exporter
      annotations:
        prometheus.io/scrape: "true"
        prometheus.io/port: "9308"
    spec:
      containers:
      - name: cloud-kafka-exporter
        image: registry.us-east-1.aliyuncs.com/y110/kafka-exporter:latest
        resources:
          limits:
            memory: "128Mi"
            cpu: "500m"
        args: ["--kafka.server=172.19.192.234:9092"] # for a cluster, any one broker IP is enough
        ports:
        - containerPort: 9308
          name: http
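The `prometheus.io/scrape` and `prometheus.io/port` annotations only take effect if Prometheus has a pod-discovery job that reads them. Here is a minimal sketch of such a job, assuming Prometheus runs inside the cluster with permission to list pods. The job name `kubernetes-pods` and the extra `namespace` and `pod` labels are illustrative choices, not taken from the original setup:

```yaml
# Sketch of a Prometheus scrape job (assumed, not the author's actual config).
scrape_configs:
  - job_name: kubernetes-pods   # hypothetical job name
    kubernetes_sd_configs:
      - role: pod               # discover every pod through the Kubernetes API
    relabel_configs:
      # Keep only pods annotated with prometheus.io/scrape: "true".
      - source_labels: [__meta_kubernetes_pod_annotation_prometheus_io_scrape]
        action: keep
        regex: "true"
      # Point the scrape address at the port given in prometheus.io/port.
      - source_labels: [__address__, __meta_kubernetes_pod_annotation_prometheus_io_port]
        action: replace
        regex: ([^:]+)(?::\d+)?;(\d+)
        replacement: $1:$2
        target_label: __address__
      # Carry namespace and pod name over as target labels.
      - source_labels: [__meta_kubernetes_namespace]
        target_label: namespace
      - source_labels: [__meta_kubernetes_pod_name]
        target_label: pod
```

With a job like this in place, each new exporter Deployment carrying the two annotations should show up as a target without further changes to the Prometheus configuration.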
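# 2. Everything was running smoothly, until the exporter for one Kafka cluster suddenly failed to start...
# 3. Baffled: why would this even fail? According to the project's issues, it comes down to a version problem. Comparing the two clusters showed a small version difference (2.12 is the Scala version, 0.10.2.2 is the Kafka version)
# 4. Configure the version. The article also notes that Kafka 3 needs the version specified; if the exporter fails to start against an older Kafka, try adding a version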
apiVersion: apps/v1
kind: Deployment
metadata:
  name: kafka-exporter
  namespace: kube-mon
  labels:
    app: kafka-exporter
spec:
  selector:
    matchLabels:
      app: kafka-exporter
  template:
    metadata:
      labels:
        app: kafka-exporter
      annotations:
        prometheus.io/scrape: "true"
        prometheus.io/port: "9308"
    spec:
      containers:
      - name: kafka-exporter
        #image: danielqsj/kafka-exporter:v1.2.0 # with this image the version flag is not needed; dashboard ID: 7589
        image: registry.us-east-1.aliyuncs.com/y110/kafka-exporter:latest
        resources:
          limits:
            memory: "128Mi"
            cpu: "500m"
        args:
        - '--kafka.server=172.19.32.255:9092' # for a cluster, any one broker IP is enough
        - '--kafka.version=0.10.2.2' # the key setting
        ports:
        - containerPort: 9308
          name: http
# 5. Querying rules in Prometheus
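The original shows no rule text for this step, so the following is only a sketch of what a rule file built on the exporter's metrics might look like. It assumes the metric names `kafka_consumergroup_lag` and `kafka_brokers` as exposed by danielqsj/kafka_exporter. The group name, alert names, thresholds, and the three-broker cluster size are hypothetical:

```yaml
# Illustrative Prometheus rule file (assumed values, adjust to the real cluster).
groups:
  - name: kafka-exporter-example        # hypothetical group name
    rules:
      # Fires when a consumer group stays far behind on a topic.
      - alert: KafkaConsumerGroupLagHigh
        expr: sum by (consumergroup, topic) (kafka_consumergroup_lag) > 1000   # threshold is an assumption
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "Consumer group {{ $labels.consumergroup }} is lagging on topic {{ $labels.topic }}"
      # Fires when the exporter sees fewer brokers than expected.
      - alert: KafkaBrokerCountLow
        expr: kafka_brokers < 3           # assumes a three-broker cluster
        for: 2m
        labels:
          severity: critical
        annotations:
          summary: "Fewer Kafka brokers than expected are visible to the exporter"
```

The same expressions can be pasted into the Prometheus expression browser first to confirm that the exporter is returning data before any alerting is wired up.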
# 6. Import dashboard ID: 21078
Last updated: 2025/04/25, 03:40:17