System Reboot Engineer System Reboot Engineer
首页
运维
编程

小布江

首页
运维
编程
  • Kubernetes

  • 日常

  • Prometheus

    • Alertmanager报警历史持久化
    • blackbox-exporter监测站点
    • 监控kafka小tips
      • 常用Exporter
      • 远程存储之VictoriaMetrics
      • Nginx-vts模块
      • alertmanager
      • VMagent
      • VMalert
    • Ci

    • 运维
    • Prometheus
    小布江
    2024-07-10
    目录

    监控kafka小tips


    最近StarsL.cn (opens new window)出了史上最强Kafka看板 (opens new window)


    # 1. 基于Kubertenes监控集群外kafka并自动发现kafka-exporter的pod
    [root@prod-manage exporter]# cat cloud-kafka-exporter.yaml
    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: cloud-kafka-exporter
      namespace: kube-mon
      labels:
        app: cloud-kafka-exporter
    spec:
      selector:
        matchLabels:
          app: cloud-kafka-exporter
      template:
        metadata:
          labels:
            app: cloud-kafka-exporter
          annotations:
            prometheus.io/scrape: "true"
            prometheus.io/port: "9308"
        spec:
          containers:
          - name: cloud-kafka-exporter
            image: registry.us-east-1.aliyuncs.com/y110/kafka-exporter:latest
            resources:
              limits:
                memory: "128Mi"
                cpu: "500m"
            args: ["--kafka.server=172.19.192.234:9092"] # 集群环境填写其中任意一个IP即可
            ports:
            - containerPort: 9308
              name: http
    
    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31

    # 2. 一切都是如火如荼的正常进行着,突然有套kafka集群的exporter启动失败...

    image-20240730164721589

    # 3. 一脸懵逼,为什么?这都能报错?看issues (opens new window),大致就是版本的问题.两套对比发现版本有点小区别,(2.12表示Scala版本,0.10.22是kafka版本)

    image-20240730165431929

    # 4. 配置version,文章也有写kafka3需要指定版本,低版本启动出现问题可以尝试加个version
    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: kafka-exporter
      namespace: kube-mon
      labels:
        app: kafka-exporter
    spec:
      selector:
        matchLabels:
          app: kafka-exporter
      template:
        metadata:
          labels:
            app: kafka-exporter
          annotations:
            prometheus.io/scrape: "true"
            prometheus.io/port: "9308"
        spec:
          containers:
          - name: kafka-exporter
             #image: danielqsj/kafka-exporter:v1.2.0 # 用这个版本可以不用加version字段,看板ID: 7589 
            image: registry.us-east-1.aliyuncs.com/y110/kafka-exporter:latest
            resources:
              limits:
                memory: "128Mi"
                cpu: "500m"
            args:
              - '--kafka.server=172.19.32.255:9092'  # 集群环境填写其中任意一个IP即可
              - '--kafka.version=0.10.2.2' # 关键处
            ports:
            - containerPort: 9308
              name: http
    
    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33

    # 5. Prometheus规则查询

    image-20240730172714035


    # 6. 导入看板ID:21078

    image-20240730171047707

    #Prometheus
    上次更新: 2025/04/25, 03:40:17
    blackbox-exporter监测站点
    常用Exporter

    ← blackbox-exporter监测站点 常用Exporter→

    最近更新
    01
    Harbor复制镜像
    04-15
    02
    CPU亲和
    04-10
    03
    开启telnet登录
    04-09
    更多文章>
    Theme by Vdoing
    • 跟随系统
    • 浅色模式
    • 深色模式
    • 阅读模式