正式从头开始(清晰版) 步骤 1:拿到 Prometheus 令牌(主机执行) bash 运行 kubectl get secret prometheus-k8s-token-mngkd -n kubesphere-monitoring-system -o jsonpath={.data.token} | base64 -d 把输出的一长串字符复制下来! 步骤 2:进入 Prometheus 容器(主机执行) bash 运行 kubectl exec -it prometheus-k8s-0 -n kubesphere-monitoring-system – sh 步骤 3:容器内执行调试(只复制这一行) sh wget --no-check-certificate --spider https://11.2.212.6:9100 正常返回:401 Unauthorized(说明需要 token) 步骤 4:容器内执行带 token 访问(最终测试) 把下面命令中的 【TOKEN】 替换成你刚才复制的长串: sh wget --no-check-certificate --header=“Authorization: Bearer 【TOKEN】” https://11.2.212.6:9100/metrics 如果你执行成功,会看到: plaintext Connecting to 11.2.212.6:9100 (11.2.212.6:9100) saving to ‘metrics’ metrics 100% |******************************| 123k 0:00:00 ETA ‘metrics’ saved 这就代表: ✅ 网络通 ✅ HTTPS 正常 ✅ 令牌有效 ✅ node-exporter 工作正常 ✅ kube-rbac-proxy 正常 结论(100% 确定) 你所有组件都正常! 唯一问题:Prometheus 自己没有正确带上 token! 我可以继续带你一键修复 Prometheus 配置,让监控立刻恢复。 [root@master1 ~]# kubectl get secret prometheus-k8s-token-mngkd -n kubesphere-monitoring-system -o jsonpath={.data.token} | base64 -d eyJhbGciOiJSUzI1NiIsImtpZCI6IlhMeDRic0tPVnBMRWgtSkd0cncwemFFU3c3czFNd1B6U1haQTZ5SWxQZ1UifQ.eyJpc3MiOiJrdWJlcm5ldGVzL3NlcnZpY2VhY2NvdW50Iiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9uYW1lc3BhY2UiOiJrdWJlc3BoZXJlLW1vbml0b3Jpbmctc3lzdGVtIiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9zZWNyZXQubmFtZSI6InByb21ldGhldXMtazhzLXRva2VuLW1uZ2tkIiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9zZXJ2aWNlLWFjY291bnQubmFtZSI6InByb21ldGhldXMtazhzIiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9zZXJ2aWNlLWFjY291bnQudWlkIjoiMGU4NzdmZWItMmFkOC00MTczLWJkZTEtMjFmZmJhZmM1MDNhIiwic3ViIjoic3lzdGVtOnNlcnZpY2VhY2NvdW50Omt1YmVzcGhlcmUtbW9uaXRvcmluZy1zeXN0ZW06cHJvbWV0aGV1cy1rOHMifQ.l_b5hOvsASEhOaVUPcFwKIvPT9tmSgFYe71dNw_C8N2DVROhNHuGljePzHVUp6aPs_pfjh6KJWLm6RGZbJP24fjbtXH2Cxb8JsnkTo07rSoa-Q0xVwvKViz3qmBi6dO4mIscvB_gOL3ZjTyh6_fS2CQAowukPmiE8RoCVinsPt3fu2EoQmJ-jZWJ9wzKRfW-tvTwQIm4-DWZwHzDRpJjvjM7QfG1IA6ufODkbjoDRhKW6WGxCZj-zboL4qqwiny7ZLkhtM49aO_NmHTbqUu_odGm3DCPvt_kiY6XC1PsvrUhBh6wAtkuL9CgDH7VOVxPNJpojBSxWGoeFpTd0uubew[root@master1 ~]# toekn是base64需要解析吗
eyJhbGciOiJSUzI1NiIsImtpZCI6IlhMeDRic0tPVnBMRWgtSkd0cncwemFFU3c3czFNd1B6U1haQTZ5SWxQZ1UifQ.eyJpc3MiOiJrdWJlcm5ldGVzL3NlcnZpY2VhY2NvdW50Iiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9uYW1lc3BhY2UiOiJrdWJlc3BoZXJlLW1vbml0b3Jpbmctc3lzdGVtIiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9zZWNyZXQubmFtZSI6InByb21ldGhldXMtazhzLXRva2VuLW1uZ2tkIiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9zZXJ2aWNlLWFjY291bnQubmFtZSI6InByb21ldGhldXMtazhzIiwia3ViZXJuZXRlcy5pby9zZXJ2aWNlYWNjb3VudC9zZXJ2aWNlLWFjY291bnQudWlkIjoiMGU4NzdmZWItMmFkOC00MTczLWJkZTEtMjFmZmJhZmM1MDNhIiwic3ViIjoic3lzdGVtOnNlcnZpY2VhY2NvdW50Omt1YmVzcGhlcmUtbW9uaXRvcmluZy1zeXN0ZW06cHJvbWV0aGV1cy1rOHMifQ.l_b5hOvsASEhOaVUPcFwKIvPT9tmSgFYe71dNw_C8N2DVROhNHuGljePzHVUp6aPs_pfjh6KJWLm6RGZbJP24fjbtXH2Cxb8JsnkTo07rSoa-Q0xVwvKViz3qmBi6dO4mIscvB_gOL3ZjTyh6_fS2CQAowukPmiE8RoCVinsPt3fu2EoQmJ-jZWJ9wzKRfW-tvTwQIm4-DWZwHzDRpJjvjM7QfG1IA6ufODkbjoDRhKW6WGxCZj-zboL4qqwiny7ZLkhtM49aO_NmHTbqUu_odGm3DCPvt_kiY6XC1PsvrUhBh6wAtkuL9CgDH7VOVxPNJpojBSxWGoeFpTd0uubew
kubectl exec -it prometheus-k8s-0 -n kubesphere-monitoring-system – sh
vi token.txt
TOKEN=$(cat token.txt) && wget --no-check-certificate --header=“Authorization: Bearer $TOKEN” https://11.2.212.6:9100/metrics
/prometheus # TOKEN=$(cat token.txt) && wget --no-check-certificate --header=“Authorization: Bearer $TOKEN” https://11.2.212.6:9100/metrics Connecting to 11.2.212.6:9100 (11.2.212.6:9100) saving to ‘metrics’ metrics 100% |***********************************************************************************************************************************************************************| 434k 0:00:00 ETA ‘metrics’ saved /prometheus # 正确的
进入容器。 下一步
[root@master1 ~]# kubectl get ClusterConfiguration ks-installer -n kubesphere-system -o yaml
monitoring:
GPUMonitoring:
enabled: false
endpoint: http://prometheus-operated.kubesphere-monitoring-system.svc:9090
| 组件 | 容器 | 端口 | 协议 | 认证 | 作用 |
|---|---|---|---|---|---|
| prometheus-k8s-0 Pod | prometheus | 9090 | HTTP | ❌ 无 | Prometheus 主程序,查指标 |
| config-reloader | 8080 | HTTP | ❌ 无 | 热重载配置 | |
| init-config-reloader | 8080 | HTTP | ❌ 无 | 初始化(已终止) | |
| prometheus-operator Pod | prometheus-operator | 8080 | HTTP | ❌ 无 | Operator 主程序(不对外) |
| kube-rbac-proxy | 8443 | HTTPS | ✅ 有 | 对外代理,加 RBAC 认证 | |
| prometheus-operated Service | — | 9090 | HTTP | ❌ 无 | 转发到 prometheus 容器 |
| — | 8080 | HTTP | ❌ 无 | 转发到 config-reloader | |
| — | 8443 | HTTPS | ✅ 有 | 转发到 operator 的 kube-rbac-proxy |
[root@master1 ~]# kubectl exec -it prometheus-k8s-0 -n kubesphere-monitoring-system -c prometheus – wget -qO- ‘http://localhost:9090/api/v1/query?query=node_cpu_seconds_total’ | head -c 500
{“status”:“success”,“data”:{“resultType”:“vector”,“result”:[]}}[root@master1 ~]#

797

被折叠的 条评论
为什么被折叠?



