openshift3.11下hawkular metrics重装

2022-04-24 00:00:00 专区 订阅 删除 报错 重装

Keyspace hawkular_metrics does not exist报错
日志中还有未通过版本检测,webconsole显示pod的健康状态为unhealthy,简单通过删除deployment的yaml中的版本检测,pod的健康状态可以装换为health,但是一样不能得到metrics,监控数据。

readinessProbe:
exec:
command:
- /opt/hawkular/scripts/hawkular-metrics-readiness.py
failureThreshold: 3
periodSeconds: 10
successThreshold: 1
timeoutSeconds: 1

尚未找到该问题突然报错的原因,因项目需要,直接重装了hawkular_metrics, 使用ansible playbook,找到之前安装集群的playbook的目录,例如原来我的集群创建是通过:

ansible-playbook ~/openshift-ansible/playbooks/deploy_cluster.yml

里面具体模块的单独安装的路劲示例如下

Master Additional Install ~/openshift-ansible/playbooks/openshift-master/additional_config.yml
Node Join ~/openshift-ansible/playbooks/openshift-node/join.yml
Hosted Install ~/openshift-ansible/playbooks/openshift-hosted/config.yml
Monitoring Install ~/openshift-ansible/playbooks/openshift-monitoring/config.yml
Web Console Install ~/openshift-ansible/playbooks/openshift-web-console/config.yml
Admin Console Install ~/openshift-ansible/playbooks/openshift-console/config.yml

以此类推,我们找到hawkular metrics的重装路劲,运行如下

ansible-playbook ~/openshift-ansible/playbooks/openshift-metrics/schema.yml

重装步骤具体如下
登录到Openshift, 进入openshift-infra项目
运行如下代码

oc get jobs -n openshift-infra

删除metrics的Jobs后再重装。

删除jobs

oc delete jobs/hawkular-metrics-schema -n openshift-infra

然后即可重装metric

ansible-playbook ~/openshift-ansible/playbooks/openshift-metrics/schema.yml

搞定!

相关文章