Note: This is a beta release of Red Hat Bugzilla 5.0. The data contained within is a snapshot of the live data so any changes you make will not be reflected in the production Bugzilla. Also email is disabled so feel free to test any aspect of the site that you want. File any problems you find or give feedback here.
Bug 1690995 - Metrics store Elasticsearch pod has status Error
Summary: Metrics store Elasticsearch pod has status Error
Alias: None
Product: ovirt-engine-metrics
Classification: oVirt
Component: Generic
Version: unspecified
Hardware: Unspecified
OS: Unspecified
Target Milestone: ovirt-4.3.3
: ---
Assignee: Shirly Radco
QA Contact: Ivana Saranova
Depends On:
Blocks: 1631193
TreeView+ depends on / blocked
Reported: 2019-03-20 15:52 UTC by Ivana Saranova
Modified: 2019-04-16 13:58 UTC (History)
2 users (show)

Fixed In Version:
Doc Type: If docs needed, set a value
Doc Text:
Clone Of:
Last Closed: 2019-04-16 13:58:32 UTC
oVirt Team: Metrics
sradco: ovirt-4.3?
lleistne: testing_ack+

Attachments (Terms of Use)

Description Ivana Saranova 2019-03-20 15:52:03 UTC
Description of problem:
When installing metrics according to the README in the role, the Elasticsearch deploy pod on the created metrics-store machine ends with Error status.

Version-Release number of selected component (if applicable):

How reproducible:

Steps to Reproduce:
1. On the Manager machine, copy /etc/ovirt-engine-metrics/config.yml.example to config.yml:
# cp /etc/ovirt-engine-metrics/metrics-store-config.yml.example /etc/ovirt-engine-metrics/config.yml.d/metrics-store-config.yml
2. Update the values of /etc/ovirt-engine-metrics/metrics-store-config.yml to match the details of your specific environment:
# vi /etc/ovirt-engine-metrics/config.yml.d/metrics-store-config.yml

3. Go to ovirt-engine-metrics repo:
# cd /usr/share/ovirt-engine-metrics

4. Run the metrics store installation playbook that creates the metrics store installer virtual machine
# ANSIBLE_JINJA2_EXTENSIONS="" ./ --playbook=ovirt-metrics-store-installation.yml

**Note:** If this playbook ends with failure on
But all vms are created succesfully. Need to see how to fix the failure.
You can continue with deploying OpenShift from the metrics store installer virtual machine.

5. Log into the admin portal and review the metrics store installer virtual machine creation.

6. Log into the metrics store installer virtual machine
# ssh root@<metrics-store-installer ip or fqdn>
**Note:** If you are not using DNS, make sure to add the new OpenShift virtual machines  to /etc/hosts on the engine and installer machines.

7. Run the ansible playbook that deploys OpenShift on the created vms

# ANSIBLE_CONFIG="/usr/share/ansible/openshift-ansible/ansible.cfg" \
  ANSIBLE_ROLES_PATH="/usr/share/ansible/roles/:/usr/share/ansible/openshift-ansible/roles" \
  ansible-playbook -i integ.ini install_okd.yaml -e @vars.yaml

Actual results:

[root@master0 ~]# oc get pods
NAME                                       READY     STATUS    RESTARTS   AGE
logging-es-data-master-dv5h9nw6-1-deploy   0/1       Error     0          1h
logging-fluentd-j9kp5                      1/1       Running   0          1h
logging-kibana-1-tzl8v                     2/2       Running   0          2h
logging-mux-1-rpjwn                        1/1       Running   0          1h

[root@master0 ~]# oc get svc
NAME                    TYPE        CLUSTER-IP       EXTERNAL-IP    PORT(S)     AGE
logging-es              ClusterIP      master0        9200/TCP    2h
logging-es-cluster      ClusterIP   None             <none>         9300/TCP    2h
logging-es-prometheus   ClusterIP   <none>         443/TCP     2h
logging-kibana          ClusterIP   <none>         443/TCP     2h
logging-mux             ClusterIP    master0        24284/TCP   2h

[root@master0 ~]# oc logs logging-es-data-master-dv5h9nw6-1-deploy
--> Scaling logging-es-data-master-dv5h9nw6-1 to 1
Warning: acceptAvailablePods encountered %T, retryingwatch closed before Until timeout--> Error listing events for replication controller logging-es-data-master-dv5h9nw6-1: Get dial tcp connect: connection refused
error: update acceptor rejected logging-es-data-master-dv5h9nw6-1: acceptAvailablePods failed to watch ReplicationController openshift-logging/logging-es-data-master-dv5h9nw6-1: Get dial tcp connect: connection refused

Expected results:
Elasticsearch pod is running and working properly.

Additional info:

Comment 1 Ivana Saranova 2019-04-04 09:32:13 UTC
1) Install metrics-store according to the documentation and README
2) Check if Elasticsearch pod is deployed and running: oc get pods

Elasticsearch pod is ready, running and successfully deployed.

Verified in: 

Also verified in:

Comment 2 Sandro Bonazzola 2019-04-16 13:58:32 UTC
This bugzilla is included in oVirt 4.3.3 release, published on April 16th 2019.

Since the problem described in this bug report should be
resolved in oVirt 4.3.3 release, it has been closed with a resolution of CURRENT RELEASE.

If the solution does not work for you, please open a new bug report.

Note You need to log in before you can comment on or make changes to this bug.