changes for controller HA #196

Open
raman-mann wants to merge 8 commits into cablelabs:master from raman-mann:Controller-HA-support

Conversation

@raman-mann
Contributor

What does this PR do?

Support controller HA in OpenStack.

Do you have any concerns with this PR?

NA

How can the reviewer verify this PR?

By deploying OpenStack with multiple controller nodes.

Any background context you want to provide?

NA

Screenshots or logs (if appropriate)

NA

Questions:

  • Have you connected this PR to the issue it resolves?
    Controller HA #12

  • Does the documentation need an update?
    Yes

  • Does this add new Python dependencies?
    No

  • Have you added unit or functional tests for this PR?
    No

  • Does this patch update any configuration files?
    No

@skulasekar
Collaborator

Build finished.

Contributor

@bo-quan bo-quan left a comment

@raman-mann Could you add the CI-related changes to this PR so that we can validate the patch via CI?

Collaborator

@RandyLevensalor RandyLevensalor left a comment

This commit introduces additional pep8 issues.

@RandyLevensalor
Collaborator

Run Tests

1 similar comment
@RandyLevensalor
Collaborator

Run Tests

@bo-quan
Contributor

bo-quan commented Oct 1, 2018

Run Tests

@bo-quan
Contributor

bo-quan commented Oct 11, 2018

Run Tests

@bo-quan
Contributor

bo-quan commented Oct 11, 2018

Aricent thinks the CI issue is not caused by the PR.

@mansi-jain2
Contributor

The CI script has been modified to add a new controller node for HA testing, and the logs show an SSH failure to this 4th node:

TASK [Deploy - iaas_launch.py -d *** This will run for around an hour without output] ***
21:27:23 fatal: [10.197.113.106]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh:

@spisarski: Could you please check whether the CI script changes need any further modification?
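
Before digging into the Ansible side, it may help to confirm whether the 4th node accepts TCP connections on the SSH port at all. The following is only a minimal sketch using the Python standard library; the function name `ssh_port_open` is hypothetical and not part of snaps-openstack, and port 22 is an assumption about how the CI hosts are configured.

```python
import socket


def ssh_port_open(host: str, port: int = 22, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    This only checks that something is listening on the port. It does not
    authenticate, so a True result does not prove that Ansible can log in.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable networks.
        return False
```

Calling `ssh_port_open("10.197.113.106")` from the CI build host, with the address taken from the UNREACHABLE line above, would separate "the VM never came up" from "the VM is up but SSH authentication fails".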

@bo-quan
Contributor

bo-quan commented Oct 25, 2018

Run Tests

@bo-quan
Contributor

bo-quan commented Oct 25, 2018

Bo will work with Steve and Randy to see whether the CI script needs further changes to resolve this CI failure.

@bo-quan
Contributor

bo-quan commented Oct 26, 2018

Run Tests

3 similar comments
@bo-quan
Contributor

bo-quan commented Oct 29, 2018

Run Tests

@RandyLevensalor
Collaborator

Run Tests

@bo-quan
Contributor

bo-quan commented Nov 15, 2018

Run Tests

@bo-quan
Contributor

bo-quan commented Nov 15, 2018

The 4th VM sometimes appears to deploy to vdb instead of vda, which causes its creation to fail. I need to dig further to understand why.

@skulasekar
Collaborator

Run Tests

1 similar comment
@skulasekar
Collaborator

Run Tests

@bo-quan
Contributor

bo-quan commented Nov 20, 2018

@raman-mann @sbhojpur After more investigation, I found that the 4th VM creation failure was due to a project quota issue. Could you please increase the project quota in the ci/snaps/snaps_os_tmplt.yaml file:

     quotas:
    -  ram: 102400
    -  cores: 40
    +  ram: 128000
    +  cores: 48

Increasing project quota for HA
@bo-quan
Contributor

bo-quan commented Nov 25, 2018

Run Tests

3 similar comments
@bo-quan
Contributor

bo-quan commented Nov 26, 2018

Run Tests

@bo-quan
Contributor

bo-quan commented Nov 27, 2018

Run Tests

@bo-quan
Contributor

bo-quan commented Nov 27, 2018

Run Tests

@bo-quan
Contributor

bo-quan commented Nov 28, 2018

With the quota patch, CI got past the VM creation issue; however, it then ran into the following error:

01:36:15 "EXECUTING SET HOSTS PLAY",
01:36:15 "kolla_sethosts.yaml",
01:36:15 "{'BASE_FILE_PATH': '/tmp/snaps-openstack/snaps_openstack/packages/source/', 'PROXY_DATA_FILE': '/tmp/snaps-openstack/snaps_openstack/ansible_p/ansible_utils/proxy_data.yaml', 'VARIABLE_FILE': '/tmp/snaps-openstack/snaps_openstack/ansible_p/ansible_utils/variable.yaml', 'target': '10.1.0.13', 'host_name': 'COMPUTE22557'}",
01:36:15 "FAILED IN SETTING HOSTS FILE"

@spisarski @RandyLevensalor does the error above ring a bell?

@spisarski
Contributor

@bo-quan - Sorry, it has been a while since I've been in this code, and I have never encountered this problem. Please let me know if you need assistance debugging.

@bo-quan
Contributor

bo-quan commented Nov 29, 2018

@raman-mann @sbhojpur If you have a chance (I'll be tied up with another higher-priority task for the next few days), could you take a look at the CI failure:

01:36:15 "EXECUTING SET HOSTS PLAY",
01:36:15 "kolla_sethosts.yaml",
01:36:15 "{'BASE_FILE_PATH': '/tmp/snaps-openstack/snaps_openstack/packages/source/', 'PROXY_DATA_FILE': '/tmp/snaps-openstack/snaps_openstack/ansible_p/ansible_utils/proxy_data.yaml', 'VARIABLE_FILE': '/tmp/snaps-openstack/snaps_openstack/ansible_p/ansible_utils/variable.yaml', 'target': '10.1.0.13', 'host_name': 'COMPUTE22557'}",
01:36:15 "FAILED IN SETTING HOSTS FILE"

in https://snapsci.cablelabs.com:8443/job/snaps-openstack/127/console
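
As a starting point for whoever picks this up, the parameter dump in the log is a Python dict literal, so the failing host can be pulled out of it mechanically. This is only a sketch: the helper name `parse_play_params` is hypothetical, and it assumes the console prints the dict exactly as quoted above.

```python
import ast

# Parameter dump copied from the console output quoted above.
LOGGED_PARAMS = (
    "{'BASE_FILE_PATH': '/tmp/snaps-openstack/snaps_openstack/packages/source/', "
    "'PROXY_DATA_FILE': '/tmp/snaps-openstack/snaps_openstack/ansible_p/ansible_utils/proxy_data.yaml', "
    "'VARIABLE_FILE': '/tmp/snaps-openstack/snaps_openstack/ansible_p/ansible_utils/variable.yaml', "
    "'target': '10.1.0.13', 'host_name': 'COMPUTE22557'}"
)


def parse_play_params(text: str) -> dict:
    """Parse a dict literal from the log without evaluating arbitrary code."""
    params = ast.literal_eval(text)
    if not isinstance(params, dict):
        raise ValueError("expected a dict literal")
    return params


params = parse_play_params(LOGGED_PARAMS)
print(params["target"], params["host_name"])  # prints: 10.1.0.13 COMPUTE22557
```

Here the set-hosts play failed for target 10.1.0.13 with host name COMPUTE22557, so that node is a reasonable first place to look.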

6 participants