lab:fah_installation
Differences
This shows you the differences between two versions of the page.
lab:fah_installation [2023/04/30 01:34] – improved user | lab:fah_installation [2023/05/01 19:49] (current) – replaced user | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== FAH Installation ====== | ||
- | |||
- | In our previous step we [[stack_-_ansible_-_cmos|checked the health of CMOS batteries]] on our [[Stack of NUCs]]. | ||
- | |||
- | Now we are going to create and run an Ansible playbook to set up [[https:// | ||
- | |||
- | Please note that the NUCs I am using in this lab have only 4 cores, and for some WUs (work units) the client will use only 3 cores, so don't expect to score many points with these small boxes. | ||
- | |||
- | Purpose: | ||
- | * Demonstrate running a complex workload that combines a service with configuration files | ||
- | |||
- | References | ||
- | * [[https:// | ||
- | |||
- | ====== Step 1 - Install the fahcontrol app on NUC 1 ====== | ||
- | The official download [[https:// | ||
- | |||
- | - Open a shell on [[stack - nuc 1|NUC 1]] | ||
- | - Install packages | ||
- | * '' | ||
- | * '' | ||
- | * '' | ||
- | * '' | ||
- | |||
- | ====== Step 2 - Install the FAH client using Ansible ====== | ||
- | From [[Stack - NUC 1|NUC 1]], log in to the Ansible control node, [[Stack - NUC 2|NUC 2]]. | ||
- | |||
- | - Change directory to / | ||
- | - '' | ||
- | - Change directory to / | ||
- | - Modify file / | ||
- | * copy your Ansible node IPs from the file / | ||
- | * chost=' | ||
- | * cpass=' | ||
- | * username=' | ||
- | * passkey=' | ||
- | - '' | ||
- | * if you encounter a DNS lookup failure on some or all nodes | ||
- | * your wireless router should be setting DNS information as part of DHCP | ||
- | * running dhclient might help: '' | ||
- | * if you cannot connect with the control app and/or you see an error regarding a locked database | ||
- | * reboot the node to clear the error | ||
- | * it seems that running the playbook on an already configured system can start multiple copies of FAH and cause the problem; rebooting solves the issue | ||
- | - On [[NUC 1]], open the FAH control program | ||
- | * Add clients one at a time in FAHControl | ||
- | * Any name you want | ||
- | * IP address of the client | ||
- | * Control password you used configuring FAH | ||
- | |||
- | Congratulations! Your [[Stack of NUCs]] is now fully occupied running a valuable workload! Next up is [[FAH Removal]], where we disable FAH and potentially remove it. | ||
- | |||
- | ====== Optional ====== | ||
- | ===== Check FAH Service Status ===== | ||
- | <file yaml checkfahstatus.yml> | ||
- | --- | ||
- | - hosts: clients | ||
- | become: true | ||
- | become_user: | ||
- | tasks: | ||
- | - name: Get FAH service Status | ||
- | ansible.builtin.systemd: | ||
- | state: " | ||
- | name: " | ||
- | register: fah_service_status | ||
- | - name: Show status | ||
- | debug: | ||
- | msg: "{{ fah_service_status.status.ActiveState }}" | ||
- | </ | ||
- | |||
- | '' | ||
- | |||
- | Check points per day and queue information: | ||
- | * '' | ||
- | * '' | ||
- | |||
- | Tell all nodes to finish their work unit then pause: | ||
- | * '' | ||
- | |||
- | ===== Check Temperature ===== | ||
- | In this example we will look into monitoring the CPU and chipset temperature of our NUCs. | ||
- | |||
- | Install lm-sensors | ||
- | * Option 1 - Ad Hoc | ||
- | * '' | ||
- | * Option 2 - Playbook in / | ||
- | * <file yaml lm-sensors.yml> | ||
- | --- | ||
- | - name: lm-sensors install | ||
- | hosts: clients | ||
- | remote_user: | ||
- | become: true | ||
- | tasks: | ||
- | - name: Install lm-sensors | ||
- | apt: | ||
- | name: lm-sensors | ||
- | update_cache: | ||
- | - name: Detect sensors | ||
- | ansible.builtin.command: | ||
- | </ | ||
- | * '' | ||
- | * '' | ||
- | * See [[https:// |
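
Once lm-sensors is installed, the temperature readings can be collected from every node in one pass. The playbook below is only a minimal sketch and is not part of the original lab. It assumes the ''clients'' host group used by the playbooks above, and that the ''sensors'' command provided by lm-sensors is on each node's PATH. The file name ''read-sensors.yml'' and the variable name ''sensors_output'' are hypothetical.

```yaml
---
# read-sensors.yml (hypothetical file name)
# Sketch: run the lm-sensors CLI on every client and print its output.
- name: Read temperatures from all clients
  hosts: clients
  become: true
  tasks:
    - name: Run sensors
      ansible.builtin.command: sensors
      register: sensors_output
      # Reading sensors changes nothing, so never report "changed"
      changed_when: false

    - name: Show readings
      ansible.builtin.debug:
        var: sensors_output.stdout_lines
```

Because the command is read-only, ''changed_when: false'' keeps the play recap clean. The output format of ''sensors'' depends on the hardware and the detected chips, so the readings may look different on each node.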
lab/fah_installation.1682818492.txt.gz · Last modified: 2023/04/30 01:34 by user