This is an old revision of the document!
Table of Contents
FAH Installation
In our previous step we checked the health of CMOS batteries on our Stack of NUCs.
Now we are going to create and run an Ansible playbook to set up Folding at Home (FAH) on the nodes. I have updated the playbook by ajacocks to add the current release and hack up a quick fix.
Please note that the NUCs in am using this lab have only 4 cores, and for some WU's (work units) the client will only use 3 core. So don't expected to be scoring many points with these small boxes.
Purpose:
- Demonstrate a running a complex workload of a service combined with configuration files
References
Step 1 - Install the fahcontrol app on NUC 1
The official download here does not work with Ubuntu 22.04. Use https://github.com/cdberkstresser/fah-control.
- Open a shell on NUC 1
- Install packages
sudo apt-get install -y python3-stdeb python3-gi python3-all python3-six debhelper dh-python gir1.2-gtk-3.0
cd fah-control
./FAHControl
Step 2 - Install the the FAH client using Ansible
From NUC 1, log in to the Ansible control node, NUC 2.
- Change directory to /home/ansible/my-project
git clone –branch ubuntu-22.040-lts https://github.com/doritoes/fah.git
- Change directory to /home/ansible/my-project/fah
- Modify file /home/ansible/my-project/fah/inventory
- copy your ansible node IPs from the file /home/ansible/my-project/hosts to the [clients] section
- chost='IP of Control Node'
- cpass='control-node-password'
- username='(Yourname @ folding@home)'
- passkey='(redacted passkey from folding@home)'
ansible-playbook main.yml
- if you encounter a DNS lookup failure on some or all nodes
- your wireless router should be setting DNS information as part of DHCP
- running dhclient might help:
ansible -i hosts all -a dhclient
- if you cannot connect with the control app and/or you see an error regarding a locked database
- reboot the node to clear the error
- it seems running the playbook on an already configured system and run multiple copies of FAH and cause the problem; rebooting solves the issue
- On NUC 1, open the FAH control program
- Add clients one at a time in FAHControl
- Any name you want
- IP address of the client
- Control password you used configuring FAH
Next Step
Congratulations! Your Stack of NUCs is now fully occupied running a valuable workload! Next up is FAH Removal, where we disable FAH and potentially remove it.
Optional
Check FAH Status
- Check the FAHClient status
ansible-playbook checkfahstatus.yml
- checkfahstatus.yml
--- - hosts: clients become: true become_user: root tasks: - name: Get FAH service Status ansible.builtin.systemd: state: "started" name: "FAHClient" register: fah_service_status - name: Show status debug: msg: "{{ fah_service_status.status.ActiveState }}"
- Check points per day (PPD) and queue information:
ansible -i ../hosts all -a "FAHClient --send-command ppd"
ansible -i ../hosts all -a "FAHClient --send-command queue-info"
- Tell all nodes to finish their work unit then pause
ansible -i ../hosts all -a "FAHClient --send-command finish"
- It's good form to finish the work units that are assigned to you before removing FAH from the nodes
Check CPU Utilization
Check the CPU load on the nodes
ansible-playbook getcpu.yml
- getcpu.yml
--- - hosts: all gather_facts: false tasks: - name: Get CPU usage shell: "top -b -n 1" register: top - name: Set CPU usage facts set_fact: user_cpu: "{{ top.stdout_lines[2].split()[1] }}" system_cpu: "{{ top.stdout_lines[2].split()[3] }}" - name: Output CPU usage facts debug: msg: - "User CPU usage: {{ user_cpu }}" - "System CPU usage: {{ system_cpu }}"
Check Temperature
In this example we will look into monitoring the CPU and chipset temperature of our NUCs.
Install lm-sensors
- Option 1 - Ad Hoc
ansible -i hosts all -m apt -a “name=lm-tools state=present”
- Option 2 - Playbook in /home/ansible/my-project/fah/lm-sensors.yml
- lm-sensors.yml
--- - name: lm-sensors install hosts: clients remote_user: ansible become: true tasks: - name: Install lm-sensors apt: name: lm-sensors update_cache: true - name: Detect sensors ansible.builtin.command: sensors-detect --auto
ansible-playbook lm-sensors.yml
ansible -i ../hosts all -a sensors
- See this link for more information on using this sensors information with Ansible.