Upgrading Endeavour

From Nuclear Physics Group Documentation Pages
Jump to navigationJump to search

Upgrade main system

Started with a sideways migration to Centos 5. This worked except for the infiniband packages, which were skipped:

New system @ Centos 6.6:

  • Reconfigured RAID. All the 2TB drives are now in slots 1-9 and configured for a RAID6, 14TB raid.
    • Slots 10,11,12 will be hot-spare, and 2x passthrough.
      • The passthrough are for: slot11 - Can contain Home directory drive when Gourd is being upgraded. slot12 -- OldSys a 1TB drive with the old Centos 5.5 system.
    • There are 3 volumes on the RAID: "system" ~ 100GB, "system2" ~100GB, "data1"
    • Remaining 12 slots will be filled with high density new drives for another RAID6
  • Restarted web server.
  • Setup fail2ban

Installing new RPMs on Nodes

  • The Centos DVDs are installed at /net/data/node10/RHEL/Centos-6.6
  • This dir is added to the c6-media in /etc/yum.repos.d/CentOS-Media.repo
  • Install packages with: yum --disablerepo \* --enablerepo c6-media install package_name

More recently, Endeavour is now a proxy server and yum on the nodes is setup to make use of the proxy. Thus "yum update" simply works.

Upgrading Endeavour nodes

Upgrade to Centos 6 started March 17, 2015 with node2:

  • Reboot Node2 from a USB key with Centos6 distribution installed. Chose "minimal install"
    • Note: Should have added scp, i.e openssh-client stuff. Added this "by hand" by using from endeavour: cat openssh-client-... | ssh node2 "cat - > openssh-client.rpm" and then installing that rpm.
  • SSH into the system
  • Copy the Centos ISO to node2 with scp. Mount on /mnt/centos
  • Install packages with: yum --disablerepo \* --enablerepo c6-media install
  • List of old package installed are in ~root/new_packages.txt with the distribution and package version stripped already. From this list, the packages were filtered into "installed" and "available" with yum. From the resulting list of "available" only the x86_64 and noarch packages were installed.
  • A number of config tweaks needed.
  • Nodes @ Centos 6.6
    • nodes: 2,3,7, 11, 13

Done

To Do

  • Configure MPI
    • Later. I'm not sure anyone is using this right now.
  • Reconfigure Ganglia