Gourd

From Nuclear Physics Group Documentation Pages
Revision as of 20:16, 11 December 2009 by Maurik (talk | contribs)
Jump to navigationJump to search

This is the new Gourd Hardware. The old one is here old gourd

Note: This is the page for the NEW GOURD - 8 core server from Microway, which will serve EINSTEIN as a VM.

The previous einstein hardware is described in the previous page for Einstein at old einstein

New Microway Server

The new einstein is a 2 quad-CPU server in a 2U rackmount chassis put together nicely by Microway. It arrived at UNH on 11/24/2009. The system has an Areca RAID card with ethernet port and an IPMI card with ethernet port. The motherboard is from Super Micro. Details need to be put here!

  • IP address UNH: 132.177.88.75 (currently VLAN)
  • IP address Farm: 10.0.0.252
  • IP address RAID: 10.0.0.152
  • IP address IPMI: 10.0.0.151

Setting Up the Server

Important things to remember before this system takes on the identity of Einstein

  1. The ssh fingerprint of the old einstein needs to be imported.
  2. Obviously, all important data needs to be moved: Home Directories, Mail, DNS records, ... (what else?)
  3. Fully test functionality before switching!

Configurations Needed

  1. RAIDs need to be setup on Areca card.
  2. Mail system needs to be setup
  3. Webmail needs to be setup. Uses Apache?
  4. DNS needs to be setup.
  5. Backup needs to be made to work.
  6. rhn_config - I tried this but our subscriptions seem messed up. (Send message to customer support 11/25/09)

Initialization

Server arrived 11/24/2009, was unpacked, placed in the rack and booted on 11/25/2009.

Initial configuration steps are logged here:

  • Initial host name is gourd (gourd.unh.edu) with eth0 at 10.0.0.252 and eth0.2 (VLAN) at 132.177.88.75
  • The ARECA card is set at 10.0.0.152. The password is changed to the standard ARECA card password, user is still ADMIN.
  • The IPMI card was configured using the SuperMicro ipmicfg utility. The net address is set to 10.0.0.151. Access is verified by IPMIView from Taro. The grub.conf and inittab lines are changed so that SOL is possible at 19200 baud.
  • The LDAP configuration is copied from Taro. This means it is currently in client ldap mode, and needs to be change to an ldap server before production. You can log in as yourself.
  • The autofs configuration is copied from Taro. The /net/home and /net/data directories work.
  • The sudoers is copied from Taro, but it does not appear to work - REASON: pam.d/system-auth
  • Added "auth sufficient pam_ldap.so use_first_pass" to /etc/pam.d/system-auth - now sudo works correctly.

Disks and Raid Configuration

Current Disk usage Estimates for Einstein:

Mail (/var/spool): approx. 30GB
Home Folders (/home): approx. 122GB
Virtual Machines (/data/VMWare on Taro):  approx. 70GB
LDAP Database (/var/lib/ldap): approx. 91MB


Disk sizes in the following tables are based roughly on these current usage estimates with plenty of extra space to grow. They can be adjusted as appropriate to better suit our needs, and to make these designs more cost effective.


Proposed Configuration 1

This configuration is designed to modularize storage and keep related data on separate mirrors. Could be useful if we have a failover system, such as the old Einstein hardware, because individual components ( Mail, home folders, etc ) could be relocated physically to another machine in the event of some failure rather than copying large amounts of data over the network. This design could be modified to use fewer drives by storing Virtual machines either on the /var array or the /home array, opening up the bays to store spare drives.

Drive Bay Raid Type Contents Volume Size Disk Size
1 Raid 1 Operating System ( / ) 250 GB 250 GB
2 250 GB
3 Raid 1 Mail/LDAP ( /var ) 250 GB 250 GB
4 250 GB
5 Raid 1 Home Folders 500 GB 500 GB
6 500 GB
7 Raid 1 Virtual Machines
Data
250 GB 250 GB
8 250 GB



Proposed Configuration 2

This configuration provides a larger amount of contiguous storage space than the previous design with redundancy provided by either a Raid 6 or Raid 5 array. Raid 5 would provide more usable storage, but Raid 6 will withstand more than one disk failure. It may be preferable to err on the side of caution with our user's data and use a Raid 6 for home folders.

Drive Bay Raid Type Contents Volume Size Disk Size
1 Raid 1 Operating System ( / ) 250 GB 250 GB
2 250 GB
3 Raid 1 Mail/LDAP ( /var ) 250 GB 250 GB
4 250 GB
5 Raid 5 or
Raid 6
Home Folders
Virtual Machines
Other data
1000 GB (Raid6)
1500GB(Raid5)
500 GB
6 500 GB
7 500 GB
8 500 GB



Proposed Configuration 3

This configuration would create one large data store for all user and system data, which could be stored on separate appropriately sized partitions. This design is less modular, but uses fewer drives than previous designs while leaving bays open to store spares in the event of a drive failure.

Drive Bay Raid Type Contents Volume Size Disk Size
1 Raid 1 Operating System ( / ) 250 GB 250 GB
2 250GB
3 Raid 6 Home Folders ( /home )
Mail/LDAP ( /var )
Virtual Machines
1000 GB 500 GB
4 500 GB
5 500 GB
6 500 GB
7 None Empty / Spare Drive 0 250/500
8 None Empty / Spare Drive 0 250/500



Proposed Configuration 4

Considering that "large storage" is both dangerous and inflexible, and we really don't want to have a large volume for /home or /var/spool/mail, the following configuration may actually be ideal. We use only RAID1 for the mail storage spaces, so that there is always the option of breaking the RAID and using one of the disks in another server for near instant failover. This needs to be tested for Areca RAID1. We also need to consider that a RAID card has "physical drives", "Volume Set" and "Raid Set". The individual physical drives are grouped into a "Volume Set". This volume set is then partitioned into "Raid Sets", and the Raid Sets are exposed to the operating system as disks. These disks can then (if you insist, as you do for the system) partitioned by the operating system using parted or LVM into partitions, which will hold the filesystems.

We only need 4 of the drive bays to meet our code needs. The other 4 drive bays can hold an additional storage of less critical data, exta VMs, a hot spare, and an empty, which can be filled with a 1 TB drive and configured like a "Time Machine" to automatically backup the /home and VMs to, so that this system no longer depends on Lentil for its core backup. (Just an idea for the future.)

Drive Bay Disk size Raid Set Volume Set Volume Raid Level Volume Size Partitions
1 250 GB Set 1 Volume 1 Raid1 250 GB System: (/, /boot, /var, /tmp, /swap, /usr, /data )
2 > 250GB
3 750 GB Set 2 Volume 2 Raid1 500 GB Home Dirs: /net/home
Volume 3 Raid1 100 GB Var: Mail and LDAP
4 750 GB Volume 4 Raid1 150 GB Virtual Machines: Einstein/Roentgen/Corn
5 750 GB Set 3 Volume 4 Raid1 250 GB Additional VM space
6 750 GB Volume 4 Raid1 500 GB Additional Data space
7 750 GB Hot Swap
8 None Empty