Difference between revisions of "Lentil"

From Nuclear Physics Group Documentation Pages
Jump to navigationJump to search
m (Drive letter fix.)
 
(32 intermediate revisions by 4 users not shown)
Line 1: Line 1:
 
== General Information ==
 
== General Information ==
Lentil performs [[backups]]. Its backup script needs further investigation to determine exactly how it works.
+
Lentil performs [[backups]]. Its autofs is configured to mount harddrives labeled npg-daily-XX onto /mnt/npg-daily/XX where XX is a label number. Its backup script needs further investigation to determine exactly how it works.
  
Hostnames: <code>lentil.unh.edu</code>, <code>lentil.farm.physics.unh.edu</code>
+
As of Feb. 15, 2015 Lentil is mounted with the following which will last us awhile:
 +
*/dev/sdd1 mounted at /mnt/npg-daily/51; A filled 2TB hard drive
 +
*/dev/sda1 mounted at /mnt/npg-daily/52; A 4TB hard drive
 +
*/dev/sdc1 mounted at /mnt/npg-daily/53; A 4TB hard drive
 +
*/dev/sdb1 as the root file system. Don't hot swap this.
 +
 
 +
== Hardware Information ==
 +
  Motherboard: Asus P5QL-CM
 +
    Specifications: [http://nuclear.unh.edu/wiki/pdfs/motherboards/16744.pdf Specifications]
 +
    User Manual: [http://nuclear.unh.edu/wiki/pdfs/motherboards/E4411_P5QL-CM V2.pdf Users Manual]
 +
  Memory: 2 GB DDR2
  
== Network Configuration ==
 
Currently connected to the unh and farm networks via the switch and VLAN.
 
=== ifconfig ===
 
<pre>
 
eth0      Link encap:Ethernet  HWaddr 00:23:54:BC:70:F1 
 
          inet addr:10.0.0.250  Bcast:10.0.0.255  Mask:255.255.255.0
 
          inet6 addr: fe80::223:54ff:febc:70f1/64 Scope:Link
 
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
 
          RX packets:30404 errors:0 dropped:0 overruns:0 frame:0
 
          TX packets:1778 errors:0 dropped:0 overruns:0 carrier:0
 
          collisions:0 txqueuelen:1000
 
          RX bytes:3723593 (3.5 MiB)  TX bytes:227139 (221.8 KiB)
 
          Interrupt:66 Base address:0x6000
 
  
eth0.2    Link encap:Ethernet  HWaddr 00:23:54:BC:70:F1 
+
== Authentication ==
          inet addr:132.177.88.254  Bcast:132.177.91.255  Mask:255.255.252.0
+
Lentil authenticates against the LDAP server running on Einstein, by connecting to einstein.farm.physucs.unh.edu using sssd.
          inet6 addr: fe80::223:54ff:febc:70f1/64 Scope:Link
+
Previously, Lentil went on the UNH network to einstein.unh.edu, but this is blocked (I think by ip-tables). The farm network is the better choice anyhow.
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
 
          RX packets:28826 errors:0 dropped:0 overruns:0 frame:0
 
          TX packets:1032 errors:0 dropped:0 overruns:0 carrier:0
 
          collisions:0 txqueuelen:0
 
          RX bytes:3068357 (2.9 MiB)  TX bytes:114723 (112.0 KiB)
 
  
lo        Link encap:Local Loopback 
+
== Network Configuration ==
          inet addr:127.0.0.1  Mask:255.0.0.0
+
Currently connected to farm networks via our switch and a direct port to UNH network.
          inet6 addr: ::1/128 Scope:Host
+
Note: Previously, lentil went through the switch and a VLAN network. New network policy at UNH makes this not possible.
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
 
          RX packets:99 errors:0 dropped:0 overruns:0 frame:0
 
          TX packets:99 errors:0 dropped:0 overruns:0 carrier:0
 
          collisions:0 txqueuelen:0
 
          RX bytes:9335 (9.1 KiB)  TX bytes:9335 (9.1 KiB)
 
  
virbr0    Link encap:Ethernet  HWaddr 00:00:00:00:00:00 
 
          inet addr:192.168.122.1  Bcast:192.168.122.255  Mask:255.255.255.0
 
          inet6 addr: fe80::200:ff:fe00:0/64 Scope:Link
 
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
 
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
 
          TX packets:54 errors:0 dropped:0 overruns:0 carrier:0
 
          collisions:0 txqueuelen:0
 
          RX bytes:0 (0.0 b)  TX bytes:10754 (10.5 KiB)
 
 
<pre>
 
<pre>
=== /etc/sysconfig/network-scripts/ifcfg-farm ===
+
eth0
<pre># Intel Corporation 82541GI Gigabit Ethernet Controller
+
  Hostname: lentil.farm.physics.unh.edu
DEVICE=eth0
+
  HWaddr 00:30:48:63:BB:40
BOOTPROTO=none
+
  IP:10.0.0.250
#HWADDR=00:0E:0C:4C:E1:52
+
eth1
ONBOOT=yes
+
  Hostname: lentil.unh.edu
DHCP_HOSTNAME=lentil.unh.edu
+
  HWaddr 00:30:48:63:BB:41
IPADDR=10.0.0.250
+
  IP:132.177.88.254
NETMASK=255.255.255.0
 
TYPE=Ethernet
 
USERCTL=no
 
IPV6INIT=no
 
PEERDNS=yes
 
</pre>
 
 
 
=== /etc/sysconfig/network-scripts/ifcfg-unh ===
 
<pre>VLAN=yes
 
# Please read /usr/share/doc/initscripts-*/sysconfig.txt
 
# for the documentation of these parameters.
 
GATEWAY=132.177.88.1
 
TYPE=Ethernet
 
DEVICE=eth0.2
 
#HWADDR=00:0e:0c:4c:e1:52
 
BOOTPROTO=none
 
NETMASK=255.255.252.0
 
IPADDR=132.177.88.254
 
ONBOOT=yes
 
USERCTL=no
 
IPV6INIT=no
 
PEERDNS=yes
 
 
 
</pre>
 
 
 
=== /etc/sysconfig/network-scripts/ifcfg-lo ===
 
<pre>DEVICE=lo
 
IPADDR=127.0.0.1
 
NETMASK=255.0.0.0
 
NETWORK=127.0.0.0
 
# If you're having problems with gated making 127.0.0.0/8 a martian,
 
# you can change this to something else (255.255.255.255, for example)
 
BROADCAST=127.255.255.255
 
ONBOOT=yes
 
NAME=loopback
 
 
</pre>
 
</pre>
  
Line 95: Line 39:
  
 
== Backup Configuration ==
 
== Backup Configuration ==
=== /etc/rsync-backup.conf ===
+
===Location of Backups ===
<pre># Backups are 'pull' only. Too bad there isn't a better way to enforce this.
+
<pre>
read only      = yes
+
/mnt/npg-daily-current
 +
/mnt/npg-daily/xx/
 +
</pre>
 +
All backup related scripts are:
 +
  /etc/auto.npg-daily
 +
  /usr/local/bin/rsync_backup.py
 +
  /etc/cron.daily/0rsync_backup
 +
  /usr/sbin/vgcfgbackup
 +
  /etc/rsync-backup.conf
  
# Oh for the ability to retain CAP_DAC_READ_SEARCH, and no other. 
 
#uid            = root
 
# XXX There seems to be an obscure bug with pam_ldap and rsync whereby
 
# getpwnam(3) segfaults when (and only when) archiving /etc.  Using a numeric
 
# uid avoids this bug.  Only verified on Fedora Core 2.
 
uid            = 0
 
 
# There's not much point in putting the superuser in a chroot jail
 
# use chroot    = yes
 
 
# This isn't really an effective "lock" per se, since the value is per-module,
 
# but there really ought never be more than one, and it would at least
 
# ensure serialized backups.
 
max connections = 1
 
 
[usr_local]
 
        path    = /usr/local
 
        comment = unpackaged software
 
 
[opt]
 
        path    = /opt
 
        comment = unpackaged software
 
 
[etc]
 
        path    = /etc
 
        comment = conf files
 
 
[var]
 
        path    = /var
 
        comment = user and system storage
 
</pre>
 
 
== SNMP Configuration ==
 
== SNMP Configuration ==
 
   /etc/snmp/snmpd.conf
 
   /etc/snmp/snmpd.conf
 
   Copied from [[Pepper]].
 
   Copied from [[Pepper]].
== Backup Location ==
+
== Smartd Configuration ==
/mnt/npg-daily/34/
+
The configuration file is at /etc/smartd.conf. The smartd.conf does a silent check, which only emails reports if the SMART health status comes back as failed. This smartd.conf will look different from alot of the other computers because it doesn't have a RAID card installed, so each disk is mounted seperately for backups.
== Hardware Information ==
+
 
  Motherboard: Asus P5QL-CM
+
[[SMARTD]] Smartd setup and configuration
    Specifications: [http://nuclear.unh.edu/wiki/images/7/72/16744.pdf Specifications]
+
== rc.local Configuration ==
    User Manual: [http://nuclear.unh.edu/wiki/images/7/72/E4411_P5QL-CM V2.pdf Users Manual]
+
This script is modified to run commands when the system is done powering on.
  Memory: 2 GB DDR2
 
  Wake On Lan Command: sudo ether-wake 00:1e:4f:9b:13:90
 
  
 +
This will send the boot.log to npg-admins everytime the pc is started.
 +
  mail -s "$HOSTNAME Started, Here is the boot.log" npg-admins@physics.unh.edu < /var/log/boot.log
 +
== If Lentil isn't sending e-mails ==
 +
Sometimes after a reboot Lentil won't send its regular e-mail reports. To fix this you simply need to restart sendmail. Be aware that it has saved all of those messages it didn't send, and once sendmail is working you'll get all of them at once.
 +
== Wake On LAN ==
 +
This is used so we can shutdown the server and remotely turn it back on.
 +
Wake On Lan Command:
 +
  sudo ether-wake 00:1e:4f:9b:13:90
 
== Fixes ==
 
== Fixes ==
 
*Kernel Crash Fix (2/24/2009)
 
*Kernel Crash Fix (2/24/2009)
 
**[[Hardware Issues History]]
 
**[[Hardware Issues History]]
 +
 +
*Hard Drive Enclosure Replacement (12/19/2009)
 +
**[[Hardware Issues History]]
 +
'''Important Note:'''
 +
  If this appears while booting:
 +
    request_module: runaway loop modprobe binfmt-464c
 +
  This is an indication that a drive (in the supermicro hot swap bay) is plugged in that
 +
  can't be mounted, like a drive with a software raid setup on it, so just pull the drive
 +
  and reboot and it should boot properly.

Latest revision as of 15:11, 12 March 2015

General Information

Lentil performs backups. Its autofs is configured to mount harddrives labeled npg-daily-XX onto /mnt/npg-daily/XX where XX is a label number. Its backup script needs further investigation to determine exactly how it works.

As of Feb. 15, 2015 Lentil is mounted with the following which will last us awhile:

  • /dev/sdd1 mounted at /mnt/npg-daily/51; A filled 2TB hard drive
  • /dev/sda1 mounted at /mnt/npg-daily/52; A 4TB hard drive
  • /dev/sdc1 mounted at /mnt/npg-daily/53; A 4TB hard drive
  • /dev/sdb1 as the root file system. Don't hot swap this.

Hardware Information

 Motherboard: Asus P5QL-CM
   Specifications: Specifications
   User Manual: V2.pdf Users Manual
 Memory: 2 GB DDR2


Authentication

Lentil authenticates against the LDAP server running on Einstein, by connecting to einstein.farm.physucs.unh.edu using sssd. Previously, Lentil went on the UNH network to einstein.unh.edu, but this is blocked (I think by ip-tables). The farm network is the better choice anyhow.

Network Configuration

Currently connected to farm networks via our switch and a direct port to UNH network. Note: Previously, lentil went through the switch and a VLAN network. New network policy at UNH makes this not possible.

eth0
  Hostname: lentil.farm.physics.unh.edu
  HWaddr 00:30:48:63:BB:40
  IP:10.0.0.250
eth1
  Hostname: lentil.unh.edu
  HWaddr 00:30:48:63:BB:41
  IP:132.177.88.254

Access Configuration

 /etc/security/access.conf
 Any valid user can log into lentil from any machine on the Internet.

Backup Configuration

Location of Backups

/mnt/npg-daily-current
/mnt/npg-daily/xx/

All backup related scripts are:

 /etc/auto.npg-daily
 /usr/local/bin/rsync_backup.py
 /etc/cron.daily/0rsync_backup
 /usr/sbin/vgcfgbackup
 /etc/rsync-backup.conf

SNMP Configuration

 /etc/snmp/snmpd.conf
 Copied from Pepper.

Smartd Configuration

The configuration file is at /etc/smartd.conf. The smartd.conf does a silent check, which only emails reports if the SMART health status comes back as failed. This smartd.conf will look different from alot of the other computers because it doesn't have a RAID card installed, so each disk is mounted seperately for backups.

SMARTD Smartd setup and configuration

rc.local Configuration

This script is modified to run commands when the system is done powering on.

This will send the boot.log to npg-admins everytime the pc is started.

 mail -s "$HOSTNAME Started, Here is the boot.log" npg-admins@physics.unh.edu < /var/log/boot.log

If Lentil isn't sending e-mails

Sometimes after a reboot Lentil won't send its regular e-mail reports. To fix this you simply need to restart sendmail. Be aware that it has saved all of those messages it didn't send, and once sendmail is working you'll get all of them at once.

Wake On LAN

This is used so we can shutdown the server and remotely turn it back on. Wake On Lan Command:

 sudo ether-wake 00:1e:4f:9b:13:90

Fixes

Important Note:

 If this appears while booting:
   request_module: runaway loop modprobe binfmt-464c
 This is an indication that a drive (in the supermicro hot swap bay) is plugged in that 
 can't be mounted, like a drive with a software raid setup on it, so just pull the drive 
 and reboot and it should boot properly.