Change Notes
Dynamic list of change notes for
ChangeControl policy.

NOTE: This is not working as intended yet. Daniel 25/6/08 (will fix this week, I am trying a few options).
- WORKFLOW variables are not being expanded. Even after setting the plugin order.
- This worked at SAPAC
- Location of set WORKFLOW = makes no difference
- Expandvariables otherwise seems to work even with included topics
See:
VarSEARCH,
FormattedSearch,
TablePlugin
|
ChangeNote200806-002 | 30 Jun 2008 - 17:10 | ---+!! arcs.org.au host downtime
Description
A Xen server running on the VPAC network requires downtime for a switchover. The current physical host has reached its limit in terms of available memory and therefore a new server has been allocated to replace it with more resources. This host is used primarily for VPAC servers (including our corporate web site and staff services), but also hosts the arcs.org.au virtual machine.
Estimated Duration
Approximately 1 hour - should be much less.
Systems/Services Affected
The following services are hosted on arcs.org.au, and will be unavailable during the downtime:
- rt.arcs.org.au
- wiki.arcs.org.au
- pfc.org.au
- goc.arcs.org.au
- gocdev.arcs.org.au
- www.arcs.org.au
- ARCS Jabber server
DNS and CA services are hosted on a difference machine.
Staff Responsible
AndyBotting,
SamMorrison
Detailed Instructions
The existing server will be shut down and disconnected from the fibre channel storage unit. The new server will be installed and reconnected to the fibre channel unit, and booted up. The Xen Dom0 is installed on the fibre channel unit, so no transfer of data is required, limiting the required down time.
Testing Procedures
Boot the Dom0 and start all virtual machines. Ping all the virtual machines and test all services hosted by them. In terms of ARCS infrastructure, visiting the web sites hosted, and connecting to the Jabber server.
Back-out Procedures
If anything should go wrong and it cannot be fixed within the allocated time frame, the old server can be reinstated.
Review
25/06/08 sent to techstaff list. Change date to Monday 7th July 8am to avoid Questnet conference.
30/06/08 discussed at systems services meeting.
Schedule/Notification
8am EST 7 July 08
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200806-003 | 07 Jul 2008 - 17:06 | ---+!! Cronjob for SRB Zone Sync
Description
This note describes the steps necessary to implement a cronjob for SRB user federation on a ARCS SRB server.
Estimated Duration
5 minutes
Systems/Services Affected
SRB. at iVEC, eRSA, UQ, TPAC, ANUSF
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon
Detailed Instructions
as root:
run
vim /etc/logrotate.d/srbzonesync
and paste
/usr/srb/data/log/zonesync.log {
daily
rotate 8
compress
missingok
}
save and exit vim.
as srb server user:
run
crontab -e
and add line
30 */2 * * * cd ~ && /usr/bin/Szonesync.pl -u >> /usr/srb/data/log/zonesync.log 2>&1
Testing Procedures
Monitor /usr/srb/data/log/zonesync.log for a number of days to ensure proper logging.
Back-out Procedures
as srb server user:
run
crontab -e
and remove line
30 */2 * * * cd ~ && /usr/bin/Szonesync.pl -u >> /usr/srb/data/log/zonesync.log 2>&1
as root:
remove file
/etc/logrotate.d/srbzonesync
Review
Looks good to me. --
StephenMcMahon - 30 Jun 2008
Looks fine. --
PaulineMak - 1 Jul 2008
Good to me too. --
ShundeZhang - 01 Jul 2008
Schedule/Notification
no downtime required.
Tracking completion
iVEC: done.
UQ: Done.
SAPAC: Done.
TPAC: Done.
ANU: Done.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
--
FlorianGoessmann - 30 Jun 2008
|
|
ChangeNote200806-004 | 12 Sep 2008 - 11:40 | ---+!! Gridpulse and MDS updates including SIP
Description
Upgrade gridpulse (to do certificate checks and use correct ARCS email address).
Upgrade MDS with SIP to extract software for the grid from APAC Software Map.
Estimated Duration
Less than 1 hour per site. A time frame of less than 2 weeks will be given for all sites to apply the change.
Making more software information available and correcting existing information will be an ongoing task.
Systems/Services Affected
MDS at all sites - Few problems expected, additional information will be provided by MDS about software.

Warning: It is possible during the upgrade, that a client tool - such as GGC or Grisu may read incorrect information from the central index. The index is updated every 10min :-
- this could be because of upgrade errors (unlikely)
- or because the current static software information is being merged with new dynamic data from the software map. More software may be listed as available but may not be 100% correct yet. Sites will aim to have all software published correctly soon after the upgrade. MdsScoreCard will be updated to measure success of software information.
Staff Responsible
Daniel/Gerson, Leigh, Jason, Andy, Joel, Will, Darran, Youzhen, Vlad, Andrey
Detailed Instructions
Upgrade RPMs and MIP as described in Systems Trac
ReleaseNotes/080401
Once software is being published correctly, the static information in
/usr/local/mip/modules/default/SubCluster/softwareInfoData/localSoftware.xml can be removed. Test and double check that this does not affect the MIP information being published - minor changes to Software Map may still be required.
5/5/08 Will
- Check if
/opt/vdt/globus/etc/globus_wsrf_mds_index/server-config.wsdd is owned by daemon, otherwise the file will need correct ownership for globus to start successfully.
-
chown daemon:daemon /opt/vdt/globus/etc/globus_wsrf_mds_index/server-config.wsdd
- not needed for eRSA 11/8/08
11/8/08 Daniel
- slightly changed SIP download example to make it quiet and prevent mail from cron to root user
Testing Procedures
Check SIP, see:
Trac notes
Check MIP output:
/usr/local/mip/mip
Check via Globus WSRF (from a machine with Globus client tools and your proxy credentials), eg:
wsrf-query -s https://ng2.sapac.edu.au:8443/wsrf/services/DefaultIndexService "//*[local-name()='Site']"
Back-out Procedures
Backup
/usr/local/mip first.
Try and work out issues with Gerson, if data is not completely wrong.
Otherwise if MIP is not working at all or providing completely wrong data it will end up in the central index after approx 10min and may then affect users. Restore the backup and try again on a development machine first.
Review
Release notes, RPMs reviewed by Darran and Gerson in April 08.
30/06/08 Send to techstaff@, discussed at systems services meeting.
1/07/08 Darran - suggest adding warnings. Daniel - actually there could be issues if SIP not working. Instructions to be improved.
14/07/08 Daniel - instructions fixed to describe using wget and cron instead.
17/07/08 Vlad - notified that the preuninstall scriptlet would disable MIP, globus code. With step 4. (mip-globus-config), the procedure is now ready to roll.
5/08/08 Will - file ownership not set correctly after MIP rpms upgrade, added extra info in detailed instruction.
Schedule/Notification
31/7/08 Developers list because of possible risk of disruption to GGC and Grisu users.
Notification will not be sent per site, but once on approving this change.
A "good news" announcement will be sent once all sites are updated and have some software information being published.
Tracking completion
All sites should upgrade the software, but there may be a few exceptions (NZ) who are not using the software map.
| Site |
Date Completed or Comments |
| eRSA |
Completed 11/08/08 |
| TPAC |
Completed 08/2008 |
| VPAC |
New gridpulse installed 2/07/08 |
| AC3 |
|
| CSIRO HPSC |
Change completed, tested and working 01/08/08 |
| ESSCC |
Completed 5/08/08 |
| QUT |
|
| UQ |
Completed 5/08/08 |
| JCU |
Completed 5/08/08 |
| IVEC |
Completed 12/08/08 |
| ANU |
Completed 11/08/08 |
| Canterbury |
|
| Aukland |
|
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200806-005 | 28 Jul 2008 - 15:14 | ---+!! MyProxy Changeover
Description
Move and upgrade MyProxy service (from v3.6 to v4.2) onto a new machine, change to ARCS domain/host name.
NOTE: Backwards Compatible
Estimated Duration
4 days (over a weekend) - no actual downtime expected.
DNS TTL 30min
Systems/Services Affected
Grid submission hosts and tools (Grisu/Grix) run by end users.
User might upload a proxy to old server, DNS records then change and when they go to download the proxy it's not there (as on the old box).
Staff Responsible
SamMorrison, ANU (Jonathan)
Detailed Instructions
Stage 1
-
Set up myproxy2.arcs.org.au
-
Test
-
Set up a cron job to copy all proxies from the old machine onto the new machine
-
Change APAC DNS records shown below
-
Change ARCS DNS records to point myproxy.arcs.org.au, myproxy1.arcs.org.au to -> myproxy2.arcs.org.au
-
Test
Stage 2 - To happen at a later date
- set up myproxy1.arcs.org.au at Queensland ?
- Test
- Set up unison to sync the data dirs
- Test
- Change ARCS DNS records as shown below
DNS Changes
INITIAL ARCS RECORDS (Stage 1)
myproxy A 202.158.218.233
myproxy1 A 202.158.218.233
myproxy2 A 202.158.218.233
Final ARCS DNS RECORDS (stage 2)
myproxy A IP ADDRESS 1
A 202.158.218.233
myproxy1 A IP ADDRESS 1
myproxy2 A 202.158.218.233
APAC DNS RECORDS
myproxy CNAME myproxy.arcs.org.au
myproxy1 CNAME myproxy1.arcs.org.au
myproxy2 CNAME myproxy2.arcs.org.au
Things that Grid Admins need to do
Can be changed any time and are not absolutely necessary yet. A separate Change Note will be created before the old addresses are eventually removed.
Command line clients
Command line client - myproxy-logon etc. uses an environment variable to determine the server to speak to.
You can check this by doing:
env | grep MYPROXY
This needs to be myproxy.arcs.org.au and is normally set by scripts in
/opt/vdt/post-setup loaded by
/etc/profile.d/vdt_setup.{sh,csh}.
Other tools that use MyProxy
Need to change the host in their code. Already done for Grisu, but not Grix (30/6/08).
Testing Procedures
Try and create/get a myproxy proxy using command line tools.
Run the following on command line :-
-
make sure myproxy env is set as stated above
myproxy-init
myproxy-logon
myproxy-info
Back-out Procedures
- Leave the old myproxy machines running after the change
- Change the DNS records back
Review
30/06/08 Daniel - minor changes to explanation, would be nice to have actual IP addresses listed above and details of upgrade versions inc host ...
15/07/08 Joel - Tested with new myproxy server (myproxy2.arcs.org.au), everything works fine
17/07/08 Daniel - confirm possible problems with DNS changing on a user once created a proxy
Schedule/Notification
30/06/08 Announcement to techstaff@ and developers@ by Sam
Tracking completion
10/06/08 Successfully tested creating a proxy on old myproxy, copying it to new one and downloading
17/07/08 Ticket 1585 - problems with
CSIRO firewall and new address!
State of Play
myproxy2.arcs.org.au given the old myproxy servers IP address to address firewall issues. New server running fine, need to start thinking about stage 2 and building another myproxy server.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200807-001 | 14 Jul 2008 - 20:52 | ---+!! TITLE
Description
This note describes the steps necessary to add timestamps to the output of Szonesync.pl.
Estimated Duration
30 seconds
Systems/Services Affected
SRB. at iVEC, eRSA, UQ, TPAC, ANUSF
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon
Detailed Instructions
as root:
run
vi /usr/bin/Szonesync.pl
and add two lines (line 3 and 4) to function
runCmd (at the bottom of file) so that it looks like
sub runCmd {
my($option, $cmd) = @_;
($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst)=localtime(time);
printf "[%4d-%02d-%02d %02d:%02d:%02d]",$year+1900,$mon+1,$mday,$hour,$min,$sec;
print "running: $cmd\n";
$cmdOutput=`$cmd`;
Testing Procedures
as srb server user:
run
Szonesync.pl -u
and you'll see something like
[2008-07-09 11:58:58]running: Spullmeta -F GET_CHANGED_USER_INFO 1997-01-01 > ngdev2.its.utas.edu.au.GET_CHANGED_USER_INFO 2> ngdev2.its.utas.edu.au.Spullmeta.stderr
[2008-07-09 11:58:58]running: Spullmeta -F GET_CHANGED_USER_INFO 1997-01-01 > ngdev2.its.utas.edu.au.GET_CHANGED_USER_INFO.orig
Timestamps are added to the beginning of lines where Spullmeta, Spushmeta or /usr/bin/ZoneUserSync.py is executed.
Back-out Procedures
as root:
run
vi /usr/bin/Szonesync.pl
and delete the following lines from function
runCmd (at the bottom of file)
($sec,$min,$hour,$mday,$mon,$year,$wday,$yday,$isdst)=localtime(time);
printf "[%4d-%02d-%02d %02d:%02d:%02d]",$year+1900,$mon+1,$mday,$hour,$min,$sec;
Review
tested. works. --
FlorianGoessmann - 09 Jul 2008
Looks good to me. Don't have a test server so will wait for the downtime to implement.
StephenMcMahon - 09 Jul 2008
It works. --
KaiLu - 11 Jul 2008
Schedule/Notification
no downtime required.
Tracking completion
SAPAC: Done.
UQ: Done.
ANU: Done.
StephenMcMahon - 11 Jul 2008
iVEC: done.
FlorianGoessmann
TPAC: done.
PaulineMak - 14 Jul 2008
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200807-002 | 25 Jul 2008 - 15:16 | ---+!! New GTest Cert for Inca test
Description
This note describes the steps necessary to add a new GTest Cert to inca user. The new GTest cert is owned by Will, who is now the main Inca person. Gerson's GTest cert will continue to be used but as a backup.
Estimated Duration
30 seconds
Systems/Services Affected
SRB. at iVEC, eRSA, UQ, TPAC, ANUSF
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon
Detailed Instructions
as srb server user:
echo \"/C=AU/O=APACGrid/OU=QCIF/CN=William Hsu GTest\" inca@<YOUR DOMAIN> >> /etc/grid-security/grid-mapfile.srb
run
Sinit
SmodifyUser addDN inca <YOUR_DOMAIN> '/C=AU/O=APACGrid/OU=QCIF/CN=William Hsu GTest'
Testing Procedures
as srb server user:
run
SgetU -D inca
and you will see two certs like this
-----------------------------------------------------------------
user_name: inca
domain_desc: ngspare.sapac.edu.au
distin_name: /C=AU/O=APACGrid/OU=QCIF/CN=William Hsu GTest
auth_scheme: GSI_AUTH
zone_id: ngspare.sapac.edu.au
-----------------------------------------------------------------
user_name: inca
domain_desc: ngspare.sapac.edu.au
distin_name: /C=AU/O=APACGrid/OU=SAPAC/CN=Gerson Galang GTest
auth_scheme: GSI_AUTH
zone_id: ngspare.sapac.edu.au
Back-out Procedures
as srb server user:
delete line
"/C=AU/O=APACGrid/OU=QCIF/CN=William Hsu GTest" inca@srb.ivec.org
from
/etc/grid-security/grid-mapfile.srb
run
SmodifyUser delDN inca <YOUR_DOMAIN> '/C=AU/O=APACGrid/OU=QCIF/CN=William Hsu GTest'
Review
all good. --
FlorianGoessmann - 18 Jul 2008
Schedule/Notification
Tracking completion
iVEC: done. --
FlorianGoessmann - 18 Jul 2008
TPAC: done. --
PaulineMak - 18 Jul 2008
UQ: Done. --
KaiLu - 18 Jul 2008
ANU: Done. --
StephenMcMahon - 24 Jul 2008
SAPAC: Done. --
ShundeZhang - 25 Jul 2008
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200807-003 | 27 Nov 2008 - 14:38 | ---+!! Cron Job for SRB Log Rotation
Description
This note describes the steps necessary to create a cron job for SRB log rotation
Estimated Duration
5 minutes
Systems/Services Affected
SRB. at iVEC, eRSA, UQ, TPAC, ANUSF
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon
Detailed Instructions
As root:
run
cd /usr/bin
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/SlogsCleanup.py?format=raw -O SlogsCleanup.py
chmod +x SlogsCleanup.py
Make sure that you have installed the python interpreter on your system
As SRB server user:
Run
crontab -e
and add line
15 5 1 * * /usr/bin/SlogsCleanup.py -k 5 -d >> /usr/srb/data/log/logsCleanup.log 2>&1
The cron job will be run at 5:15 AM on the first of every month
Testing Procedures
Check if the script can be excuted
As SRB server user:
run
/usr/bin/SlogsCleanup.py -l
It works if a list of current SRB logs files are displayed (sorted by date)
Back-out Procedures
as srb server user:
run
crontab -e
and remove line
15 5 1 * * /usr/bin/SlogsCleanup.py -k 5 -d >> /usr/srb/data/log/logsCleanup.log 2>&1
as root:
remove file
rm -rf /usr/bin/SlogsCleanup.py
Review
looks good. --
FlorianGoessmann - 22 Jul 2008
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200808-002 | 07 Aug 2008 - 10:11 | ---+!! RT username and email address changes
Description
Two issues have arisen with the ARCS RT system over time:
- Current usernames are not consistent and it is not easily identifiable who owns a username, especially when assigning ownership of a ticket to another user.
- When emailing a comment to an RT ticket, the email address sending the email must match that of the registered user. This a problem with institution email addresses used for registration, then email sent using the @arcs.org.au email alias.
To fix this, each user account (for ARCS staff) will need to have the username changed to the ARCS current standard of
firstname.lastname, and the email address of that user to
firstname.lastname@arcs.org.au. This will most likely be done by hand.
Estimated Duration
It is anticipated that the change should take no longer than 1 hour to complete. This will include time for testing. During this time, the service will remain unaffected, but users will need to be notified that if their original username does not work, then to try the new ARCS standard username.
Systems/Services Affected
ARCS RT System:
http://rt.arcs.org.au
Staff Responsible
AndyBotting
Detailed Instructions
Take a backup of the database, then one by one, change each of the ARCS staff usernames to
firstname.lastname and email address to
firstname.lastname@arcs.org.au.
Testing Procedures
Ask RT users to:
- log in using new username.
- test commenting on tickets via email, using their ARCS email address
Back-out Procedures
Restore the RT database from a backup taken prior to changes.
Review
- 06/08/2008 Daniel - no need to wait an extra week since there will be no actual downtime or affect on ARCS users. Change procedure to cover ARCS staff only
Schedule/Notification
- Sent an initial email on Thu 7th August to the Techstaff list. No need to notify other lists since only ARCS technical staff will be changed.
- Make an announcement at the Monday 11th meeting
- Send out an email on the morning of Tuesday 12 August, prior to the change
- Send out an email confirming the change has taken place
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200808-003 | 10 Sep 2008 - 11:48 | ---+!!
MyProxy? Change
Description
MyProxy? Cert CN Change and reverse DNS change
Estimated Duration
30 secs for cert change, up to 24 hours for DNS propagation
Systems/Services Affected
All myproxy clients - biggest user of myproxy is INCA
Staff Responsible
SamMorrison
Detailed Instructions
Change the certificate CN of myproxy2.arcs.org.au to myproxy.arcs.org.au
change the reverse DNS for IP 202.158.218.205 from myproxy2.arcs.org.au to myproxy.arcs.org.au
This is to try and fix an issue where clients complain that the forward lookup of myproxy aka. myproxy.arcs.org.au and the certificate (myproxy2.arcs.org.au) don't match. (It shouldn't care and only check the reverse lookup(currently matches the certificate)
See also
http://wiki.arcs.org.au/bin/view/Main/ChangeNote200806-005
Testing Procedures
Using myproxy clients
see
http://wiki.arcs.org.au/bin/view/Main/ChangeNote200806-005#Testing_Procedures
Back-out Procedures
Change DNS Back although this could take another 24hours to propergate
Backup old cert/key
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200808-005 | 12 Sep 2008 - 14:09 | ---+!! TITLE
Host OS upgrade for
CSIRO grid gateway apacgridgw.hpsc.csiro.au
Description
Current operating system on host apacgridgw.hpsc.csiro.au is to be upgraded from CentOS 4.4 to CentOS 5.2
This will be a complete wipe of the current system and a fresh install of the new OS.
All VM's will be backed up (simple tar backup of the given volume) to cherax.hpsc.csiro.au and can then be extracted to new logical volumes once the host OS is setup.
Current host setup is currently being backed up using rsync (the whole thing, just in case) to cherax.hpsc.csiro.au.
Estimated Duration
1 Day (just to be safe in case anything goes wrong)
Services themselves should not be down for more than a couple of hours.
Systems/Services Affected
- CSIRO grid gateway apacgridgw.hpsc.csiro.au
- ng2.hpsc.csiro.au
- ngdata.hpsc.csiro.au
- nggums.hpsc.csiro.au
- ngportal.hpsc.csiro.au
- ng2dev.hpsc.csiro.au
Logs show little recent use, by: Terry Rankine, Will Hsu and Megan Hough/Monash.
Staff Responsible
Joel Ludbey-Bruhwel
Detailed Instructions
Stage 1
- Backup current VM's
- sudo tar -cvzpf /some/dir/backup.tar.gz --same-owner --atime-preserve /mounted/volume/
- Backup current state of host
Stage 2
- Wipe current host
- Install CentOS 5.2 CentOS 5.2 host install
- Configure network and authentication settings
- /etc/hosts, /etc/resolv.conf, /etc/nsswitch.conf, /etc/sysconfig/network-scripts/ifcfg-eth0, /etc/krb5.conf, /etc/pam.d/system-auth
- Restore all VM's
Stage 3
- Test that VM's are up and running as expected
- Install any third party software needed/desired by the host
Testing Procedures
- ng2, submit some test jobs from globus client Test Suite
- nggums, ensure that https://nggums.hpsc.csiro.au:8443/gums/ is up, attempt to map a known user, perform an update of members database.
- ngdata, again submit a few test jobs from globus client Test Suite
- ngportal, at this point in time this is not being used or working to my understanding. Possibly could be removed but will leave it be for the time being.
- ng2dev, currently not doing anything important. Provided that it boot's it should be fine.
Back-out Procedures
- Wipe the new install of CentOS5.2 and revert back to previous version (4.4)
- Restore backups of the host using rsync backups
- Restore all VM's
Review
26/08/08 Daniel - a suggestion to reduce risk and prevent relying on a single backup
- backup as suggested
- install new OS root volume into an LV on the existing disks (without clobbering anything)
- setup grub to easily boot between old and new OS
Schedule/Notification
Developers list 28/8/08 11:40AM
Tracking completion
29/08/08 - Host upgraded successfully.
Issues starting ng2, ngdata and nggums.
04/09/08 - Finally got one of the VM's to boot, seemed to be some issue with kernel libraries on the host (????, not sure what) and needed to create custom ramdisk for VM's. Sam's notes helped but there were some slight differences to get stuff to work here.
VM's however, do not have any network access
08/09/08 - Finally solved all networking issues, xen was bridging a connection to the wrong interface, this has now been fixed.
udev doesn't start, but this is an expected issue.
Also noticed that on odd occassions, when starting a VM it will get about halfway through the boot process and then throws up a kernel panic.
No consistency with this, it appears to be random. Restarting the VM usually fixes the issue, looking around on the internet this is apparently a 'known' issue with running centOS4 VM's under a centOS5 host.
This shouldn't be a problem provided that ng2, ngdata and nggums are not restarted.
If this issue does occur, simply kill the consol, run "xm destroy guest_name" and re-create the guest.
Should then boot without issue.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200808-006 | 10 Sep 2008 - 12:21 | ---+!! New ng2 for ESSCC cluster Savanna
Description
ESSCC has now commissioned the new cluster
Savanna. The old cluster will be turned off, hopefully by the end of 2008 to free up UPS capacity. This is a good opportunity to build a new ng2 to interact with new cluster.
References: ARCS Ticket 1694.
Estimated Duration
Less than 1 hour to rename and test, but allocating 1 Day to be safe in case anything else goes wrong.
Systems/Services Affected
ng2.esscc.uq.edu.au will be offline for a short period and then be connected to Savanna instead of ESS.
This upgrade will prevent jobs from being sent via Grid to the old cluster
ESS. Recent logs show only test jobs, so we don't expect this to cause any problems for users - but it will provide a benefit for those that need to use the new cluster.
If there is a need, ARCS could setup the old gateway with a different name to access ESS.
Staff Responsible
Will Hsu
Detailed Instructions
Build one CentOS 5 VM, configured to use new ESSCC authentication and home filesystem.
Boot up new ng2 under different dns name/ip and install PBSpro client and vdt software.
Modify nggums to use new ESSCC authentication.
Shutdown old ng2. Keep a backup!
Rename new ng2 to old ng2 according to this
How to Upgrade NG2 document.
Configure globus, MDS/MIP.
Test job submission.
Testing Procedures
submit some test jobs from globus client to ng2
Test Suite
Back-out Procedures
Shutdown new ng2
Revert changes to nggums
Boot up old ng2
Review
2/09/08 Daniel - discuss options with Ryan and ask for input
10/09/08 Daniel
- home file system should have nothing to do with GUMS?
- CentOS4 is currently approved OS but we expect that the instructions work properly with CentOS5. This will be a good chance to test on a machine that is not currently receiving any real jobs
- please try installing a new ng2 with a different name, and then rename following these instructions http://projects.arcs.org.au/trac/systems/wiki/HowTo/UpgradeNg2. To confirm that this process works and provides less down time
Schedule/Notification
Separate email discussion with Ryan, CC Terry and Darran on 2/09/08. They are happy to start using the new cluster - code will need to be recompiled for different architecture.
10/09/08 Update ticket 1694. Send a message to developers list.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200809-002 | 27 Nov 2008 - 15:09 | ---+!! Improving performance of MCAT with Vacuum and Indexing
Description
This note describes the steps necessary to run vacuum and index for first time and implement a cronjob to run weekly.
Estimated Duration
5 - 10 Minutes
Systems/Services Affected
SRB. at iVEC, eRSA, UQ, TPAC, ANUSF, ac3, VPAC
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon,
SridharReddapani,
GrahamJenkins
Detailed Instructions
Login as user srb,
Run
mkdir /usr/srb/bin/VacuumIndexFiles
cd /usr/srb/bin/VacuumIndexFiles
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/Vacuum_Index_First?format=raw -O Vacuum_Index_First
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/Vacuum_Index_Cron?format=raw -O Vacuum_Index_Cron
psql -h <Srb server name> -d MCAT -U srb -f /usr/srb/bin/VacuumIndexFiles/Vacuum_Index_First >> /usr/srb/data/log/vacuum_index_first.log 2>&1
Above command will take a while depends on size of your MCAT database
crontab -e
add this line to end of crontab( run every sunday at 1:15am).
15 1 * * 0 psql -h <Srb server name> -d MCAT -U srb -f /usr/srb/bin/VacuumIndexFiles/Vacuum_Index_Cron >> /usr/srb/data/log/vacuum_index_cron.log 2>&1
Testing Procedures
Check /usr/srb/data/log/vacuum_index_first.log for any errors and check /usr/srb/data/log/vacuum_index_cron.log next monday morning.
Back-out Procedures
as srb user:
run
crontab -e
and remove line
15 1 * * 0 psql -h <Srb server name> -d MCAT -U srb -f /usr/srb/bin/VacuumIndexFiles/Vacuum_Index_Cron >> /usr/srb/data/log/vacuum_index_cron.log 2>&1
remove directory
rm -rf /usr/srb/bin/VacuumIndexFiles
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200809-003 | 19 Aug 2009 - 13:22 | ---+!! Configuring SRB for Automatic Account Creation
Description
This note describes the steps necessary to enable automatic account creation
Estimated Duration
30 minutes
Systems/Services Affected
SRB at iVEC, eRSA, UQ, TPAC, ANUSF,
CSIRO, AC3, VPAC
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon,
SridharReddapani,
GrahamJenkins,
GarethWilliams
Detailed Instructions
As the srb user, run:
cd /usr/srb/bin/
mkdir autoUserCreationScripts
cd autoUserCreationScripts
wget "http://projects.gridaus.org.au/trac/systems/browser/trunk/dataFabricScripts/addDomain.sh?format=raw" -O addDomain.sh
wget "http://projects.gridaus.org.au/trac/systems/browser/trunk/dataFabricScripts/addGroup.sh?format=raw" -O addGroup.sh
wget "http://projects.gridaus.org.au/trac/systems/browser/trunk/dataFabricScripts/addUser.sh?format=raw" -O addUser.sh
wget "http://projects.gridaus.org.au/trac/systems/browser/trunk/dataFabricScripts/dnToUserDomain.py?format=raw" -O dnToUserDomain.py
chmod 700 *
Then edit the file:
/usr/srb/bin/autoUserCreationScripts/dnToUserDomain.py
and change line 25 to match your institution's OU and your server's domain
domains = {
'TPAC':'srb.tpac.org.au',
}
Then edit the file:
/usr/srb/data/shibConfig
And replace the entire file with the following lines:
# The full path to the dn to user/domain script
dnToUserDomainScript=/usr/srb/bin/autoUserCreationScripts/dnToUserDomain.py
# The full path to the add user script
addUserScript=/usr/srb/bin/autoUserCreationScripts/addUser.sh
# The full path to the add group script
addGroupScript=/usr/srb/bin/autoUserCreationScripts/addGroup.sh
# The full path to the add domain script
addDomainScript=/usr/srb/bin/autoUserCreationScripts/addDomain.sh
# The OID of the certificate extension
shibCertOid=1.2.3.4.5.6.7.8.9
# The group separator in the certificate extension
shibGroupSeparator=;
As root, run:
cd /etc/grid-security
wget http://www.arcs.org.au/slcs/arcs-slcs-ca.tar.gz -O - | tar xvz
cd arcs-slcs-ca
wget "http://projects.arcs.org.au/trac/slcs-client/attachment/wiki/WikiStart/663bbd41.0?format=raw" -O 663bbd41.0
wget "http://projects.arcs.org.au/trac/slcs-client/attachment/wiki/WikiStart/663bbd41.namespaces?format=raw" -O 663bbd41.namespaces
wget "http://projects.arcs.org.au/trac/slcs-client/attachment/wiki/WikiStart/663bbd41.signing_policy?format=raw" -O 663bbd41.signing_policy
chown root:root *
chmod 644 *
cp * /etc/grid-security/certificates
To make the certificates are retained between certificate updates, edit the file
/opt/vdt/vdt/etc/vdt-update-certs.conf and add the following lines:
include=/etc/grid-security/arcs-slcs-ca/1ed4795f.0
include=/etc/grid-security/arcs-slcs-ca/1ed4795f.namespaces
include=/etc/grid-security/arcs-slcs-ca/1ed4795f.signing_policy
include=/etc/grid-security/arcs-slcs-ca/663bbd41.0
include=/etc/grid-security/arcs-slcs-ca/663bbd41.namespaces
include=/etc/grid-security/arcs-slcs-ca/663bbd41.signing_policy
Restart the SRB server.
Testing Procedures
Create a new SRB connection to your SRB server using Shibboleth as the authentication method. This should create a valid connection. If a connection is not created, check your SRB log (/usr/srb/data/logs) for error messages.
Note: Shibboleth authentication
will NOT work if you are behind a proxy - this is because the
SLCS-client java library must be repackaged with your proxy username and password.
Back-out Procedures
As root, remove the installed certificates:
cd /etc/grid-security/certificates
rm -f 1ed4795f.*
rm -f 663bbd41.*
rm -rf /etc/grid-security/certificates/arcs-slcs-ca
Then remove the user creation scripts directory:
rm -rf /usr/srb/bin/autoUserCreationScripts/
Edit the file:
/opt/vdt/vdt/etc/vdt-update-certs.conf
And remove these lines:
include=/etc/grid-security/arcs-slcs-ca/1ed4795f.0
include=/etc/grid-security/arcs-slcs-ca/1ed4795f.namespaces
include=/etc/grid-security/arcs-slcs-ca/1ed4795f.signing_policy
include=/etc/grid-security/arcs-slcs-ca/663bbd41.0
include=/etc/grid-security/arcs-slcs-ca/663bbd41.namespaces
include=/etc/grid-security/arcs-slcs-ca/663bbd41.signing_policy
Restart the SRB server
Review
--
FlorianGoessmann - 19 Sep 2008
Done for ANU. --
StephenMcMahon - 17 Nov 2008
Schedule/Notification
Tracking completion
- UQ: Done. Not yet tested because our Idp does not work now. -- KaiLu - 19 Sep 2008
- ANU: Also done and not tested for the same reason as Kai. -- StephenMcMahon - 22 Sep 2008
- VPAC: Done. Not yet tested. -- GrahamJenkins - 22 Sep 2008
- ac3: Done. Not tested for same reason. -- SridharReddapani - 22 Sep 2008.
- CSIRO: Done. -- GarethWilliams - pre 27 Nov 2008
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200811-001 | 27 Nov 2008 - 15:10 | ---+!! slcs1 certificates for automatically created accounts
Description
Estimated Duration
30 minutes
Systems/Services Affected
SRB at iVEC, eRSA, UQ, TPAC, ANUSF,
CSIRO, AC3, VPAC
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon,
SridharReddapani,
GrahamJenkins,
GarethWilliams
Detailed Instruction
If you have not installed the script, please see this change note
http://wiki.arcs.org.au/bin/view/Main/ChangeNote200809-003
and follow the instructions before applying this change note.
As the srb user and move the existing dnToUserDomain.py:
cd /usr/srb/bin/autoUserCreationScripts
mv dnToUserDomain.py dnToUserDomain.py.orig
Then download the new file and make sure it has executable permission for the srb user:
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/dnToUserDomain.py?format=raw -O dnToUserDomain.py
chmod 700 dnToUserDomain.py
Edit line 80 of dnToUserDomain.py to include IdPs that your server will accept (ac3 will need to add AC3 and
ANSTO, while VPAC will need to add VPAC and ARCS OpenIdP), e.g.
Where the first value is 'O' part of the slcs1 DN, and the the second value is the name of your SRB domain.
domains = {
'VPAC':'srb.vpac.org',
'ARCS OpenIdP':'srb.vpac.org'
}
Since we're only using the slcs1 certificates, we can remove the slcstest CA certificates.
Become the root user:
cd /etc/grid-security/certificates/
rm 663bbd41.0
rm 663bbd41.namespaces
rm 663bbd41.signing_policy
cd /etc/grid-security/arcs-slcs-ca/
rm 663bbd41.0
rm 663bbd41.namespaces
rm 663bbd41.signing_policy
Then edit
/opt/vdt/vdt/etc/vdt-update-certs.conf
and
remove the following lines:
include=/etc/grid-security/arcs-slcs-ca/663bbd41.0
include=/etc/grid-security/arcs-slcs-ca/663bbd41.namespaces
include=/etc/grid-security/arcs-slcs-ca/663bbd41.signing_policy
Restart the SRB server.
Testing Procedures
Attempt to connect to your SRB server using Hermes 1.0rc8. This version uses slcs1 certificates.
Back-out Procedures
Get the previous version of the script and overwrite the dnToUserDomain.py script.
As the srb user:
cd /usr/srb/bin/autoUserCreationScripts
wget 'http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/dnToUserDomain.py?rev=718&format=raw' -O dnToUserDomain.py
and change line 25 to match your institution's OU and your server's domain
domains = {
'TPAC':'srb.tpac.org.au',
}
Then add the slcstest certificates.
As the root user:
cd /etc/grid-certificates/arcs-slcs-ca/
wget "http://projects.arcs.org.au/trac/slcs-client/attachment/wiki/WikiStart/663bbd41.0?format=raw" -O 663bbd41.0
wget "http://projects.arcs.org.au/trac/slcs-client/attachment/wiki/WikiStart/663bbd41.namespaces?format=raw" -O 663bbd41.namespaces
wget "http://projects.arcs.org.au/trac/slcs-client/attachment/wiki/WikiStart/663bbd41.signing_policy?format=raw" -O 663bbd41.signing_policy
Then edit the file
/opt/vdt/vdt/etc/vdt-update-certs.conf
and
insert the following lines:
include=/etc/grid-security/arcs-slcs-ca/663bbd41.0
include=/etc/grid-security/arcs-slcs-ca/663bbd41.namespaces
include=/etc/grid-security/arcs-slcs-ca/663bbd41.signing_policy
Restart the SRB server.
Review
Schedule/Notification
Tracking completion
- ac3/Intersect: Done. SridharReddapani - 17/11/2008
- TPAC: Done. -- PaulineMak - 17/11/2008
- VPAC: Done. -- GrahamJenkins - 17/11/2008 (both srb.vpac.org and srbdev.vpac.org)
- ANU: Done. -- StephenMcMahon - 17 Nov 2008. Highlighted an issue with the NCI National Facility IDP but that is now working.
- eResearch SA: Done. -- ShundeZhang - 20 Nov 2008 (both srb.sapac.edu.au and ngspare.sapac.edu.au)
- CSIRO: Done -- GarethWilliams - pre 27 Nov 2008
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200811-002 | 02 Dec 2008 - 10:32 | ---+!! Installing data fabric usage scripts
Description
This will install data fabric usage scripts at individual sites.
Estimated Duration
10 minutes
Systems/Services Affected
SRB at iVEC, eRSA, UQ, TPAC, ANUSF,
CSIRO, AC3, VPAC
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon,
SridharReddapani,
GrahamJenkins,
GarethWilliams
Detailed Instructions
As the root user:
Install postgres module for Python, run:
yum install postgresql-python.i386
As the srb user, run:
Create and enter the directory:
mkdir /usr/srb/bin/usageScripts
cd /usr/srb/bin/usageScripts
Download the required scripts:
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/usageScripts/usageFromMCAT.py?format=raw -O usageFromMCAT.py
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/usageScripts/UsageStatsUpload.sh?format=raw -O UsageStatsUpload.sh
Make these scripts executable:
chmod 700 *
Make a directory locally to store XML output files generated by the scripts, e.g.
mkdir xml
If you have folders under
/projects please make sure these folders are owned by a group. Without this setting, nothing will be collected by the usage script for that folder. You can change the ownership of a folder like so:
Schmod o <groupname> groups <folder name>
Setup the cron jobs crontab -e, then add the following line:
1 0 * * * /usr/srb/bin/usageScripts/UsageStatsUpload.sh -d /usr/srb/bin/usageScripts/xml/
Testing Procedures
To do a quick test to see if it's working, run:
/usr/srb/bin/usageScripts/UsageStatsUpload.sh -d /usr/srb/bin/usageScripts/xml/
You should see XML files in the local directory /usr/srb/bin/usageScripts/xml/ and also on the data fabric, under /<zone name>/projects/dataFabricStats/.
Note that the full access permission for the XML files under this collection has been granted to
srbAdmin@srb.hpcu.uq.edu.au (the user that will read the XML files using the processing scripts) automatically when the script is run.
Back-out Procedures
as the srb user:
run
crontab -e
and remove line
1 0 * * * /usr/srb/bin/usageScripts/UsageStatsUpload.sh -d /usr/srb/bin/usageScripts/xml/
remove directory
rm -rf /usr/srb/bin/usageScripts
rm -rf /usr/srb/data/log/useLog
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200811-003 | 02 Dec 2008 - 12:16 | ---+!! Davis for SRB
Description
Davis is ready for beta test. please install it on your production box and use it!
Estimated Duration
15-30 mins
Systems/Services Affected
All
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon,
SridharReddapani,
GrahamJenkins,
GarethWilliams
Detailed Instructions
Ensure your https port (443) is opened to the public.
See
http://projects.arcs.org.au/trac/davis/wiki/HowTo/Install#Jetty6standalone and
http://projects.arcs.org.au/trac/davis/wiki/HowTo/Configuration
Testing Procedures
See
http://projects.arcs.org.au/trac/davis/wiki#HowtoUse
Back-out Procedures
Stop Davis; Delete davis home folder, /etc/init.d/jetty.sh, /etc/default/jetty
Review
Had to
chmod +x /etc/init.d/davis
Location of
jetty.xml was not specified. Exact fields to change were not specified either.
--
StephenMcMahon - 27 Nov 2008
I've modified the doc. --
ShundeZhang - 02 Dec 2008
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200811-004 | 02 Dec 2008 - 10:41 | ---+!! Fixing bugs in dnToUserDomain.py script
Description
Update dnToUserDomain.py script with some bug fixes
Estimated Duration
10 minutes
Systems/Services Affected
SRB at iVEC, eRSA, UQ, TPAC, ANUSF,
CSIRO, AC3, VPAC
Staff Responsible
FlorianGoessmann,
ShundeZhang,
KaiLu,
PaulineMak,
StephenMcMahon,
SridharReddapani,
GrahamJenkins,
GarethWilliams
Detailed Instructions
If you have not installed the script, please see this change note
http://wiki.arcs.org.au/bin/view/Main/ChangeNote200809-003
and follow the instructions before applying this change note.
As the srb user:
cd /usr/srb/bin/autoUserCreationScripts/
mv dnToUserDomain.py dnToUserDomain.py.old
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/dnToUserDomain.py?format=raw -O dnToUserDomain.py
chmod 700 dnToUserDomain.py
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/DNCONFIG.py?format=raw -O DNCONFIG.py
Then modify DNCONFIG.py to match your institution's OU and your server's domain.
domains = {
'TPAC':'srb.tpac.org.au',
}
Testing Procedures
Try logging in with Hermes, using both shibboleth and GSI. You should be able to log in with your existing account(s). Then try running dnToUserDomain.py with other DNs, either using slcs or APACGrid certificates, e.g.
./dnToUserDomain.py '/DC=au/DC=org/DC=arcs/DC=slcs/O=TPAC/CN=Pauline Mak 78ISwLgFjnpuw14IZ4X8XYbIp-4'
or
./dnToUserDomain.py '/C=AU/O=APACGrid/OU=TPAC/CN=Pauline Mak'
Make sure your DN matches something in the DNCONFIG.py file. The script should print out something with a username@domain. If the username is too long, then it will be shortened to something like a.b.surname@domain.
Back-out Procedures
As the srb user:
cd /usr/srb/bin/autoUserCreationScripts/
mv dnToUserDomain.py.old dnToUserDomain.py
rm DNCONFIG.py
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200812-002 | 22 Dec 2008 - 14:27 | ---+!! TWiki Upgrade from 4.1 to 4.2.4
Description
A cross site scripting vulnerability have been discovered, and a TWiki upgrade is required for security reasons.
Estimated Duration
One day
Systems/Services Affected
All ARCS hosted TWiki services, including the ARCS Wiki, Exec Wiki and all other webs created for research groups.
Staff Responsible
AndyBotting,
LevLafayette
Detailed Instructions
A copy of the 4.2.4 TWiki code has been installed, parallel with the existing 4.1 TWiki code, and data has been syncronised between the two via a manual method. The new site will be available as
http://wikitest.arcs.org.au from now, up to the change-over time. During this time, it is encouraged that ARCS staff test the site as much as possible.
Once the change-over commenced, the existing TWiki will be marked as read-only, to prevent updates while the data is being migrated to the new TWiki. Once completed, the Apache will be configured to use the new TWiki directory.
Testing Procedures
As much testing will be done as possible before the switch, but due to the complex nature of TWiki installations, it is expected that some minor issues may arise, which will be worked through.
Back-out Procedures
If any major issues arise during the change-over, we can switch back to the old code-base as it should remain untouched.
Review
Schedule/Notification
An email was sent out on 16th Dec 2008 to all registered TWiki users outlining the upgrade procedure.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200903-001 | 01 Apr 2009 - 03:28 | ---+!! Apache configuration change on MARCS IdPs
Description
We are experiencing some problems when the
SLCS server opens a connection to an IdP to resolve the Shibboleth Artifact. It has been determined that a minor change to the Apache configuration on the Idp will stop this error from occurring. There will be a small performance hit, but the reliability of the IdP is deemed more important that the performance. We will also take this opportunity to replace the schacCountryOfResidence with the l (locality) attribute.
Estimated Duration
10 minutes.
Systems/Services Affected
Authentication services provided by the IdPs.
Staff Responsible
DarranCarey,
GarthDenley,
LeighGordon,
WillHsu,
JoelLB,
SamMorrison,
PaulWarren,
SimonYin
Detailed Instructions
Make a backup copy of /etc/httpd/conf.d/ssl.conf before proceeding
The file /etc/httpd/conf.d/ssl.conf should have any existing entries of the form:
SSLSessionCache shmcb:/var/cache/mod_ssl/scache(512000)
commented out, and a new line should be added:
SSLSessionCache none
The Apache server will need to be restarted for the configuration change to take affect.
Make a backup copy of /usr/local/shibboleth-idp/etc/resolver.ldap.xml and /usr/local/shibboleth-idp/etc/arps/arp.site.xml before proceeding
Edit /usr/local/shibboleth-idp/etc/resolver.ldap.xml on the IdP, and in the
<StaticDataConnector id="staticLibraryEPE">
section change:
<Attribute name="schacCountryOfResidence">
to
<Attribute name="l">
and comment out or remove the section:
<Attribute name="urn:mace:terena.org:schacCountryOfResidence">
<AnyValue release="permit"/>
</Attribute>
Tomcat will need to be restarted.
Testing Procedures
Verify that you can authenticate against an IdP by visiting
https://slcs1.arcs.org.au/SLCS/login
Back-out Procedures
If for some reason you cannot authenticate against an IdP, restore the file /etc/httpd/conf.d/ssl.conf, and restart the Apache server. Perform the testing procedures again.
Review
Schedule/Notification
The changes are scheduled for Monday the 23rd of March, An email notification will be sent to all affected users on Monday 16th of March, informing them of the change.
Tracking completion
PaulWarren - completed for NCI NF
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200903-002 | 03 Apr 2009 - 09:47 | ---+!! Migrating ARCS mail from VPAC to Google Apps
Description
ARCS has decided to move it's mail to a Google App domain. This means that mail intended for ARCS staff will be stored in a separate mailbox, as opposed to forwarding the mail to the staff member's institution email account.
Estimated Duration
The process should take anywhere from a few hours to a few days.
Systems/Services Affected
All systems relying on mail:
- Staff email
- Mailing lists (mailman)
- Help desk tickets (RT)
Staff Responsible
Detailed Instructions
VPAC is currently the MX host for arcs.org.au, and this will change to Google. While this is good for staff mailboxes, it makes it is inconvenient for other systems which rely on mail to @arcs.org.au. This includes the mailing lists and RT system.
To overcome this, we need to create Google mailing lists on the Google App domain to forward mail from the @arcs.org.au domain to @lists.arcs.org.au domain, which will be hosted at VPAC to process mailing lists and RT messages. The mail server at VPAC will then be configured to handle mail from @lists.arcs.org.au.
For information about configuring mail clients for POP/IMAP and SMTP to their Google Apps mail, you can see the documentation by Google at
http://mail.google.com/support/a/arcs.org.au/bin/topic.py?topic=12773
Basic connection information:
Incoming Mail (IMAP) Server
Outgoing Mail (SMTP) Server
- Server: smtp.gmail.com
- Username: firstname.lastname@arcs.org.au
- Use Authentication: Yes
- Use STARTTLS: Yes (some clients call this SSL)
- Port: 465 or 587
Testing Procedures
Once completed, it will take some time for the DNS MX records to be propagated. Once it is done, we will need to test staff email addresses, mailing lists and RT queues.
Back-out Procedures
DNS records will be reverted back to VPAC and the system will carry on as it was before.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200904-001 | 11 Jun 2009 - 14:13 | ---+!! Migrate ARCS Plone Service to VPAC
Description
Migration of an existing system to new infrastructure. Purpose is to strategically position the Plone infrastructure in a more centrally connected MARC and leverage the systems provided by the Data Services team.
Estimated Duration
One hour. Which is the time taken for DNS entry to expire.
Systems/Services Affected
www.arcs.org.au
Staff Responsible
RussellSim (Plone Migration)
AndyBotting (DNS change)
Detailed Instructions
- Turn off Zope service on zope9.cluster on the JCU cluster
- Turn off Zeo service on zeo2.cluster on the JCU cluster
- Change Zeo Database to READ-ONLY
- Turn on Zeo service on zeo2.cluster on the JCU cluster
- Turn on Zope service on zope9.cluster on the JCU cluster
- Copy Data file from zeo2.cluster to db-vpac.arcs.org.au at VPAC.
- Turn on Zeo service at VPAC
- Turn on Zope service (plone.arcs.org.au) at VPAC
- Change ARCS website cache service url in portal_squidtool
- Adjust DNS entry from:
www.arcs.org.au. 3600 IN CNAME plone.jcu.edu.au.
to
www.arcs.org.au. 3600 IN CNAME plone.arcs.org.au.
Test
Testing Procedures
- Local data access.
- Check DNS entry.
Back-out Procedures
- Point DNS entry back at JCU and
- Turn infrastructure back on at JCU.
- Test Plone service is operating within expected paramaters at JCU.
- Wait for DNS entry to expire.
- Test DNS entry.
Review
Schedule/Notification
15th June, Notification sent on 8th June
Tracking completion
RT Ticket
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-001 | 07 May 2009 - 12:58 | ---+!! Migration of critical grid infrastructure virtual machines at VPAC
Description
Currently we have 9 virtual machines being hosted on hardware which is out of warranty. With newer hardware set up and running, we are now in a position to migrate these virtual machines over.
The machines in question are:
- grid.vpac.org
- hudson.vpac.org (for Grisu/Grix builds)
- ng2.vpac.org
- ng2dev.vpac.org
- myproxy2.arcs.org.au
- nggums.vpac.org
- ngportal.vpac.org
- ngportaldev.vpac.org
- vomrs.arcs.org.au
Estimated Duration
2 hours for the whole procedure, but only about 20 minutes per machine.
Systems/Services Affected
Most of these virtual machines are infrastructure for the grid computing at VPAC, but vomrs.arcs.org.au and myproxy2.arcs.org.au are critical for grid jobs ARCS wide.
During the migration, these machines in order will be taken down for up to 20 minutes, affecting all grid jobs.
Staff Responsible
AndyBotting,
SamMorrison
Detailed Instructions
The migration procedure is:
- Create a new VM container on either arcs-vh01.vpac.org or arcs-vh02.vpac.org, using the virt-install command.
- Shut down the VM running on xen-d.vpac.org
- Using SCP, copy the VM image from the old box, to the new
- While the copying is taking place, get the MAC address out of the Xen config file
- Using virsh edit _vmname_ on the new VM host, replace the MAC address with the original
- Once the copy has completed, start the VM on the new host.
- Rinse and repeat for each of the VM's.
Testing Procedures
Boot each VM and watch the console for messages. Once it's done, SSH in and check everything is sane. Run some test grid jobs.
Back-out Procedures
Boot the VM's again on the original Xen host
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-002 | 14 May 2009 - 20:08 | ---+!! db-1.arcs.org.au database server rebuild
Description
Needs to be consistent with db-2 (currently one has 32bit OS and the other 64bit)
Estimated Duration
2 hr
Systems/Services Affected
mysql database service
Staff Responsible
Gareth Williams, Darran Carey
Detailed Instructions
Make both an explicit backup of the mysql databases files and and an extra dump intend to restore from the files - falling back to the dump, falling back to the slave, falling back to an old backup...
Phase 1 - preparation. Things to note/backup (there is some documentation at
http://projects.arcs.org.au/trac/systems/wiki/DataServices/Database - in particular the mysql install and backup sections
- copy/check script /usr/local/bin/mysql_backup.sh (Gareth)
- copy/check sudo settings (Gareth/Darran)
- copy/check iptables settings (Darran)
- copy custom mysqld service script (though I'm going to replace it) (Gareth)
- copy/note network/host settings (Darran)
- plan desired disk/OS configuration (Darran with Andy/Jim/Gareth)
- clean out old binary logs to reduce clutter (Gareth)
Phase 2 - outage pre-shutdown
- optionally lock out clients (not sure how... maybe break firewall - must be a better way) (Gareth/Darran)
- start a root mysql client and run "FLUSH TABLES WITH READ LOCK;" - leave mysql session open to preserve lock until service stopped
- run a full backup and copy to pbstore (Darran)
- stop mysqld service
- make a binary copy of /data/mysql (root tar) and copy it to pbstore (Darran)
Phase 3 - rebuild and restore
- rebuild host with 64 bit OS and disk configuration like db-2 (Darran)
- setup networking and restore ssh access (Darran)
- setup arcs-admin user and add ssh authorized_keys (from db-2 maybe with mods) (Darran)
- setup sudo for arcs-admin (Darran)
- copy full mysql backup/dump to a scratch area (Darran)
- copy binary backup to a scratch area (Darran)
- unpack binary backup to new area and sets up service (install mysql-server with yum if necessary) (Gareth)
- restore other iptables setup (for mysql access) (Darran)
- check, check, check, including replication (Gareth)
- re-setup backup as per documentation on projects site (Gareth)
- notify clients (Gareth)
No need to further document alternative mysql restore strategies at this point.
Testing Procedures
can connect to database server; databases are present and useable
Back-out Procedures
No provision is made for full restore to current state. Once the reinstall is started we are committed to building a new working platform onto which to restore the service.
Review
Schedule/Notification
clients notified 05 May 2009 - only specific database clients and directly involved staff
outage to commence 06 May 2009 12pm EST
rescheduled for 14 May 10am EST
Tracking completion
- prior to start, bcees, arcs_odl and irodsUsage were moved to db-2
- service locked at 9.55am EST
- phases 1 and 2 complete at about 10.10am EST
- base rebuild done and admin access restored at 11.53am EST
- disk reconfig and physical move within rack complete Darran's work at 1.50-pm EST
- Sridhar back in to set up postgresql at that time.
- mysql configured for new location and started by about 3pm EST but firewall needs fixes
- update sent to clients at 4.47pm EST
- firewall fixed around 7pm EST - replication recommenced
- final notification sent at 8pm EST
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-003 | 18 May 2009 - 09:43 | ---+!! TITLE
Description
Change slcs1.arcs.org.au entityID
Estimated Duration
1 hour
Systems/Services Affected
All IdP's that can access
SLCS
Staff Responsible
SamMorrison
Detailed Instructions
Change the shibboleth entityID from urn:mace:federation.org.au:testfed:slcs1.arcs.org.au to
https://slcs1.arcs.org.au/shibboleth
IdP Instructions
Make a copy of any files to be changed
*Once the changes are complete you don't need to restart tomcat or any other service
Shibboleth 1.3
Edit $SHIBBOLETH_HOME/etc/arps/arp.site.xml
Add
<Rule>
<Description>SLCS 1 New</Description>
<Target>
<Requester matchFunction="urn:mace:shibboleth:arp:matchFunction:exactShar">https://slcs1.arcs.org.au/shibboleth</Requester>
</Target>
<Attribute name="urn:mace:federation.org.au:attribute:auEduPersonSharedToken">
<AnyValue release="permit"/>
</Attribute>
</Rule>
Shibboleth2
Edit /etc/shibboleth/attribute-map.xml (could be different at each site)
<AttributeFilterPolicy>
<PolicyRequirementRule xsi:type="basic:OR">
<basic:Rule xsi:type="basic:AttributeRequesterString" value="urn:mace:federation.org.au:testfed:slcs1.arcs.org.au" />
<basic:Rule xsi:type="basic:AttributeRequesterString" value="https://slcs1.arcs.org.au/shibboleth" />
<basic:Rule xsi:type="basic:AttributeRequesterString" value="https://services.arcs.org.au/shibboleth" />
</PolicyRequirementRule>
<AttributeRule attributeID="auEduPersonSharedToken">
<PermitValueRule xsi:type="basic:ANY" />
</AttributeRule>
</AttributeFilterPolicy>
Testing Procedures
One user of each IdP needs to visit
https://slcs1.arcs.org.au/SLCS/login
Back-out Procedures
Change name back
Review
Schedule/Notification
Tracking completion
| VPAC |
Done |
| Canterbury |
Done |
| IVEC |
Done |
| TPAC |
Done |
| NCI NF |
?Not used |
| eRSA |
Done |
| ARCS |
Done |
| ANSTO |
Done |
| CSIRO |
Done |
| ac3 |
Done |
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-004 | 29 May 2009 - 15:12 | ---+!! Update iRODS User Records
Description
Update the iRODS user records so that shared tokens (where present) are stored in the 'user_info' field. This will make it possible for Shibboleth logins to be performed using Davis.
Estimated Duration
45 minutes.
Systems/Services Affected
arcs-df.ivec.org, arcs-df.vpac.org, arcs-df.ac3.edu.au, arcs-df.hpcu.uq.edu.au
Staff Responsible
Shunde Zhang, Graham Jenkins.
Detailed Instructions
- Do: iadmin lu graham.jenkins .. check that 'user_info' field contains: VI8SEdbk_8Ph3E7M1O8jdORVTF4
- Do: iadmin lu graham.jenkins1 .. check that 'user_info' field is empty.
- Do: export irodsConfigDir=/data/rods/iRODS/server/config
- Do: /data/rods/iRODS/server/bin/cmd/createUser "/C=AU/O=APACGrid/OU=VPAC/CN=Jane Doe" .. check result
- Do:
/data/rods/iRODS/server/bin/cmd/createUser "/DC=au/DC=org/DC=arcs/DC=slcs/O=TPAC/CN=Jane Doe VI8SEdbkx8Ph3E7M1O8jdABRA99"
.. check result
- Do: iadmin rmuser jane.doe ; iadmin rmuser jane.doe1
Back-out Procedures
- Change non-empty 'user-info' fields so that they contain: 1
- Change empty ones so that they contain: 0
- Do: cp /data/rods/iRODS/server/bin/cmd/createUser.PRE /data/rods/iRODS/server/bin/cmd/createUser
- Test as above
Review
Schedule/Notification
Implementation date to be advised.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-005 | 26 May 2009 - 13:22 | ---+!! ARCS Mail server migration
Description
The ARCS mail server will be migrated from its current host, to a new virtual machine.
This is for two reasons:
- The current host has a outdated OS, and upgrading is not an option.
- The current host has a hardware fault
- The version of Mailman installed is outdated, and a new version is required for fixing some issues highlighted by the AuScope group.
Estimated Duration
9am - 5pm, Saturday 6th June
Systems/Services Affected
ARCS services affected by this change are:
Staff Responsible
Detailed Instructions
- The current postfix configuration will be transferred from isis.vpac.org to lists.vpac.org.
- A new version of mailman will be installed and configured, with the archives transferred.
- The RT mailgate tool will be installed for delivering mail into RT.
Testing Procedures
Mail will be sent to RT and the postfix mailing lists to ensure they are accepting and delivering mail correctly.
RT Queues:
- Auth
- CertAlert?
- Collaboration
- CollabSprints?
- Customer
- Data
- DataSprints?
- General
- Grisu
- Grix
- Inca
- Mailman
- Phone
- Systems
- Test
- VideoCollaboration?
- WebContent?
- Wiki
Mailing lists:
- Allstaff
- Arcs_admin
- Auscope
- Auscope-dev
- Auscope-geosciml
- Auscope-geosciml-testbed
- Cloudadvice
- Community
- DataMINX?
- Developers
- Exec
- Exec_conf
- Fullmoon
- Fullmoon-dev
- Macddap-ref-group
- Managers
- Systems
- Techstaff
Back-out Procedures
If we cannot complete the migration in the current timeframe, the old mail server will be re-instated.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-006 | 26 May 2009 - 13:20 | ---+!! wiki.arcs.org.au upgrade
Description
The TWiki software hosting wiki.arcs.org.au will be upgraded to the latest version.
A cross-site scripting vulnerability has been discovered.
For more information, visit:
http://twiki.org/cgi-bin/view/Codev/SecurityAlert-CVE-2009-1339
To protect from this, TWiki will be upgraded to the current version, 4.3.1.
Estimated Duration
9-10am, Tuesday 2nd June, 2009 (1 hour)
Systems/Services Affected
http://wiki.arcs.org.au
Staff Responsible
AndyBotting
Detailed Instructions
Data will be migrated from the arcs.org.au VM to the hosting.arcs.org.au VM. Once migrated, the upgrade tarball will be extracted over the top of the exisiting TWiki installation, and tested.
If the testing proves that the upgrade was a success, then the DNS records will be switched to point to the new host.
Testing Procedures
Simple tests to ensure the the TWiki is working correctly are:
- Logging in
- Modify some pages
- Test TWiki configure script
Back-out Procedures
The current TWiki install will be kept running, so at any time we can switch back to the old version.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-007 | 28 May 2009 - 11:19 | ---+!!
SLCS Delegation Service (SDS)
Description
Update
SLCS software on slcs1.arcs.org.au to enable the SDS
Estimated Duration
1 hour
Systems/Services Affected
All services that rely on
SLCS. DataFabric, Grid.
Staff Responsible
SamMorrison
Detailed Instructions
Redeploy the
SLCS WAR file with update code. Restart Tomcat server. All this done at 8am
Testing Procedures
Use test client on plonedev.arcs.org.au
Back-out Procedures
Save a backup of old WAR file. Redploy old one if something wrong.
Review
Schedule/Notification
8am Friday. - install WAR and restart tomcat
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-008 | 09 Jun 2009 - 13:15 | ---+!! Running Davis as unprivileged davis user
Description
Running Davis as unprivileged davis user with Apache.
Estimated Duration
1 hours
Systems/Services Affected
ARCS Data Fabric
Staff Responsible
Shunde Zhang
Detailed Instructions
Stop Davis
Install Apache and Apache SSL module, configure it properly
Modify $DAVIS_HOME/etc/jetty.xml to enable ajp interface (port 8009) and disable SSL interface
<Call name="addConnector">
<Arg>
<New class="org.mortbay.jetty.ajp.Ajp13SocketConnector">
<Set name="port">8009</Set>
<Set name="ThreadPool">
<New class="org.mortbay.thread.BoundedThreadPool">
<Set name="minThreads">50</Set>
<Set name="maxThreads">500</Set>
<Set name="MaxIdleTimeMs">0</Set>
</New>
</Set>
<Set name="statsOn">false</Set>
</New>
</Arg>
</Call>
Add the following to /etc/httpd/conf.d/ssl.conf
ProxyPass /ARCS/ ajp://localhost:8009/ARCS/ flushpackets=on
RewriteEngine On
RewriteRule ^/$ /ARCS/home [R]
create user davis, change owner of davis directory to davis user.
Modify /etc/default/davis to use davis user to start Davis.
JETTY_HOME=/opt/davis/davis
JAVA_OPTIONS="-server -Xms1024m -Xmx1562m -XX:+AggressiveHeap -XX:PermSize=192m -XX:MaxPermSize=192m -XX:+UseParallelGC"
JAVA_HOME=/usr/java/jdk1.6.0_12
JETTY_USER=davis
JETTY_PID=/var/run/davis.pid
Restart Apache.
Copy dojoroot to /var/www/html
Start Davis.
Testing Procedures
Test Davis.
Back-out Procedures
Stop Apache, use Jetty for SSL
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-009 | 28 May 2009 - 11:29 | ---+!! GSI patch for iRODS
Description
Patch iRODS to fix a bug with GSI authentication.
Estimated Duration
10 mins
Systems/Services Affected
All
Staff Responsible
Shunde Zhang
Detailed Instructions
1. Replace the existing $IRODS_HOME/server/api/src/rsGsiAuthRequest.c with
https://projects.arcs.org.au/trac/davis/attachment/wiki/ReleaseNotes/0.7.0/rsGsiAuthRequest.c
2. Go to $IRODS_HOME and run make
3. restart iRODS.
Testing Procedures
Use Davis 0.7.0 for test.
Back-out Procedures
You can back up your old rsGsiAuthRequest.c before making the change, or get it from iRODS 2.0.1 tarball.
Go to $IRODS_HOME and run make
restart iRODS.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-010 | 05 Jun 2009 - 18:02 | ---+!! Upgrade TDS to use Jetty 6.1.18 and Apache
Description
Update Jetty to 6.1.18 with a few bug fixes and also security updates.
Estimated Duration
1 hour
Systems/Services Affected
OPeNDAP? (THREDDS Data Server) at TPAC, QCIF, iVEC, eRSA and Intersect
Also indirectly for the ARCS
OPeNDAP? Digital Library
Staff Responsible
Pauline Mal, Kai Lu, Florian Goessmann, Shunde Zhang, Sridhar Reddapani
Detailed Instructions
Stop TDS:
/etc/init.d/tds stop
Download and expand Jetty 6.1.18:
wget http://dist.codehaus.org/jetty/jetty-6.1.18/jetty-6.1.18.zip
unzip jetty-6.1.18.zip -d /opt/jetty-6.1.18
As
root, make symbolic link to the latest version in /opt and set that as your $JETTY_HOME:
ln -s /opt/jetty-6.1.18 /opt/tds
export JETTY_HOME=/opt/tds
usermod -d /opt/tds jetty
Copy across your previous installation:
cd /opt/jetty-6.1.15/webapps
cp -r thredds/ /opt/jetty-6.1.18/webapps/thredds/
cd /opt/jetty-6.1.15/content
mkdir /opt/jetty-6.1.18/content/
cp -r thredds/ /opt/jetty-6.1.18/content/thredds/
Change ownership of files back to the jetty user:
chown -R jetty:jetty /opt/jetty-6.1.18/
Create a new environment variable file in /etc/default/tds with the following content symbolic link:
JETTY_HOME=/opt/tds
JAVA_HOME=/usr/java/jdk1.6.0_13/jre
JAVA_OPTIONS="-server -Xmx1536m -Xms512m"
JETTY_PID=/var/run/tds.pid
JETTY_USER="jetty"
Backup and copy the start/stop script to /etc/init.d
mv /etc/init.d/tds ~/
cp /opt/jetty-6.1.18/bin/jetty.sh /etc/init.d/tds
chmod u+x /etc/init.d/tds
Modify the start/stop script to point to the new environment file.
##################################################
# See if there's a default configuration file
##################################################
if [ -f /etc/default/jetty6 ] ; then
. /etc/default/jetty6
elif [ -f /etc/default/tds ] ; then
. /etc/default/tds
fi
Make sure these variables are picked up when running the script. Running
/sbin/service/tds check
, you should get something like the following:
JETTY_HOME = /opt/tds
JETTY_CONF =
JETTY_RUN = /var/run
JETTY_PID = /var/run/tds.pid
JETTY_PORT =
JETTY_LOGS =
CONFIGS = /opt/tds/etc/jetty-logging.xml /opt/tds/etc/jetty.xml
JAVA_OPTIONS = -server -Xmx1536m -Xms512m -Djetty.home=/opt/tds -Djava.io.tmpdir=/tmp
JAVA = /usr/java/jdk1.6.0_13/jre/bin/java
CLASSPATH =
RUN_CMD = /usr/java/jdk1.6.0_13/jre/bin/java -server -Xmx1536m -Xms512m -Djetty.home=/opt/tds -Djava.io.tmpdir=/tmp -jar /opt/tds/start.jar /opt/tds/etc/jetty-logging.xml /opt/tds/etc/jetty.xml
Modify /opt/jetty-6.1.18/etc/jetty.xml to make sure you only have one"SelectChannelConnector" the server to only listen to localhost on 8008 like so:
<Call name="addConnector">
<Arg>
<New class="org.mortbay.jetty.nio.SelectChannelConnector">
<Set name="host">localhost</Set>
<Set name="port">8008</Set>
<Set name="maxIdleTime">30000</Set>
<Set name="Acceptors">2</Set>
<Set name="statsOn">false</Set>
<Set name="confidentialPort">8443</Set>
<Set name="lowResourcesConnections">5000</Set>
<Set name="lowResourcesMaxIdleTime">5000</Set>
</New>
</Arg>
</Call>
All sites apart from iVEC will not have Apache installed, so install it using yum
yum install httpd
Modify the Apache config for ajp:
Now configure the THREDDS server to connect to Apache. Create the file /etc/httpd/conf.d/tds.conf and add the following content:
<VirtualHost *:80>
ServerName opendap-your.host.name.arcs.org.au
DocumentRoot "/var/www/html"
RewriteEngine On
RewriteRule ^/$ /thredds/ [R]
ProxyPass /thredds/ http://localhost:8008/thredds/
</VirtualHost>
Now to run the server, start Jetty first, the Apache:
/sbin/service tds start
/sbin/service httpd start
Testing Procedures
This set of instruction has been tested on irodsdev.vpac.org.
When you start the server, you should get the following message:
Redirecting stderr/stdout to /opt/jetty-6.1.18/logs/2009_05_27.stderrout.log
(note that the log is in the jetty-6.1.18 directory). Also make sure that it is running as jetty by running ps. You should get something like this:
jetty 24612 67.2 16.7 1738880 87936 ? Sl 13:38 0:02 /usr/java/jdk1.6.0_13/jre/bin/java -server -Xmx1536m -Xms512m -Djetty.home=/opt/tds -Djava.io.tmpdir=/tmp -jar /opt/tds/start.jar /opt/tds/etc/jetty-logging.xml /opt/tds/etc/jetty.xml
Browse to:
http://opendap-your.host.name.arcs.org.au/thredds/catalog.html
And if you browse to
http://opendap-your.host.name.arcs.org.au/
it should redirect you to
http://opendap-your.host.name.arcs.org.au/thredds/catalog.html
You should still have the same catalog as the previous installation.
IF AND ONLY IF you're satisfied with the new installation, remove the old /opt/jetty-6.1.15 directory and ~/tds.
Back-out Procedures
To back out, stop TDS if is is not running, as
root,
Stop Apache and Jetty (if both are running)
/sbin/service httpd stop
/sbin/service tds stop
rm /etc/init.d/tds
And remove jetty 6.1.18
rm /opt/tds
rm -rf /opt/jetty-6.1.18/
Use the old startup script
mv ~/tds /etc/init.d/tds
And restart the old server:
/etc/init.d/tds start
Review
Schedule/Notification
Tracking completion
TPAC - done. Note that I had to change the Apache config to:
<VirtualHost *:80>
ServerName opendap-tpac.arcs.org.au
DocumentRoot "/var/www/html"
RewriteEngine On
ProxyPreserveHost On
ProxyRequests Off
ProxyPass /thredds http://localhost:8008/thredds
RewriteRule ^/$ /thredds/ [R]
</VirtualHost>
QCIF- done.
AC3 - done.
iVEC - done.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-011 | 28 May 2009 - 12:34 | ---+!! Upgrade Davis to 0.7.0
Description
Upgrade current Davis to 0.7.0
Estimated Duration
20 mins
Systems/Services Affected
All
Staff Responsible
Shunde Zhang
Detailed Instructions
Download 0.7.0 tarball from google code
extract it somewhere
replace all jars in $DAVIS_HOME/webapps/root/WEB-INF/lib with those from the tarball
replace jetty jars (in $DAVIS_HOME/lib) with those from the tarball
update $DAVIS_HOME/webapps/root/WEB-INF/web.xml according to the one in the tarball
restart davis
Testing Procedures
You know what to do.
Back-out Procedures
Better to back up old Davis before making the change.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-012 | 08 Aug 2009 - 10:07 | ---+!! Libtar and GSI Patches for iRODS
Description
Install iRODS-specific version of
libtar on all ARCS-DF Production servers.
libtar will enable them to process
ibun requests from clients. The iRODS-specific version is used to that files larger than 2Gb can be handled.
Also install a GSI patch required for
Davis 0.7.
Estimated Duration
60 minutes on each server
Systems/Services Affected
- Phase 1: arcs-df.vpac.org .. no service outage.
- Phase 2: arcs-df.ivec.org .. 60 minutes service outage.
- Phase 3: Other Production servers.
Staff Responsible
Graham Jenkins.
Detailed Instructions
Ref:
https://www.irods.org/index.php/Mounted_iRODS_Collection
Copy the attached
libtar-1.2.11_64bit.tar.gz file to directory:
/var/tmp on your server.
Also copy the attached
rsGsiAuthRequest.c file to the same directory.
Then proceed as shown hereunder. It's necessary to copy header files into the
$LT_HOME/compat directory because the iRODS configuration utilities expect to find them there.
service irods stop
su - rods
mkdir $IRODS_HOME/../libtar
cd $IRODS_HOME/../libtar
export LT_HOME=`pwd`
mkdir /tmp/build
cd /tmp/build
tar xf /var/tmp/libtar-1.2.11_64bit.tar.gz
cd libtar-1.2.11
./configure --prefix=$LT_HOME
make
make install
mkdir $LT_HOME/compat
cp compat/compat.h $LT_HOME/compat
cp config.h $LT_HOME/compat
cp $LT_HOME/include/* $LT_HOME/compat
cd $IRODS_HOME
vi config/config.mk config/config.mk.in
.. uncomment the line: TAR_STRUCT_FILE=1
.. set the parameter tarDir to the value of $LT_HOME (e.g. /opt/rods/libtar)
cp server/api/src/rsGsiAuthRequest.c server/api/src/rsGsiAuthRequest.c.ORI
cp /var/tmp/rsGsiAuthRequest.c server/api/src/
make clean; make
exit
service irods start
Testing Procedures
Proceed as follows:
su - rods
imkdir Test
iput /etc/group Test
iput /etc/passwd Test
ibun -R arcs-df.ivec.org -cDtar Test.tar Test .. Use the name of your own resource!
ils -l Test.tar .. check create-time and length
imkdir Test2
ibun -R arcs-df.ivec.org -x Test.tar Test2 .. Use the name of your own resource!
ils -l Test2 .. check files
irm -rf Test Test2 Test.tar
Back-out Procedures
service irods stop
su - rods
cd $IRODS_HOME
vi config/config.mk config/config.mk.in
.. comment the line: TAR_STRUCT_FILE=1
cp server/api/src/rsGsiAuthRequest.c.ORI server/api/src/rsGsiAuthRequest.c
make clean; make
exit
service irods start
Review
Phase 1 installation performed on arcs-df.vpac.org 20090529; tarfile generated by 'ibun'command contains mutilated filenames. This installation has been left in place pending resolution.
Phase 2 installation performed on arcs-df.ivac.org 20090529 .. rsGsiAuthRequest.c patch Only. Generic libtar installation installed during initial build has been left in place pending satisfactory completion of Phase 1.
Phase 3 has not been done yet.
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200905-013 | 28 May 2009 - 15:34 | ---+!! Upgrade of ng2.sf.utas.edu.au
Description
Migrate the existing ng2.sf.utas.edu.au to CentOS 5.3 and VDT 1.10.1v.
Estimated Duration
1 hour
Systems/Services Affected
Grid services at TPAC.
Staff Responsible
LeighGordon
Detailed Instructions
The existing ng2.sf.utas.edu.au will be shutdown. The new installation will be renamed and tested.
Testing Procedures
Tests 1 through 4 of the Globus 4 TestSuite located here
http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite will be performed. If they all succeed, this installation will be put into production.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated hour, then this installation will be turned off, and the previous production installation will be brought back online.
Review
Schedule/Notification
Notifications are to be sent to the Community, Developers, and Techstaff mailing lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-001 | 02 Jun 2009 - 10:50 | ---+!! Updating user information stored in usage statistics data base
Description
An additional script needs to be installed in order to fetch the sharedToken of each user from the Table r_user_main of ICAT (Postgres) and write them to Table users of usage statistics DB – irodsUsage (
MySQL)
This will make it possible to get usage statistics of individual user to be displayed while logging into services.arcs.org.au.
Estimated Duration
15 Mins
Systems/Services Affected
Data Fabric machine in iVEC, data base server in db2.arcs.org.au
Staff Responsible
KaiLu, Gareth Williams
Detailed Instructions
A backup of usage statistics DB - irodsUsage at the machine db-2.arcs.org.au is required in advance
As DB user irodsUsage, connecting to DB - irodsUsage located at db-2.arcs.org.au:
run
ALTER TABLE users ADD COLUMN sharedToken VARCHAR(255) DEFAULT '' AFTER username;
As user rods at the machine arcs-df.ivec.org:
run
cd /opt/rods/iRODS/server/bin/usageScripts/DBScripts
wget http://projects.arcs.org.au/trac/systems/browser/trunk/dataFabricScripts/iRODS/utils/dbOps.py?format=raw -O dbOps.py
chmod +x dbOps.py
Modify the data bases connection information in the script
Run
crontab -e
and add line
45 1 * * * python /opt/rods/iRODS/server/bin/usageScripts/DBScripts/dbOps.py >> /opt/rods/iRODS/server/log/useLog-DataFabric 2>&1
The cron job will be run at 1:45 AM every day
Testing Procedures
To do a quick test to see if it's working, run:
python /opt/rods/iRODS/server/bin/usageScripts/DBScripts/dbOps.py >> /opt/rods/iRODS/server/log/useLog-DataFabric 2>&1
As DB user irodsUsage, connecting to DB - irodsUsage located at db-2.arcs.org.au:
Run
select username, sharedToken from users;
They should be the same as shown in the Table r_user_main of ICAT
Back-out Procedures
As user rods at the machine arcs-df.ivec.org:
Run
crontab -e
and remove line
45 1 * * * /opt/rods/iRODS/server/bin/usageScripts/DBScripts/dbOps >> /opt/rods/iRODS/server/log/useLog-DataFabric 2>&1
remove file
rm -rf /opt/rods/iRODS/server/bin/usageScripts/DBScripts/dbOps.py
As DB user irodsUsage, connecting to DB - irodsUsage located at db-2.arcs.org.au:
run
ALTER TABLE users DROP COLUMN sharedToken;
or
Make a recovery from the latest backup of usage statistics DB - irodsUsage at the machine db-2.arcs.org.au
Review
Schedule/Notification
Tracking completion
Done. --
KaiLu - 01 Jun 2009
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-002 | 03 Jun 2009 - 08:21 | ---+!! Upgrade and Migration of ARCS Sakai
Description
Upgrade from Sakai 2.4 to 2.5, migration to new hardware, retirement of existing virtual machine.
Resolves multiple problems with existing setup:
- Existing setup uses private IP, problematic because:
- Cannot use mail-archive tool.
- Automatic monitoring (eg. NAGIOS) not practical.
- OpenVZ? less flexible and tied to host configuration.
- Content is currently proxied by an unreliable and improperly supported intermediate host.
- Existing host has limited storage options.
- New hardware:
- is more powerful;
- has more resources dedicated to the Sakai VM;
- has remote management capabilities;
- has redundant power supplies;
- is located in air-conditioned facilities;
- offers far more (and much faster) storage; and
- has many other benefits.
Proposed Date
Wednesday, 10th June 2009, 10am-2pm AEST.
Estimated Duration
Four hours
Systems/Services Affected
All sites hosted on the ARCS Sakai will be completely unavailable during the period of the outage.
Level of Impact
3D
- 3: downtime more than 2 hours but less than 24 hours
- D: multiple services at multiple sites OR significant loss of functionality affecting more than 10 people
Staff Responsible
- Denny (via ARCS email, Jabber)- contact for Sakai-level issues.
- Garth Denley (via ARCS email, Jabber)- contact for system-level issues.
- Andy Botting- for DNS record change only.
Detailed Instructions
- Update DNS records (A and MX) for the new Sakai host. (Andy)
- Disable access to both the old and new Sakai instances.
- On both hosts: service shutdown tomcat
- Firewall off new ARCS Sakai to allow selected access only.
- will be done using iptables rules
- connecting users will have packet rejected.
- if practical, may be redirected to a downtime page- although this may prove difficult.
TODAY=`date +%F`
mysqldump -h localhost -u sakai -p sakai > sakai-$TODAY.sql
scp sakai-$TODAY.sql denny@sakai-new.arcs.org.au:~/storage
cd /home/tomcat/sakai
tar czf ~/sakai-files.tar.gz db
scp sakai-files.tar.gz denny@sakai-new.arcs.org.au:~/storage
- On sakai-new (new Sakai):
-
- Run content conversion tool to fix up the site descriptions
./fixd < sakai-xxxx.sql > sakai-xxxx-fixed.sql
-
- Both the tar.gz and sql files on both the old and new hosts should be checked to ensure nothing went wrong in the transfer:
md5sum sakai-xxx.sql
md5sum sakai-files.tar.gz
-
- The MD5 sum reported should be the same on each host.
-
- Create database for Sakai and import the database from the dump file, and also extract the binary content as well
mysql -u root -p
create database sakai_prod default character set utf8;
grant all on sakai_prod.* to sakai_prod@localhost identified by 'xxxx';
grant all on sakai_prod.* to sakai_prod@127.0.0.1 identified by 'xxxx';
mysql -h localhost -u sakai_prod -p sakai_prod < sakai-xxxxx
mkdir -p /storage/sakai/db
cd /storage/sakai
tar xzf /storage/denny/sakai-files.tar.gz
-
- Run conversion script to update the database structure
cd /home/denny/sakai-2-5-x/reference/docs/conversion
mysql -u sakai_prod -p sakai_prod
source sakai_2_4_0-2_5_0_mysql_conversion.sql
source sakai_2_5_0-2_5_2_mysql_conversion.sql
-
- Update sakai.properties (if needed)
bodyPath
bodyVolumes
username@javax.sql.BaseDataSource=sakai_prod
password@javax.sql.BaseDataSource=xxxx
url@javax.sql.BaseDataSource=
-
- Update hostname (sakai-new -> sakai)
system-config-network
-
- Other references to hostname
find . -type f -exec grep sakai /dev/null {} \;
-
- and clean up anything found.
- Reboot new server to ensure new hostname is being used properly, all services come up automatically, etc.
- Test. Additional volunteers have been sought. This step will be performed in the last hour of the four hour period.
- Remove firewall rules to allow world-wide access.
Testing Procedures
- Login with accounts created on old Sakai server
- Check/compare list of usernames on both server
- Test Reset Password feature
- Compare contents of the existing Sakai server and the new server visually
- Check site descriptions in Home tool
- Check content of Resources tool
- Check announcement contents
- Check other tools contents, e.g. Site Stats
- Upload file & create folder using Resources tool with various size: small, slightly less than 20 MB, above 20 MB (should be rejected, because the limit is 20 MB)
- Upload file & create folder using WebDAV? with different Operating System
- Test Email Archive tool: sending a mail to @sakai.arcs.org.au, with various email format
Back-out Procedures
As the original data remains untouched on the old server, backing out is not a particular onerous process:
- DNS records returned to point at old VM.
- New server reconfigured to use temporary name (sakai-new) so as to not interfere with old system.
- Old system brought back up.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-003 | 15 Jun 2009 - 11:15 | ---+!! Update and migration of ARCS projects.arcs.org.au
Description
| ARCS is migrating all hosted sites that use Trac and Subversion to |
|
|
| Trac |
0.11.4 |
| Subversion |
1.4.2 |
| OS (Xen Guest) |
32-bit CentOS 5.3 |
| OS (Xen Host) |
64-bit CentOS 5.3 |
| The reason for this change is to |
|
|
| * Transfer from AC3 to Intersect owned host machines |
|
| * Increase storage space for Subversion repositories |
|
| * Partition the administrative interface to allow greater autonomy for project leaders |
|
Estimated Duration
2 hours
Systems/Services Affected
http://projects.arcs.org.au
https://projects.arcs.org.au
This covers all Trac and Subversion sites, as well as the vdt mirror and ARCS/MARCS yum repositories
Staff Responsible
Simon Yin -
simon.yin@arcs.org.au, Tel: 0410609094
Detailed Instructions
Migration of the ARCS Trac/Subversion site is summarised by the following steps:
- On the old system, dump the PostgreSQL? database to a backup file
- On the old system, dump the Trac wiki/attachments directory to a backup directory
- On the new system, initialise the new Trac and Subversion environment
- Restore the PostgreSQL? database and wiki/attachments directory to the new system
- Synchronise a new Subversion repository using the old system as the source.
The mirror and repositories are tranferred with rsync. Note the root of the server filesystem on the new server is /var/www. This is changed from the old system where the root of the server filesystem was /repository
Server certificates for Apache (from /etc/pki/tls/certs/ and /etc/pki/tls/private/ ) and for Globus (from /etc/grid-security/ ) are also transferred to the new server.
Links to detailed instructions for each individual site:
https://projects.arcs.org.au/trac/systems/wiki/HowTo/UpgradeAndMigrateTracSubversion
Testing Procedures
An Internet accessible IP address, 203.202.139.98, is configured on the new system for system testing.
The following scenarios are tested for each Trac/Subversion site:
- Windows XP, IE8 user
- Connect anonymously to http://203.202.139.98/trac/project-under-test
- View tickets
- Register as "TestUser"
- (https only) Connect as "TestUser" with no extra permissions to https://203.202.139.98/trac/project-under-test
- Create a new ticket
- (https only) Connect as user with full admin permissions to https://203.202.139.98/trac/project-under-test
- Change the password of "TestUser" (and then login again as "TestUser" to check the password)
-
- Connect to http://203.202.139.98/svn/project-under-test
- Repeat the tests in 1. for a Mac OSX, Safari user
- Repeat the tests in 1. for a Linux, Firefox 3.0.x user
- (https only) Linux, Subversion from command line (if necessary, check out from a known sub-directory of the project-under-test to save time)
- svn co https://203.202.139.98/svn/project-under-test
Where tests listed above have used the http protocol, repeat them with the https protocol.
For the mirror and yum repositories, simply confirm that a web browser can browse
http://203.202.139.98/mirror/vdt/ and
http://203.202.139.98/dist/
Once all tests are passed, the IP address of projects.arcs.org.au (203.202.30.92) will replace the test IP address and the system deemed as accepted into production.
All references in Trac configuration files to srb-dev.ac3.edu.au (the temporary URL) are then replaced with projects.arcs.org.au
Back-out Procedures
The old server and host machine are unaffected by these changes - moving the service IP 203.202.30.92 back to the old system will revert to the previous system configuration.
Review
Schedule/Notification
The following notice will be posted on the Trac front page of each project:
| Scheduled downtime for this site: Monday, 22 June 2009 at 5:00pm AEST |
| Service will be restored on Monday, 22 June 2009 at 7:00pm AEST |
Notification by email:
| For ARCS internal projects: |
|
| |
techstaff@arcs.org.au, developers@arcs.org.au |
| For Fullmoon |
|
| |
simon.cox@csiro.au, ryan.fraser@csiro.au, Alexandre.Robin@spotimage.fr, bryan.lawrence@stfc.ac.uk, eboisver@nrcan.gc.ca, Francois.Letourneau@RNCan-NRCan.gc.ca, Jacek.Radajewski@csiro.au, johannes.echterhoff@igsi.eu, Jonathan.Yu@csiro.au, mggr@pml.ac.uk, pavel.golodoniuc@csiro.au, Peter.Taylor@csiro.au, pcd@ecosystem.com, Rick.Meng@csiro.au, rob.atkinson@csiro.au, robert.woodcock@csiro.au, rocky@cc.gatech.edu, stefan.hansen@lisasoft.com |
| For pht3d |
|
| |
Janek.Greskowiak@csiro.au, aaron.mcdonough@csiro.au |
| For aodaac |
|
| |
ken.suber@csiro.au, Edward.King@csiro.au, matt.paget@csiro.au |
| For common-grid-libs |
|
| |
aron.abrook@qut.edu.au |
| For commons-vfs-grid |
|
| |
manish.saroha@its.monash.edu.au |
| For CAPSnav |
|
| |
Edward.King@csiro.au, Leon.Majewski@bom.gov.au, Peter.Turner@csiro.au |
| For DataMINX? |
|
| |
p.turner@chem.usyd.edu.au, alexa@intersect.org.au, andrey@intersect.org.au, carlos@intersect.org.au, Doug.duBoulay@gmail.com, xiao.wang@stfc.ac.uk |
| For Generic Grid Client |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org, vladimir.mencl@canterbury.ac.nz |
| For grisu |
|
| |
y.halytskyy@auckland.ac.nz |
| For grix |
|
| |
aron.abrook@qut.edu.au |
| For gsub |
|
| |
sam.moskwa@sapac.edu.au |
| For jmoleditor |
|
| |
Joachim.Mai@ac3.edu.au, vvv900@gmail.com, Martin.Thompson@unsw.edu.au |
| For macddap |
|
| |
jason.lohrey@arcitecta.com, R.Williams@utas.edu.au, paolap@utas.edu.au, pblain@postoffice.utas.edu.au |
| For access |
|
| |
martin.dix@csiro.au, l.logan@bom.gov.au, mrezny@sgi.com |
| For mem |
|
| |
chris.jackett@csiro.au, peter.turner@csiro.au, jenny.lovell@csiro.au, r.williams@utas.edu.au |
| For mgplot |
|
| |
frank.colberg@utas.edu.au |
| For pht3d |
|
| |
Henning.Prommer@csiro.au, Aaron.McDonough@csiro.au |
| For portal |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org |
| For sggc |
|
| |
Terry.Rankine@csiro.au |
| For slcs-client |
|
| |
chi.nguyen@melco.mq.edu.au |
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-004 | 17 Jun 2009 - 16:50 | ---+!! Upgrade Davis
Description
Upgrade Davis
Estimated Duration
1 mins
Systems/Services Affected
Data Fabric
Staff Responsible
Shunde Zhang, Florian Goessmann
Detailed Instructions
- Download davis tar ball from google code
- Extract it to /opt/davis
- Modify web.xml
- Stop davis service
- Change soft link from last version to current version
- Start Davis
Testing Procedures
- Try litmus.
- Try WebDAV? clients.
Back-out Procedures
Stop service, change soft link back to the last working version. Restart service.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-005 | 19 Jun 2009 - 09:15 | ---+!! iRODS 'gethostbyname' Problem Resolution
Description
It has been found that all iRODS Production servers can, under heavy network traffic conditions, log a message of the following form:
WARNING: gethostbyname of df.arcs.org.au is taking 2 sec. This could severely affect interactivity of your Rods system
To overcome this, the name of partner hosts will be added to the
/etc/hosts file in each server.
Estimated Duration
This Change will take 30 minutes, including testing and backout steps.
Systems/Services Affected
The
/etc/hosts file will be updated on each of the following systems:
arcs-df.ivec.org
arcs-df.vpac.org
arcs-df.ac3.edu.au
arcs-df.hpcu.uq.edu.au
Staff Responsible
Graham Jenkins
Detailed Instructions
Save the attached hostbyname test script in /tmp on each machine, and make it executable. Then do:
time /tmp/hostbyname arcs-df.ivec.org arcs-df.vpac.org arcs-df.ac3.edu.au arcs-df.hpcu.uq.edu.au
and note the 'real' time.
Then add the following lines to
/etc/hosts on each of the above systems, commenting the entry for the system itself:
# Entries added 20090619 to improve 'gethostbyname' performance
#192.65.130.247 arcs-df.ivec.org
202.158.218.58 arcs-df.vpac.org
203.202.30.74 arcs-df.ac3.edu.au
130.102.163.138 arcs-df.hpcu.uq.edu.au
Testing Procedures
Repeat the test described above, ensure that it produces the same output and that the 'real' time is significantly less than before.
Back-out Procedures
Remove the entries added to
/etc/hosts and repeat the test, ensuring that it produces the same results.
Review
Schedule/Notification
It is proposed that this Change should be performed at 9am Friday June 19.
Tracking completion
Test results ("before" and "after" real times) were as follows:
arcs-df.ivec.org: 0.84, 0.09
arcs-df.vpac.org: 0.77, 0.07
arcs-df.ac3.edu.au: 1.54, 0.10
arcs-df.hpcu.uq.edu.au: 1.06, 0.08
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-006 | 26 Jun 2009 - 10:54 | ---+!! Upgrade Davis 0.7.2a
Description
Upgrade Davis
Estimated Duration
1 mins
Systems/Services Affected
Data Fabric
Staff Responsible
Shunde Zhang, Florian Goessmann
Detailed Instructions
- Download davis tar ball from google code
- Extract it to /opt/davis
- Modify web.xml
- Stop davis service
- Change soft link from last version to current version(0.7.2a)
- Start Davis
Testing Procedures
- Try litmus.
- Try WebDAV? clients.
Back-out Procedures
Stop service, change soft link back to the last working version. Restart service.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-007 | 23 Jun 2009 - 23:11 | ---+!! Upgrade of nggums.sf.utas.edu.au
Description
Migrate the existing nggums.sf.utas.edu.au to
CentOS? 5.3 and VDT 1.10.1y.
Estimated Duration
1 hour.
Systems/Services Affected
Grid services at TPAC.
Staff Responsible
LeighGordon
Detailed Instructions
The existing nggums.sf.utas.edu.au will be shutdown. The new installation will be renamed and tested.
Testing Procedures
Tests 1 through 4 of the Globus 4
TestSuite? located here
http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite will be performed. If they all succeed, this installation will be put into production.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated hour, then this installation will be turned off, and the previous production installation will be brought back online.
Review
Schedule/Notification
Notifications are to be sent to the Community, Developers, and Techstaff mailing lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-008 | 08 Aug 2009 - 10:10 | ---+!! Adding rules for
IMOS collections
Description
Files under
IMOS collections will need to be stored at different resources.
Estimated Duration
1 hour
Systems/Services Affected
Data Fabric on all production boxes:
arcs-df.hpcu.uq.edu.au, arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.sf.utas.edu.au, arcs-df.eresearchsa.edu.au, arcs-df.vpac.org
Staff Responsible
Pauline Mak
Detailed Instructions
Modify core.dvm to map $objPath to the appropriate data structure in the
acSetRescSchemeForCreate rule.
Note that: $IRODS_HOME = /opt/iRODS-2.0v/iRODS
Open up
$IRODS_HOME/server/config/reConfigs/core.dvm
Insert in line 17:
objPath||rei->doinp->objPath
Resulting in line 16-18 of:
objPath||rei->doi->objPath
objPath||rei->doinp->objPath
rescName||rei->doi->rescName
Download the imos rules file
cd $IRODS_HOME/server/config/reConfigs
wget http://projects.arcs.org.au/trac/systems/export/993/trunk/dataFabricScripts/iRODS/Rules/imos.irb -O imos.irb
Now add the
IMOS rules into the server config $IRODS_HOME/server/config/server.config
reRuleSet imos,arcs,core
Testing Procedures
Login as the rods user, and modify the ~/.irods/.irodsEnv file and comment out the default resource setting.
Put a file in the following directory on the data fabric. Create a dummy text file, then
icd /ARCS/projects/IMOS/archive/eMII/
iput testForIMOSRules
ils -L
The result should say the file is now on emii.resource.tpac.org.au.
Remove the file using irm and uncomment the default resource setting in ~/.irods/.irodsEnv.
Repeat for all hosts.
Back-out Procedures
Remove imos from the server configuration file: $IRODS_HOME/server/config/server.config
reRuleSet arcs,core
Remove the imos rules file:
rm $IRODS_HOME/server/config/reConfigs/imos.irb
Revert $IRODS_HOME/server/config/reConfigs/core.dvm to previous state by removing line 17. Resulting in the following line 16-18:
objPath||rei->doi->objPath
rescName||rei->doi->rescName
destRescName||rei->doi->destRescName
Review
Schedule/Notification
Enable
IMOS rules after the outage on 27th June.
Tracking completion
iVEC - applied on 29th June
TPAC - applied on 3rd July
VPAC - applied on 3rd July
UQ - applied on 3rd July
eRSA - applied on 3rd July
AC3 - applied on 3rd July
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-009 | 01 Jul 2009 - 16:46 | ---+!! Upgrade of grid virtual machines at iVEC
Description
Migrate the existing ng2, ngdata, and nggums virtual machines to CentOS 5.3 and VDT 1.10.1 on new hardware.
Estimated Duration
2 hours.
Systems/Services Affected
All ARCS grid services in WA.
Staff Responsible
DarranCarey
Detailed Instructions
Shutdown the existing ng2, ngdata and nggums virtual machines. Rename the new ng2, ngdata and nggums installations and test the grid workflow.
Testing Procedures
Tests 1 through 4 of the Globus 4 TestSuite located here
http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite will be performed. If they all succeed, this installation will be put into production.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated time, then these installations will be turned off, and the previous production installations will be brought back online.
Review
Schedule/Notification
Notifications are to be sent to the Community, Developers, and Techstaff mailing lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
|
ChangeNote200906-010 | 18 Sep 2009 - 10:17 | ---+!! Deploy TDS 4.0 stable
Deploy TDS 4.0 Stable and modify file system permission
Description
TDS 4.0 stable will be replacing the current alpha version. Additionally, TDS will now be access data managed by the DF. Underlying file permission has to be changed to accommodate this.
Estimated Duration
2 hours
Systems/Services Affected
ARCS
OPeNDAP? Network
arcs-df.ivec.org, arcs-df.sf.utas.edu.au, arcs-df.eresearchsa.edu.au, arcs-df.intersect.org, arcs-df.hpcu.uq.edu.au
Staff Responsible
Pauline Mak, Shunde Zhang
Detailed Instructions
Stop the TDS server.
/sbin/service tds stop
Disable robots
To disable robots from touching all files, create a file in /var/www/html/robots.txt and put in the following text"
User-agent: *
Disallow: /
Modify Permission
As the
rods user, create the directory structure on your local resource for the
OPeNDAP? directory /ARCS/projects/IMOS/opendap/ e.g.
icd /ARCS/projects/IMOS/opendap/
iput -R irodsdev.vpac.org test.txt
irm test.txt
Then, as
rods user, get the script from subversion:
cd $IRODS_HOME/server/bin/local
wget http://projects.arcs.org.au/trac/systems/export/1095/trunk/dataFabricScripts/iRODS/utils/opendap_acl.sh
chmod 700 opendap_acl.sh
You will need to replace the value of vaultDir and user.
To your vault, use
iadmin lr <local resource name>
Then find out the name of the TDS user by looking at /etc/default/tds and look for "JETTY_USER".
For example, on irodsdev.vpac.org, the values would be:
vaultDir=/data/Vault
user=jetty
Run the script:
$IRODS_HOME/server/bin/local/opendap_acl.sh
To test that this is working, become the jetty user (or whatever this user is set to on your system). You should be able to cd into the opendap directory, however, you should not be able to list or view any of the directories
above opendap. Check that the ACL is set, you should get something like the following using getfacl:
-bash-3.2$ getfacl --access /data/Vault/ARCS/projects/IMOS/opendap
getfacl: Removing leading '/' from absolute path names
# file: data/Vault/ARCS/projects/IMOS/opendap
# owner: rods
# group: rods
user::rwx
user:jetty:r-x
group::---
mask::r-x
other::---
As the
rods user, modify the cronjob such that the script is executed hourly:
0 * * * * /opt/iRODS-2.1v/iRODS/server/bin/local/opendap_acl.sh >/dev/null 2>&1 || :
Installing new TDS
Download the latest version of TDS.
wget ftp://ftp.unidata.ucar.edu/pub/thredds/4.0/thredds.war
Then, as root:
cp -r /opt/tds/webapps/thredds ~/thredds_old
rm -rf /opt/tds/webapps/thredds/
unzip thredds.war -d /opt/tds/webapps/thredds/
chown jetty:jetty -R /opt/tds/
As the
jetty user, configure your TDS server to point to the new
OPeNDAP? directory by modifying catalog.xml. This file is in
/opt/tds/content/thredds
If you are using the default catalog, then remove all XML tags between <catalog> and </catalog> and replace it with the following:
<catalogRef name="IMOS" xlink:title="Integrated Marine Observation System" xlink:href="imos.xml" />
Otherwise, sure the above tag is somewhere in between the catalog element.
Modify the name attribute in the
element to read:
<your institution's name>/ARCS OPeNDAP server
Save and close the file.
Create a new file in the same directory (/opt/tds/content/thredds) with the name imos.xml
And put in the following
<-- Regularly Gridded Datasets -->
<service name="regGriddedServices" base="" serviceType="compound">
<service name="dapService" serviceType="OpenDAP" base="/thredds/dodsC/" />
<service name="httpService" serviceType="HTTPServer" base="/thredds/fileServer/" /> <!-- direct file download -->
<service name="wcsService" serviceType="WCS" base="/thredds/wcs/" /> <!-- OGC Web Coverage Service -->
<service name="wmsService" serviceType="WMS" base="/thredds/wms/" /> <!-- OGC Web Map Service -->
<service name="ncss" serviceType="NetcdfSubset (Experimental)" base="/thredds/ncss/grid/" /> <!-- NetCDF Subset service -->
</service>
<!-- Irregularly Gridded Dataset -->
<service name="irregGriddedServices" base="" serviceType="compound"> <service name="dapService" serviceType="OpenDAP" base="/thredds/dodsC/" /> <service name="httpService" serviceType="HTTPServer" base="/thredds/fileServer/" /> <!-- direct file download -->
<service name="wmsService" serviceType="WMS" base="/thredds/wms/" /> <!-- OGC Web Map Service -->
<service name="ncss" serviceType="NetcdfSubset (Experimental)" base="/thredds/ncss/grid/" /> <!-- NetCDF Subset service -->
<!-- Station Datasets -->
<service name="station" base="" serviceType="compound">
<service name="dapService" serviceType="OpenDAP" base="/thredds/dodsC/" />
<service name="httpService" serviceType="Driect Download" base="/thredds/fileServer/" /> <!-- direct file download -->
</service>
<!-- Trajectory Datasets -->
<service name="trajectory" base="" serviceType="compound">
<service name="dapService" serviceType="OpenDAP" base="/thredds/dodsC/" />
<service name="httpService" serviceType="HTTPServer" base="/thredds/fileServer/" /> <!-- direct file download -->
</service>
<!-- Files that are not served by OPeNDAP, just plain HTTP. E.g. Matlab Scripts -->
<service name="httpOnly" serviceType="HTTPServer" base="/thredds/fileServer/" />
<!-- Now configure the dataset directory - note that IMOS may request this to be changed from time to time to include extra metadata about datasets -->
When datasets are moved to the OPeNDAP directory, you may want to ask your eMII/IMOS contact whether this dataset belongs to any of
the above service categories (regGriddedServices, etc) and configure the "serviceName" value accordingly . -->
<dataset name="IMOS - VPAC">
<datasetScan name="IMOS OPeNDAP directory" path="IMOS" location="/data/Vault/ARCS/projects/IMOS/opendap/" serviceName="basic">
<filter>
<include wildcard="*.nc" />
<include wildcard="*.hdf" />
</filter>
</datasetScan>
</dataset>
Modify the THREDDS server to run with the following options (in /etc/default/tds)
JAVA_OPTIONS="-server -Xmx1536m -Xms512m -Djava.awt.headless=true"
Then start the server
/sbin/service tds start
Testing Procedures
Browse to your TDS installation:
http://hostname/thredds
The footer should be updated with the latest version name: "THREDDS Data Server [Version 4.0.26 - 20090831.2140] Documentation"
TDS 4.0 is now installed!
Back-out Procedures
To reinstate the old version:
service tds stop
rm /opt/jetty/webapps/thredds.war
rm -rf /opt/jetty/webapps/thredds
mv ~/thredds_old /opt/jetty/webapps/thredds
Review
Schedule/Notification
TPAC - Done
QCIF - Done
eRSA - Done
Intersect - Done[Sridhar Reddapani]
iVEC - Done
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200906-011 | 08 Aug 2009 - 10:09 | ---+!! Rules Update Script for iRODS
Description
The updateRules.sh script executes hourly on each ARCS Production and Development iRODS server to download whatever version of the relevant arcs.irb (Rules) file is currently in the SVN repository. The file is downloaded with a temporary name, and edited so as insert the local default resource before being moved into place.
An imos.irb Rules file has now been manually deployed to Production and Development servers, and there is a requirement for versions of this to be automatically updated on all such servers when it changes.
The updateRules.sh script has therefore been extended so as to accommodate this requirement.
It is anticipated that the script will need to handle additional Rules files in the future, so the extended version has the capability of updating itself from the SVN repository.
This Change covers the deployment of the extended script to all Development (Phase 1) and Production (Phase 2) servers.
The notification address shown in the createUser.config file on each server will also be updated (to arcs-data@arcs.org.au) during each Phase. This will ensure that all iRODS server administrators are notified when new user identities are created.
Estimated Duration
The maximum duration for each Phase (including testing and backout steps) is one hour. There should be no interruption to service as seen by users.
Systems/Services Affected
Phase 1: irodsdev.vpac.org, data-dev.eresearchsa.edu.au, ngdata-dev.hpcu.uq.edu.au
Phase 2: arcs-df.vpac.org, arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.hpcu.uq.edu.au, arcs-df.tpac.org.au, arcs-df.eresearchsa.edu.au
Staff Responsible
Graham Jenkins
Detailed Instructions
On each machine listed, do crontab -e and comment the updateRules.sh line; take note of the machine's default resource-name as shown on that line. Then perform the following steps (as the 'rods' user):
cd $IRODS_HOME/server/bin/local
mv updateRules.sh /var/tmp
wget http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/Rules/updateRules.sh
chmod a+xr updateRules.sh
cp $IRODS_HOME/server/config/createUser.config /var/tmp
vi $IRODS_HOME/server/config/createUser.config
# Change address in the line which starts in 'A' to: arcs-data@arcs.org.au
Also (Phase 1 only) .. update http://projects.gridaus.org.au/trac/systems/wiki/DataServices/iRODS_Server to reflect procedure in this Change.
Testing Procedures
On each machine listed, perform the following steps (as the 'rods' user):
cp updateRules.sh /tmp
echo "##" >> updateRules.sh
$IRODS_HOME/server/bin/local/updateRules.sh irodsdev.vpac.org # <= Substitute Machine's Default Resource-Name
cmp /tmp/updateRules.sh updateRules.sh # Should be the same
cd $IRODS_HOME/server/config/reConfigs
cp *.irb /var/tmp
echo "##">>arcs.irb
echo "##">>imos.irb
$IRODS_HOME/server/bin/local/updateRules.sh irodsdev.vpac.org # <= Substitute Machine's Default Resource-Name
cmp /var/tmp/arcs.irb arcs.irb # Should be the same
cmp /var/tmp/imos.irb imos.irb # Should be the same
If the tests are successful, do crontab -e and uncomment the line containing: updateRules.sh
Back-out Procedures
Perform the following steps:
cd /var/tmp
cp arcs.irb $IRODS_HOME/server/config/reConfigs/
cp imos.irb $IRODS_HOME/server/config/reConfigs/
cp updateRules.sh $IRODS_HOME/server/bin/local
Then do crontab -e and uncomment the line containing: updateRules.sh
Review
Schedule/Notification
Phase 1: 1:30 pm Wednesday July 1.
Phase 2: 9:00 am Friday July 3.
Tracking completion
Phase 1 completed: 2:15 pm Wednesday July 1.
Phase 2 completed: 10:05 am Friday July 3.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-001 | 09 Jul 2009 - 12:59 | ---+!! TITLE
Description
Moving Master ICAT from arcs-df.ivec.org to arcs-df.vpac.org and moving ICAT to Pgpool
Estimated Duration
8 Hours
Systems/Services Affected
Data Fabric
Staff Responsible
Sridhar Reddapani, Pauline Mak
Detailed Instructions
- 1. Install Pgpool on arcs-df.vpac.org under /opt/PgPool
- Download latest Pgpool from http://pgfoundry.org/projects/pgpool/ to /tmp
- mkdir /opt/PgPool [as root]
- chown -R rods:rods /opt/PgPool [as root]
- su - rods
- cd /opt/PgPool/
- mkdir pgpool
- mkdir logs
- cd /tmp
- tar -xvf pgpool-II-2.2.tar.gz
- cd pgpool-II-2.2
- ./configure --prefix=/opt/PgPool/pgpool
- make
- make install
- 3. Stop iRODS on all servers, stop Davis on arcs-df.ivec.org
- service davis stop
- service irods stop
- service postgres stop [if Running]
- service davis stop [ on arcs-df.ivec.org]
- 4. Setup password less ssh access between arcs-df.ivec.org and arcs-df.vpac.org for user rods
- 5. Take backup of ICAT from arcs-df.ivec.org and copy it to arcs-df.vpac.org, run below commands on arcs-df.ivec.org
- Copy whole /opt/iRODS-2.0v/Postgres/pgsql/data directory to off site and DVD?
- service postgres start
- pg_dump ICAT > ICAT_ivec_DUMP
- Write ICAT_ivec_DUMP file to DVD and check CheckSum? is OK
- scp ICAT_ivec_DUMP rods@arcs-df.vpac.org:/tmp/
- 6. Change local postgres port to 5431 on arcs-df.vpac.org and make change shared_buffers[25% of RAM] and effective_cache_size [50% of RAM] on both postgres servers
- edit .../Postgres/pgsql/data/postgresql.conf
- change port = 5431 [ on arcs-df.vpac.org]
- shared_buffers = 25% of RAM [ on both arcs-df.vpac.org & arcs-df.ivec.org]
- effective_cache_size = 50% of RAM [on both arcs-df.vpac.org & arcs-df.ivec.org]
- 7. Start postgres, drop ICAT and restore ICAT from dump from arcs-df.vpac.org
- service postgres start
- dropdb -p 5431 ICAT
- createdb -p 5431 ICAT
- psql -p 5431 ICAT < /tmp/ICAT_ivec_DUMP
- 8. Install cronolog on arcs-df.vpac.org
- 9. Configure Pgpool and add local postgres[5431] on arcs-df.vpac.org and postgres on arcs-df.ivec.org to Pgpool as documented here http://projects.arcs.org.au/trac/systems/wiki/DataServices/Database/PgpoolConfiguration
- Edit /opt/PgPool/pgpool/etc/pgpool.conf as
- listen_addresses = '*'
- port = 5432
- pcp_port = 9898
- logdir = '/opt/PgPool/logs'
- pid_file_name = '/opt/PgPool/logs/pgpool.pid'
- replication_mode = true
- load_balance_mode = true
- insert_lock = false
- num_init_children = 200
- max_pool = 1
- backend_hostname0 = 'arcs-df.vpac.org'
- backend_port0 = 5431
- backend_weight0 = 1
- backend_hostname1 = 'arcs-df.ivec.org'
- backend_port1 = 5432
- backend_weight1 = 0
- Edit /opt/PgPool/pgpool/etc/pool_hba.conf and give access to all irods server
- Edit /opt/PgPool/pgpool/etc/pcp.conf and add rods md5 password to it.
- 10. Make sure you have right access to other servers in pg_hba.conf on both servers
- 11. Start postgres on both servers and then start pgpool -- check logs for any errors
- 12. Config irods arcs-df.vpac.org to talk to pgpool for Master ICAT and start iRODS
- edit /opt/iRODS-2.0v/iRODS/server/config/server.config as
- icatHost arcs-df.vpac.org
- service irods start
- 13. Config irods arcs-df.ivec.org to talk to pgpool for Master ICAT and local postgres for Slave ICAT then start iRODS
- edit /opt/iRODS-2.0v/iRODS/server/config/server.config as
- icatHost arcs-df.vpac.org
- slaveIcatHost arcs-df.ivec.org
- service irods start
- 14. By now we finished basic configuration of Pgpool, Check everything is working as expected and proceed to configure Pgpool Online Recovery
- 15. Run these to stop everything
- service irods stop [ on arcs-df.ivec.org]
- service irods stop [ on arcs-df.vpac.org]
- pgpool stop [ on arcs-df.ivec.org]
- service postgres stop [ on arcs-df.vpac.org]
- service postgres stop [ on arcs-df.ivec.org]
- 16. Enable WAL archiving on both servers and start postgres, Test WAL archiving is working..
- mkdir /opt/iRODS-2.0v/Postgres/pgsql_wal_archive/
- touch /opt/iRODS-2.0v/Postgres/backup_in_progress
- edit /opt/iRODS/Postgres/pgsql/data/postgresql.conf as
- archive_mode = on
- archive_command = 'test ! -f /opt/iRODS-2.0v/Postgres/backup_in_progress || rsync -a %p /opt/iRODS-2.0v/Postgres/pgsql_wal_archive/%f'
- archive_timeout = 5
- service postgres start
- After 5 minutes check whether you have WAL files in /opt/iRODS-2.0v/Postgres/pgsql_wal_archive/ if so WAL archiving is OK
- rm /opt/iRODS-2.0v/Postgres/backup_in_progress
- edit /opt/iRODS/Postgres/pgsql/data/postgresql.conf as
- 17. Install Pgpool function on arcs-df.vpac.org and copy untared pgpool directory to arcs-df.ivec.org
- cd /tmp/pgpool-II-2.2/sql/pgpool-recovery/
- make install
- psql -p 5431 -f pgpool-recovery.sql template1
- scp -r /tmp/pgpool-II-2.2/ rods@arcs-df.ivec.org:/tmp/
- 18. Install Pgpool function on arcs-df.ivec.org
- cd /tmp/pgpool-II-2.2/sql/pgpool-recovery/
- make install
- psql -p 5432 -f pgpool-recovery.sql template1
- 19. Edit /opt/PgPool/pgpool/etc/pgpool.conf as..
- health_check_timeout = 90
- health_check_period = 30
- failover_command = 'echo WARNING from ARCS DF | mail -s "One of the Pgpool node on arcs-df.vpac.org is DOWN!" arcs-data@arcs.org.au'
- failback_command = 'echo NOTIFICATION from ARCS DF | mail -s "Pgpool on arcs-df.vpac.org is now HEALTHY" arcs-data@arcs.org.au'
- backend_data_directory1 = '/opt/iRODS-2.0v/Postgres/pgsql/data/'
- backend_data_directory2 = '/opt/iRODS-2.0v/Postgres/pgsql/data/'
- recovery_user = 'rods'
- recovery_password = 'rods-password'
- recovery_1st_stage_command = 'copy_base_backup'
- recovery_2nd_stage_command = 'pgpool_recovery_pitr'
- 20. Download attached copy_base_backup, pgpool_recovery_pitr and pgpool_remote_start files and make (any required) changes to copy_base_backup and pgpool_remote_start
- 21. Testing Online Recovery
- Start postgres and pgpool on arcs-df.vpac.org
- service postgres start
- pgpool -d -n 2>&1 | cronolog --hardlink=/opt/PgPool/logs/pgpool.log '/opt/PgPool/logs/%Y-%m-%d-pgpool.log' &
- watch log file while doing recovery on another window
- tail -f opt/PgPool/logs/pgpool.log
- Run recovery command
- pcp_recovery_node -d 20 localhost 9898 rods <rods-password> 1
- If recovery is done, postgres will be started on arcs-df-ivec.org, if not TROUBLESHOOT
- Stop pgpool and postgres
- pgpool stop
- service postgres stop [ on both servers]
- 22. If everything OK by now, CONGRATULATIONS... Start everything as below sequence
- service postgres start [ on arcs-df.vpac.org]
- service postgres start [ on arcs-df.ivec.org]
- pgpool -d -n 2>&1 | cronolog --hardlink=/opt/PgPool/logs/pgpool.log '/opt/PgPool/logs/%Y-%m-%d-pgpool.log' & [ on arcs-df.vpac.org]
- service irods start [ on arcs-df.vpac.org]
- service irods start [ on arcs-df.ivec.org]
- 23. Configure all other DF servers talk to pgpool for Master ICAT and start iRODS
- Edit /opt/iRODS-2.0v/iRODS/server/config/server.config as
- icatHost arcs-df.vpac.org
- service irods start
- 25. Check all rules are in place on arcs-df.vpac.org.
Testing Procedures
- 1. Try accessing DF for both Reading and Writing.
- 2. Check Automatic User creation is functional.
- 3. Check all rules are functional.
Back-out Procedures
- 1. Stop Everything in below sequence
- service irods stop [ on all other irods servers]
- service irods stop [ on arcs-df.ivec.org]
- service irods stop [ on arcs-df.vpac.org]
- pgpool stop [ on arcs-df.vpac.org]
- service postgres stop [ on both arcs-df.vpac.org & arcs-df.ivec.org]
- 2. Configure iRODS on arcs-df.ivec.org to talk to local postgres for Master ICAT and start iRODS
- Edit /opt/iRODS-2.0v/iRODS/server/config/server.config as
- icatHost arcs-df.ivec.org
- service postgres start
- service irods start
- 3. Configure iRODS on all servers to talk to arcs-df.ivec.org for Master ICAT and start iRODS and start iRODS
- Edit /opt/iRODS-2.0v/iRODS/server/config/server.config as
- icatHost arcs-df.ivec.org
- service irods start
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
| ChangeNote200907-002 | 09 Jul 2009 - 09:20 | ---+!! Upgrade of grid virtual machines at CSIRO
Description
Migrate the existing ng2, ngdata, and nggums virtual machines to CentOS 5.3 and VDT 1.10.1 on new hardware
Estimated Duration
2 hours.
Systems/Services Affected
All ARCS grid services provided by CSIRO in Victoria.
Staff Responsible
JoelLudbey? -Bruhwel
Detailed Instructions
Shutdown the existing ng2, ngdata and nggums virtual machines. Rename the new ng2, ngdata and nggums installations and test the grid workflow.
Testing Procedures
Tests 1 through 4 of the Globus 4 TestSuite? located here http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite will be performed. If they all succeed, this installation will be put into production.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated time, then these installations will be turned off, and the previous production installations will be brought back online.
Review
Schedule/Notification
Notifications are to be sent to the Community, Developers, and Techstaff mailing lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-003 | 08 Aug 2009 - 10:08 | ---+!! Moving Davis from iVEC to VPAC
Description
This is a follow up step to the pgpool migration to VPAC (see http://wiki.arcs.org.au/bin/view/Main/ChangeNote200907-001). Davis will be installed at VPAC as it contains the master ICAT
Estimated Duration
1 hour
Systems/Services Affected
df.arcs.org.au, arcs-df.vpac.org
Staff Responsible
Pauline Mak, Andy Botting
Detailed Instructions
As root:
Stop Davis, as root:
/etc/init.d/davis stop
Remove old version of davis
mv /opt/davis ~/davis
Download, and install davis
mkdir /opt/davis
wget http://code.arcs.org.au/hudson/job/davis/ws/trunk/dist/davis-0.7.2d.tar.gz
tar -zxvf davis-0.7.2.tar.gz
mv davis-0.7.2 /opt/davis/davis-0.7.2d
cd /opt/davis/
chown -R davis:davis /opt/davis/davis-0.7.2d
ln -s /opt/davis/davis-0.7.2d davis
Then edit the configuration file:
/opt/davis/davis/webapps/root/WEB-INF/web.xml
Change the settings of the following parameters:
<init-param>
<param-name>webdavis.Log.threshold</param-name>
<param-value>WARNING</param-value>
</init-param>
<init-param>
<param-name>insecureConnection</param-name>
<param-value>shib</param-value>
</init-param>
<init-param>
<description>
server type, srb or irods
</description>
<param-name>server-type</param-name>
<param-value>irods</param-value>
</init-param>
<init-param>
<description>
default idp name
</description>
<param-name>default-idp</param-name>
<param-value>ARCS IdP</param-value>
</init-param>
<init-param>
<description>
server port of srb/irods
</description>
<param-name>server-port</param-name>
<param-value>1247</param-value>
</init-param>
<init-param>
<description>
server name of srb/irods
</description>
<param-name>server-name</param-name>
<param-value>localhost</param-value>
</init-param>
<init-param>
<description>
zone name of srb/irods
</description>
<param-name>zone-name</param-name>
<param-value>ARCS</param-value>
</init-param>
<init-param>
<description>
default resource of user
</description>
<param-name>default-resource</param-name>
<param-value></param-value>
</init-param>
<init-param>
<param-name>anonymousCollections</param-name>
<param-value>/ARCS/worldview,/ARCS/projects/IMOS/public</param-value>
</init-param>
<init-param>
<param-name>shared-token-header-name</param-name>
<param-value>auEduPersonSharedToken</param-value>
</init-param>
<init-param>
<param-name>cn-header-name</param-name>
<param-value>cn</param-value>
</init-param>
<init-param>
<param-name>admin-cert-file</param-name>
<param-value>/etc/grid-security/daviscert.pem</param-value>
</init-param>
<init-param>
<param-name>admin-key-file</param-name>
<param-value>/etc/grid-security/daviskey.pem</param-value>
</init-param>
Remember to remove settings for proxy (proxy-host and proxy-port)! The other options can be left unchanged.
Make a copy of the irods and key PEM files in /etc/grid-security.
cp irodscert.pem daviscert.pem
cp irodskey.pem daviskey.pem
chown davis:davis davis*
Now modify jetty to use the AJP connector (/opt/davis/davis/etc/jetty.xml). Comment out the existing addConnector element, and add in the following XML:
<Call name="addConnector">
<Arg>
<New class="org.mortbay.jetty.ajp.Ajp13SocketConnector">
<Set name="port">8009</Set>
<Set name="ThreadPool">
<New class="org.mortbay.thread.BoundedThreadPool">
<Set name="minThreads">50</Set>
<Set name="maxThreads">500</Set>
<Set name="MaxIdleTimeMs">0</Set>
</New>
</Set>
<Set name="statsOn">false</Set>
</New>
</Arg>
</Call>
In the same file, also modify the ThreadPool? element to optomise performance:
<New class="org.mortbay.thread.QueuedThreadPool">
<Set name="minThreads">50</Set>
<Set name="maxThreads">2000</Set>
<Set name="lowThreads">100</Set>
<Set name="SpawnOrShrinkAt">200</Set>
</New>
Now install startup script at init.d.
cp /opt/davis/davis/bin/jetty.sh /etc/init.d/davis
chmod +x /etc/init.d/davis
And change the script to point to the davis environment variable file. Make the following change to line 134-6:
elif [ -f /etc/default/davis ] ; then
. /etc/default/davis
fi
Then setup the environment variables in the file /etc/default/davis
JETTY_HOME=/opt/davis/davis
JAVA_OPTIONS="-server -Xms1024m -Xmx1562m -XX:+AggressiveHeap -XX:PermSize=192m -XX:MaxPermSize=192m -XX:+UseParallelGC"
JAVA_HOME=/usr/java/default
JETTY_USER=davis
JETTY_PID=/var/run/davis.pid
Double check your settings by running /etc/init.d/davis check:
Checking arguments to Jetty:
JETTY_HOME = /opt/davis/davis
JETTY_CONF =
JETTY_RUN = /var/run
JETTY_PID = /var/run/davis.pid
JETTY_PORT =
JETTY_LOGS =
CONFIGS = /opt/davis/davis/etc/jetty-logging.xml /opt/davis/davis/etc/jetty.xml
JAVA_OPTIONS = -server -Xms1024m -Xmx1562m -XX:+AggressiveHeap -XX:PermSize=192m -XX:MaxPermSize=192m -XX:+UseParallelGC -Djetty.home=/opt/davis/davis -Djava.io.tmpdir=/tmp
JAVA = /usr/java/default
CLASSPATH =
RUN_CMD = /usr/java/jdk1.6.0_10/bin/java -server -Xms1024m -Xmx1562m -XX:+AggressiveHeap -XX:PermSize=192m -XX:MaxPermSize=192m -XX:+UseParallelGC -Djetty.home=/opt/davis/davis -Djava.io.tmpdir=/tmp -jar /opt/davis/davis/start.jar /opt/davis/davis/etc/jetty-logging.xml /opt/davis/davis/etc/jetty.xml
Now configure Apache to use ajp - add the following line in /etc/httpd/conf/httpd.conf
LoadModule proxy_ajp_module modules/mod_proxy_ajp.so
Then modify the SSL config file: /etc/httpd/conf.d/ssl.conf
Install IPSCABUNDLE:
cd /etc/grid-security/
wget http://certs.ipsca.com/companyIPSipsCA/IPS-IPSCABUNDLE.crt
Then modify ssl.conf:
SSLCertificateChainFile /etc/grid-security/IPS-IPSCABUNDLE.crt
chmod 744 IPS-IPSCABUNDLE.crt
Secure copy across the df.arcs.org.au IPCSCA certificate from iVEC to VPAC and put them in /etc/grid-security
/etc/grid-security/df-hostcert.pem
/etc/grid-security/df-hostkey.pem
Then modify VPAC's apache's ssl.conf to point to this pair of files:
SSLCertificateFile /etc/grid-security/df-hostcert.pem
SSLCertificateKeyFile /etc/grid-security/df-hostkey.pem
And add the following to the end of the file (before the closing VirtualHost? tag):
ProxyRequests Off
ProxyPreserveHost On
ProxyPass /ARCS/ ajp://localhost:8009/ARCS/ flushpackets=on
RewriteEngine On
RewriteRule ^/$ /ARCS/home [R]
Make sure the SLCS certificate is installed. If not, copy the /etc/grid-security/arcs-slcs-ca folder from iVEC across to VPAC.
Now modify httpd config (/etc/httpd/conf/httpd.conf) and set the following values:
Timeout 1200
KeepAlive On
MaxRequestsPerChild 0
Download the latest dojokit and store it under Apache document root:
cd /tmp
wget http://download.dojotoolkit.org/release-1.3.1/dojo-release-1.3.1.tar.gz
tar -zxvf dojo-release-1.3.1.tar.gz /var/www/html
cd /var/www/html
ln -s /var/www/html/dojo-release-1.3.1 dojoroot
Start davis and httpd
service davis start
service httpd start
Testing
Browse to http://arcs-df.vpac.org/ARCS/home and see if you can login.
Upload/download files to see if it is working. Also check that anonymous access to public areas under IMOS and worldview are functional.
Also check connectivity through webdev using Linux, Windows and Mac using Litmus.
DNS Change
Once everything is in working order, switch DNS entry to point df.arcs.org.au to arcs-df.vpac.org
Testing Procedures
Browse to http://arcs-df.vpac.org/ARCS/home and see if you can login.
Upload/download files to see if it is working. Also check that anonymous access to public areas under IMOS and worldview are functional.
Check webdav using Litmus.
Back-out Procedures
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-004 | 02 Jul 2009 - 16:19 | ---+!! Upgrade of grid virtual machines at Intersect
Description
Migrate the existing ng2, ngdata, and nggums virtual machines to CentOS 5.3 and VDT 1.10.1 on new hardware.
Estimated Duration
2 hours.
Systems/Services Affected
All ARCS grid services in NSW.
Staff Responsible
SimonYin
Detailed Instructions
Shutdown the existing ng2, ngdata and nggums virtual machines. Rename the new ng2, ngdata and nggums installations and test the grid workflow.
Testing Procedures
Tests 1 through 4 of the Globus 4 TestSuite? located here http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite will be performed. If they all succeed, this installation will be put into production.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated time, then these installations will be turned off, and the previous production installations will be brought back online.
Review
Schedule/Notification
Notifications are to be sent to the Community, Developers, and Techstaff mailing lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-005 | 21 Jul 2009 - 15:40 | ---+!! Migrate remaining ARCS Plone services to VPAC.
Description
Migration of an existing system to new infrastructure. Purpose is to strategically position the Plone infrastructure in a more centrally connected MARC and leverage the systems provided by the Data Services team. This outage secificly affects the ANDS site (http://plone.jcu.edu.au/ands)
Estimated Duration
Two hour. Which is the time taken for DNS entry to expire, plus the time to copy the data to VPAC.
Systems/Services Affected
plone.arcs.org.au
plone.jcu.edu.au/ands
Staff Responsible
RussellSim (Plone Migration)
Systems Staff Member (Provisioning VM)
Systems Staff Member (Enabling DNS plone.arcs.org.au)
Detailed Instructions
- Turn off Zope service on zope4.cluster on the JCU cluster
- Turn off Zeo service on zeo1.cluster on the JCU cluster
- Change Zeo Database to READ-ONLY
- Turn on Zeo service on zeo1.cluster on the JCU cluster
- Turn on Zope service on zope4.cluster on the JCU cluster
- Copy Data file from zeo1.cluster to db-vpac.arcs.org.au at VPAC.
- Turn on Zeo service at VPAC
- Turn on Zope service (plone.arcs.org.au) at VPAC
- Change website cache service url in portal_squidtool
- Change SMTP server location
- Replace plone.jcu.edu.au/ands(.*) with a 403 redirect to plone.arcs.org.au/ands(.*)
Testing Procedures
- Local data access.
- Test Backup procedure.
Back-out Procedures
- Disable redirect from plone.jcu.edu.au
- Change Zeo Database to at JCU to RW
Review
Schedule/Notification
6:30am 29th Jul 2009
Tracking completion
RT Ticket
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-006 | 08 Aug 2009 - 10:08 | ---+!! iRODS Update to createUser Script
Description
It has been found that some educational institutions (e.g. University of Queensland) include a courtesy title in their LDAP CN fields. In consequence, a user identified as "Ms Jane Doe" will be allocated a username like "mrs.doe".
The createUser script has therefore been enhanced so as to swallow the first word found in a CN field if that word can be found (case-transparently) as a "T" record in the createUser.config file.
The updated createUser and createUser.config files can be found at: http://projects.gridaus.org.au/trac/systems/wiki/DataServices/iRODS_Server For convenience, createUser and createUser.table (containing extensions to createUser.config) have been copied to directory: tango.vpac.org:/tmp/GJ
The arcs.irb file used on Production systems will also be updated (with addition of double-quote characters) during this Change to match what was previously done on Development machines.
Estimated Duration
The estimated durations for each Phase (including testing and backout steps) are as follows. There should be no interruption to service as seen by users.
Phase 1: 30 mins
Phase 2: 60 mins
Systems/Services Affected
Phase 1: irodsdev.vpac.org, data-dev.eresearchsa.edu.au, ngdata-dev.hpcu.uq.edu.au
Phase 2: arcs-df.hpcu.uq.edu.au, arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.sf.utas.edu.au, arcs-df.eresearchsa.edu.au, arcs-df.vpac.org
Staff Responsible
Graham Jenkins
Detailed Instructions
On each server, login as the 'rods' user and proceed as follows.
cd /tmp
scp graham@tango.vpac.org:/tmp/GJ/* .
cat createUser.table >>$IRODS_HOME/server/config/createUser.config
cp $IRODS_HOME/server/bin/cmd/createUser $IRODS_HOME/server/bin/cmd/createUser.20090707
cp createUser $IRODS_HOME/server/bin/cmd
For Phase 2 only, also use SVN update the arcs.irb file at: https://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/Rules/ so that the acGetUserByDN rule looks like this:
acGetUserByDN(*arg,*OUT)||msiExecCmd(createUser,'"*arg"',null,null,null,*OUT)|nop
Testing Procedures
Perform the following test, and check that the name of the created user is printed as: jane.doe
export irodsConfigDir=$IRODS_HOME/server/config
$IRODS_HOME/server/bin/cmd/createUser "/C=AU/O=APACGrid/OU=VPAC/CN=Ms Jane Doe"
iadmin rmuser jane.doe
Back-out Procedures
Perform the following, then repeat the above test.
cp $IRODS_HOME/server/bin/cmd/createUser.20090707 $IRODS_HOME/server/bin/cmd/createUser
Review
Schedule/Notification
Phase 1: 3:00 pm Tuesday July 7. Update: 11:00 am Thursday July 9.
Phase 2: 4:00 pm Tuesday July 7. Update: 9:00 am Friday July 10.
Tracking completion
Phase 1: Completed 3:20 pm Tuesday July 7. Update Completed 11:30 am Thursday July 9.
Phase 2: Completed 4:50 pm Tuesday July 7. Update Completed 10:10 am Friday July 10.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-007 | 19 Aug 2009 - 12:12 | ---+!! Installing AWSTATS
Description
Install Awstats to analyze your Apache log.
Estimated Duration
20 mins
Systems/Services Affected
DF
Staff Responsible
Shunde Zhang
Detailed Instructions
Get awstats
wget "http://prdownloads.sourceforge.net/awstats/awstats-6.9.tar.gz"
Extract it to /usr/local as root
tar xfvz awstats-6.9.tar.gz
mv awstats-6.9 /usr/local/awstats
Execute configuration script.
cd /usr/local/awstats/tools
perl awstats_configure.pl
And answer the questions
----- AWStats awstats_configure 1.0 (build 1.8) (c) Laurent Destailleur -----
This tool will help you to configure AWStats to analyze statistics for
one web server. You can try to use it to let it do all that is possible
in AWStats setup, however following the step by step manual setup
documentation (docs/index.html) is often a better idea. Above all if:
- You are not an administrator user,
- You want to analyze downloaded log files without web server,
- You want to analyze mail or ftp log files instead of web log files,
- You need to analyze load balanced servers log files,
- You want to 'understand' all possible ways to use AWStats...
Read the AWStats documentation (docs/index.html).
-----> Running OS detected: Linux, BSD or Unix
-----> Check for web server install
Enter full config file path of your Web server.
Example: /etc/httpd/httpd.conf
Example: /usr/local/apache2/conf/httpd.conf
Example: c:\Program files\apache group\apache\conf\httpd.conf
Config file path ('none' to skip web server setup):
> /etc/httpd/conf/httpd.conf
-----> Check and complete web server config file '/etc/httpd/conf/httpd.conf'
Add 'Alias /awstatsclasses "/usr/local/awstats/wwwroot/classes/"'
Add 'Alias /awstatscss "/usr/local/awstats/wwwroot/css/"'
Add 'Alias /awstatsicons "/usr/local/awstats/wwwroot/icon/"'
Add 'ScriptAlias /awstats/ "/usr/local/awstats/wwwroot/cgi-bin/"'
Add '<Directory>' directive
AWStats directives added to Apache config file.
-----> Update model config file '/usr/local/awstats/wwwroot/cgi-bin/awstats.model.conf'
File awstats.model.conf updated.
-----> Need to create a new config file ?
Do you want me to build a new AWStats config/profile
file (required if first install) [y/N] ? y
-----> Define config file name to create
What is the name of your web site or profile analysis ?
Example: www.mysite.com
Example: demo
Your web site, virtual server or profile name:
> df.arcs.org.au
-----> Define config file path
In which directory do you plan to store your config file(s) ?
Default: /etc/awstats
Directory path to store config file(s) (Enter for default):
>
-----> Create config file '/etc/awstats/awstats.df.arcs.org.au.conf'
Config file /etc/awstats/awstats.df.arcs.org.au.conf created.
-----> Restart Web server with '/sbin/service httpd restart'
Stopping httpd: [ OK ]
Starting httpd: [ OK ]
-----> Add update process inside a scheduler
Sorry, configure.pl does not support automatic add to cron yet.
You can do it manually by adding the following command to your cron:
/usr/local/awstats/wwwroot/cgi-bin/awstats.pl -update -config=df.arcs.org.au
Or if you have several config files and prefer having only one command:
/usr/local/awstats/tools/awstats_updateall.pl now
Press ENTER to continue...
A SIMPLE config file has been created: /etc/awstats/awstats.df.arcs.org.au.conf
You should have a look inside to check and change manually main parameters.
You can then manually update your statistics for 'df.arcs.org.au' with command:
> perl awstats.pl -update -config=df.arcs.org.au
You can also read your statistics for 'df.arcs.org.au' with URL:
> http://localhost/awstats/awstats.pl?config=df.arcs.org.au
Press ENTER to finish...
Add a cron job to update stats
crontab -e
0 * * * * /usr/local/awstats/tools/awstats_updateall.pl now >/dev/null 2>&1 || :
Edit configuration file
vi /etc/awstats/awstats.df.arcs.org.au.conf
LogFile="cat /var/log/httpd/ssl_request_log /var/log/httpd/ssl_request_log.1 |"
Edit /etc/httpd/conf.d/ssl.conf to use combineio format
#CustomLog logs/ssl_request_log \
# "%t %h %{SSL_PROTOCOL}x %{SSL_CIPHER}x \"%r\" %b"
CustomLog logs/ssl_request_log \
"%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\" %I %O"
Create a directory for awstats results
mkdir /var/lib/awstats
Analyze Apache log for the first time
/usr/local/awstats/tools/awstats_updateall.pl now
Apply basic authentication to awstats. Firstly, generate a htpasswd file.
cd /etc/httpd/conf.d
htpasswd -c http_passwd arcs-data
password: davisguru
Modify httpd.
vi /etc/httpd/conf/httpd.conf
<Directory "/usr/local/awstats/wwwroot">
Options None
AllowOverride None
Order allow,deny
Allow from all
AuthType Basic
AuthName "ARCS Data"
AuthUserFile /etc/httpd/conf.d/http_passwd
Require valid-user
</Directory>
Reload HTTPD
/etc/init.d/httpd reload
Testing Procedures
Go to http://df.arcs.org.au/awstats/awstats.pl?config=df.arcs.org.au and see if you can get the result page.
Back-out Procedures
Remove /etc/awstats, /usr/local/awstats and delete relevant entries from httpd.conf
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-008 | 03 Sep 2009 - 10:52 | ---+!! Installing MAMS level-1 SP
Description
Install a SP (MAMS level-2) to get shibboleth authentication.
Estimated Duration
1 hour
Systems/Services Affected
ARCS DF
Staff Responsible
Shunde Zhang
Detailed Instructions
Register your SP on MAMS registry. Go to http://www.federation.org.au/FedManager/jsp/index.jsp and create a new Host, under that Host, create a new SP (required attributes are sharedToken and cn).
Install SP 2.1, get these RPMs
wget http://shibboleth.internet2.edu/downloads/shibboleth/cppsp/archive/2.1/RPMS/i386/RHE/5/log4shib-1.0-1.i386.rpm
wget http://shibboleth.internet2.edu/downloads/shibboleth/cppsp/archive/2.1/RPMS/i386/RHE/5/opensaml-2.1-1.i386.rpm
wget http://shibboleth.internet2.edu/downloads/shibboleth/cppsp/archive/2.1/RPMS/i386/RHE/5/shibboleth-2.1-1.i386.rpm
wget http://shibboleth.internet2.edu/downloads/shibboleth/cppsp/archive/2.1/RPMS/i386/RHE/5/xerces-c-2.8.0-1.i386.rpm
wget http://shibboleth.internet2.edu/downloads/shibboleth/cppsp/archive/2.1/RPMS/i386/RHE/5/xml-security-c-1.4.0-1.i386.rpm
wget http://shibboleth.internet2.edu/downloads/shibboleth/cppsp/archive/2.1/RPMS/i386/RHE/5/xmltooling-1.1-1.i386.rpm
Install dependencies and the above RPMs
yum install libicu unixODBC
rpm -ivh *
Get a pre-made configuration file and server cert.
cd /etc/shibboleth
wget http://projects.arcs.org.au/trac/systems/attachment/wiki/HowTo/InstallSLCSServer2/shibboleth2.xml?format=raw -O /etc/shibboleth/shibboleth2.xml
wget http://projects.arcs.org.au/trac/systems/attachment/wiki/HowTo/InstallSLCSServer2/attribute-map.xml?format=raw -O /etc/shibboleth/attribute-map.xml
wget http://www.federation.org.au/twiki/pub/Federation/UpdateMetadata/www.federation.org.au.pem
Generate a certificate for back-channel communication.
mkdir /etc/certs && cd /etc/certs
openssl req -newkey rsa:1024 -nodes -keyout mykey.pem -out newreq.pem
cat newreq.pem
On the SP machine, open up a browser (elinks) and go to this URL http://www.federation.org.au/CA/CA-sign.html
Open the file newreq.pem with a text editor and copy the section between:
-----BEGIN CERTIFICATE REQUEST-----
......
-----END CERTIFICATE REQUEST-----
Make sure you also include the above lines when copying.
Paste the information into the form on the browser and hit Submit. Copy the resultant certificate into a file and call it mycert.pem.
Make sure that your SP user (or whatever user that you use for SP shibd daemon) has access to these certificate files
Note that if you do not run the browser from the same machine as your SP, you will not be able to obtain a certificate
Download the level-1 CA certificate from this URL: http://www.federation.org.au/level-1-ca.crt into /etc/certs directory
Check that at this point you have the three files:
- Your SP x509 certificate: mycert.pem
- Your SP key: mykey.pem
- Level-1 CA x509 certificate: level-1-ca.crt
Modify shibboleth2.xml, change MY_DNS to df.arcs.org.au. And change the following section to enable metadata auto-update.
<!-- Chains together all your metadata sources. -->
<MetadataProvider type="Chaining">
<!-- Example of remotely supplied batch of signed metadata. -->
<MetadataProvider type="XML" uri="https://www.federation.org.au/level-1/level-1-metadata.xml"
backingFilePath="level-1-metadata.xml" reloadInterval="7200">
<SignatureMetadataFilter certificate="www.federation.org.au.pem"/>
</MetadataProvider>
<!-- Example of locally maintained metadata. -->
<!--
<MetadataProvider type="XML" file="level-1-metadata.xml"/>
-->
</MetadataProvider>
Restart httpd and start shibd
/etc/init.d/shibd start
/etc/init.d/httpd start
Testing Procedures
Put a PHP test file in /var/www/html/secure to test.
<?php
// Version: 2007-07-26
// Latest version of this script will be on https://aai-viewer.switch.ch/viewer.php?source
//
// Author: Lukas Haemmerle <lukas.haemmerle@switch.ch>
// Bug reports etc. please to aai@switch.ch
// Specify your attribute-map.xml file and make sure it is readable by the web server
$attribute_map_file = '/etc/shibboleth/attribute-map.xml';
//Set header
header('Content-type: text/html; charset=utf-8');
// Show source
if (isset($_REQUEST['source'])) {
highlight_file(__FILE__);
exit;
}
?><!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<title>SWITCH Attribute Viewer</title>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<style type="text/css">
<!--
a
{
color: #1B3E93;
font-size: 14px;
font-weight: bold;
text-decoration: none;
}
a:hover
{
color: #FE911B;
text-decoration: underline;
}
body
{
background-color: white;
font-family: Verdana, Arial, Helvetica, sans-serif;
}
h1
{
font-family: Verdana, Arial, Helvetica, sans-serif;
font-size: 18px;
font-weight: bold;
text-decoration: none;
}
.logo
{
color: white;
text-decoration: none;
}
.border-blue
{
border-style: solid;
border-width: 1px;
border-color: rgb(0,43,127);
font-family: Verdana, Arial, Helvetica, sans-serif;
font-size: 13px;
}
.border-orange
{
border-style: solid;
border-width: 1px;
border-color: rgb(226,140,5);
font-family: Verdana, Arial, Helvetica, sans-serif;
font-size: 13px;
}
.blue
{
color: rgb(0,43,127);
}
.orange
{
color: rgb(226,140,5);
}
td.top-left {
background-image: url('http://www.switch.ch/aai/design/images/topleft.gif');
height: 14px;
width: 14px;
}
td.top-middle {
background-image: url('http://www.switch.ch/aai/design/images/topcenter.gif');
height: 14px;
}
td.top-right {
background-image: url('http://www.switch.ch/aai/design/images/topright.gif');
height: 14px;
width: 14px;
}
td.middle-left {
background-image: url('http://www.switch.ch/aai/design/images/middleleft.gif');
width: 14px;
}
td.middle-right {
background-image: url('http://www.switch.ch/aai/design/images/middleright.gif');
width: 14px;
}
td.bottom-left {
background-image: url('http://www.switch.ch/aai/design/images/bottomleft.gif');
height: 14px;
width: 14px;
}
td.bottom-middle {
background-image: url('http://www.switch.ch/aai/design/images/bottomcenter.gif');
height: 14px;
}
td.bottom-right {
background-image: url('http://www.switch.ch/aai/design/images/bottomright.gif');
height: 14px;
width: 14px;
}
-->
</style>
</head>
<body>
<div align="center">
<table border="0" cellpadding="0" cellspacing="0">
<!-- top left corner + middle bar + right corner -->
<tr>
<td class="top-left"></td>
<td class="top-middle"></td>
<td class="top-right"></td>
</tr>
<!-- left border + content + right border -->
<tr>
<!-- left border -->
<td class="middle-left"></td>
<!-- content -->
<td>
<a class="logo" href="http://www.switch.ch/aai">
<img alt="SWITCHaai logo" src="http://www.switch.ch/aai/design/images/switch-aai-logo.gif" style="padding-bottom: 7px" /></a>
<h1>SWITCH Attribute Viewer</h1>
<!-- table content -->
<table width="100%">
<tr>
<td class="blue">Attributes</td><td class="orange">Values</td>
</tr>
<?php
$attribute_map_lines = file($attribute_map_file);
$attribute_map = '';
foreach($attribute_map_lines as $line){
$attribute_map .= $line;
}
$p = xml_parser_create();
xml_parse_into_struct($p, $attribute_map, $vals, $index);
xml_parser_free($p);
$HTTP_SHIB_HEADERS = array();
foreach ($vals as $element){
if ($element['tag'] == 'ATTRIBUTE' && isset($element['attributes']['ID'])){
$HTTP_SHIB_HEADERS[$element['attributes']['ID']] = $element['attributes']['ID'];
}
}
// Dump all received Shibboleth attributes
$status = '00';
foreach ($_SERVER as $key => $value){
// Do we have any variables defined in attribute map
if (isset($HTTP_SHIB_HEADERS[$key])){
$status[1] = '1';
echo '<tr valign="top">';
echo '<td class="border-blue" valign="top">'.$HTTP_SHIB_HEADERS[$key].'</td>';
$clean_value = ereg_replace('\$','<br>',htmlspecialchars(stripslashes($value)));
if (ereg(';', $clean_value)){
$clean_value = ereg_replace(";",'</tt></li><li><tt>',$clean_value);
$clean_value = '<ul><li><tt>'.$clean_value.'</tt></li></ul>';
}
else {
$clean_value = '<tt>'.$clean_value.'</tt>';
}
echo '<td class="border-orange">'.$clean_value.'</td>';
echo '</tr>';
}
// or any attributes starting with Shib-
elseif (eregi('Shib', $key) ) {
$status[0] = '1';
echo '<tr>';
echo '<td class="border-blue" valign="top"><span style="color: grey; font-style:italic;">'.$key.'</span></td>';
echo '<td class="border-orange"><span style="color: grey; font-style:italic;"><tt>'.wordwrap(htmlspecialchars($value), 70, "<br/>\n", true).'</tt></span></td>';
echo "</tr>\n";
}
}
if (isset($_REQUEST['assertions'])) {
$counter = 1;
foreach ($_SERVER as $key=>$value){
// Check if it is an assertion
if (ereg('Shib-Assertion-Count', $key) || !eregi('Shib-Assertion', $key)) {
continue;
}
// Download the assertion
$value = ereg_replace('dieng.switch.ch','127.0.0.1',stripslashes($value));
$assertion = '';
$handle = fopen($value, 'rb');
if ($handle){
while (!feof($handle)) {
$tmp = fread($handle, 8192);
if (!$tmp){
break;
}
$assertion .= $tmp;
}
fclose($handle);
}
echo '<tr><td colspan="2" class="border-orange"><h4>Assertion '.$counter.':</h4>';
$assertion = ereg_replace('<', "\n<", $assertion);
$assertion = preg_replace('/>(.+)/', ">\n$1", $assertion);
$assertion = preg_replace("/\s([\S]+)=\"([^\"]+)/", "\n$1=\"$2",$assertion);
$elements = preg_split('/\n/',$assertion);
echo '<pre>';
$indent = -1;
foreach ($elements as $element){
if (ereg('</', $element) && ereg('/>', $element)){
echo '';
}
else if (ereg('/>', $element)){
$reduce_indent = true;
echo '';
}
elseif (ereg('</', $element)){
$reduce_indent = true;
}
elseif (ereg('<', $element)){
$indent++;
}
else {
echo ' ';
}
for($i = 0; $i < $indent; $i++)
echo ' ';
// Syntax highlighting
$element = ereg_replace('<','<',$element);
$element = ereg_replace('>','>',$element);
$element = preg_replace('/(\w+)="(.+)"/', " <span style=\"color:green\">$1</span>=<span style=\"color:brown\">"$2"</span>", $element);
$element = preg_replace('/(<.+)/', "<span style=\"color:blue\">$1</span>", $element);
$element = ereg_replace('span>>', "span><span style=\"color:blue\">></span>", $element);
if (!ereg('=', $element) && !ereg('<', $element))
echo '<span style="color:black;">'.wordwrap($element."\n", 120, "\n", 1).'</span>';
else
echo wordwrap($element."\n", 120, "\n", 1);
if ($reduce_indent){
$indent--;
$reduce_indent = false;
}
}
echo '</pre>';
echo '</td></tr>';
$counter++;
}
}
// Check status
if ($status == '10' ) {
echo '<tr>';
echo '<td colspan=2><b>Valid Shibboleth session but no user attributes received!</b></td>';
echo '</tr>';
echo '<tr>';
echo '<td colspan=2>Hint to Home Organization administrators:<br>Please verify your metadata and ARP files.</td>';
echo '</tr>';
}
elseif ($status == '00') {
echo '<tr>';
echo '<td colspan=2><b>No valid Shibboleth session!</b></td>';
echo '</tr>';
echo '<tr>';
echo '<td colspan=2>This web page is probably not protected with Shibboleth. Hint to Home Organization administrators:<br>Have a look at your web server or Shibboleth configuration</td>';
echo '</tr>';
}
else {
?>
<tr>
<td colspan="2" align="center">
<?php
if (isset($_REQUEST['assertions'])) {
echo '<a href=".">Hide Shibboleth assertions</a>';
}
else {
echo '<a href="?assertions">Show Shibboleth assertions</a>';
}
if (isset($_REQUEST['all_variables'])) {
echo ' | <a href=".">Hide all HTTP variables</a>';
}
else {
echo ' | <a href="?all_variables">Show all HTTP variables</a>';
}
?>
| <a href="?source">Show PHP source</a>
</td>
</tr>
<?php
}
?>
</table>
<!-- end content -->
</td>
<!-- right border -->
<td class="middle-right"></td>
</tr>
<!-- bottom left corner + middle bar + right corner -->
<tr>
<td class="bottom-left"></td>
<td><img src="http://www.switch.ch/aai/design/images/bottomcenter.gif" height="14" width="100%" alt="bottomcenter" /></td>
<td class="bottom-right"></td>
</tr>
</table>
<!-- all HTTP variables -->
<?php
if (isset($_REQUEST['all_variables'])) {
?>
<p> </p>
<table>
<tr>
<td><strong>HTTP Environment Variables</strong></td><td><strong>Raw Values</strong></td></tr>
<?php
ksort($_SERVER);
foreach ($_SERVER as $key => $value) {
if ( ereg('^Shib-', $key) || isset($HTTP_SHIB_HEADERS[$key]))
{
$class= "border-orange";
}
else {
$class= "border-blue";
}
echo '<tr valign="top">';
echo '<td class="'.$class.'">'.$key.'</td>';
if (is_array($value)){
echo '<td class="'.$class.'">';
if (!empty($value)){
echo '<ul>';
foreach($value as $item){
echo '<li><tt>'.wordwrap($item, 70, '<br>', true).'</tt></li>';
}
echo '</ul>';
}
echo '</td>';
} else {
echo '<td class="'.$class.'"><tt>'.wordwrap(htmlspecialchars(stripslashes($value)), 70, '<br>', true).'</tt></td>';
}
echo "</tr>\n";
}
?>
</table>
<?php
}
?>
</div>
</body>
</html>
If the file is called viewer.php (You need to install PHP by "yum install php"), then go to https://df.arcs.org.au/secure/viewer.php. You should see the WAYF. If login successfully, you'll see an attribute list with your SharedToken? and common name, etc.
Back-out Procedures
Stop shibd, remove config from Apache, remove RPMs by "rpm -e"
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-009 | 18 Jul 2009 - 21:15 | ---+!! Upgrade of Grid VMs at eRSA
Description
Replace the existing ng2, ngdata, and nggums VMs with newer versions based on CentOS 5.3 and VDT 1.10.1 on Arcturus.
Estimated Duration
4 hours allocated, 2 hours expected.
Systems/Services Affected
All Grid services provided at eRSA.
Staff Responsible
Garth Denley
Detailed Instructions
- Shut down the existing ng2, ngdata, and nggums machines.
- Using the pre-built and tested ng2new, ngdatanew, and nggums new VMs, change the IP address and hostnames for each to that of ng2, ngdata, and nggums.
- Search through each machine for references to ng2, ngdata, and nggums using standard tools (eg. "find" and "grep"). Check each reference is correct.
- Reboot each machine to ensure that it comes up cleanly with the new configuration.
- Test the new machines, performing changes as problems are discovered.
Testing Procedures
- Using local variants of the tests on http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite , test that job submission to all three clusters (hydra, aquila, corvus) works successfully, as can be best determined by current availability and reliability of those clusters.
- Manually test gridftp transfers on ngdata.
- Using the web interface on nggums, test various VO / cert combinations to confirm they are mapped to correct accounts.
- Confirm that MDS information is being published correctly.
- Confirm that job logs from all three clusters successfully reaches ng2new/ng2.
Back-out Procedures
- Change the IP and hostname details for the new machines back to their previous state.
- Bring the original ng2, ngdata, and nggums VMs back up.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-010 | 18 Jul 2009 - 20:58 | ---+!! Migration of unicast bridge in Perth to new hardware
Description
The unicast bridge agbridge-perth.arcs.org.au will be moved from aging hardware to a virtual machine hosted on new hardware.
Estimated Duration
Two hours.
Systems/Services Affected
AccessGrid unicast. However, there are two other unicast bridges within ARCS that can still be used: agbridge-canberra.arcs.org.au and agbridge-melb.arcs.org.au.
Staff Responsible
DarranCarey
Detailed Instructions
The existing system will be shutdown. The new system will be renamed and brought into service.
Testing Procedures
The system will be minimally tested after installation on Sunday July 19th. Detailed testing will be performed during the AG Test Session Monday July 20th.
Back-out Procedures
If any problem are encountered with the new system, it will be shutdown for later triage, and the existing system will be restarted.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-011 | 28 Jul 2009 - 14:34 | ---+!! TITLE
Enabling Postgres WAL Archiving On Datafabric
Description
We will use archived Write Ahead Log(WAL) files, to prepare backup to Postgres database. We will archive WAL files to another machine(arcs-df.vpac.org) so we can have offsite backup. To archive files to arcs-df.vpac.org, we need to have password less access between two machine[arcs-df.ivec.org & arcs-df.vpac.org] as its already been setup between these machines I am not adding that part in this changenote.
Estimated Duration
30mins
Systems/Services Affected
arcs-df.ivec.org
Staff Responsible
Sridhar Reddapani
Detailed Instructions
Backup of the existing database
- The master database will be backed up using pgdump.
Stop Davis, iRODS and Postgres in sequence
- service httpd stop
- service davis stop
- service irods stop
- service postgres stop
Modify postgresql.conf file, add below line from line 183
- archive_mode = on
- archive_command = 'ssh arcs-df.vpac.org test ! -f /data/DataFabric_Backups/Current_Wal_Archives/%f && rsync -az %p arcs-df.vpac.org:/data/DataFabric_Backups/Current_Wal_Archives/%f'
- checkpoint_timeout = 1h
- archive_timeout = 12h
* Start Postgres, iRODS, davis, httpd in sequence*
- service postgres start
- service irods start
- service davis start
- service httpd start
Testing Procedures
Test whether files WAL files are being copied to arcs-df.vpac.org:/data/DataFabric_Backups/Current_Wal_Archives/ when they are fill or every 12 Hours.
Back-out Procedures
- Stop all services in sequence as above
- comment out above lines in postgresql.conf file
- Start all services in sequence as above
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-012 | 11 Aug 2009 - 11:29 | ---+!! Upgrade of grid virtual machines at JCU
Description
Replace the existing ng2 and nggums virtual machines with newer versions based on CentOS 5.3 and VDT 1.10.1
Estimated Duration
4 hours allocated, 2 hours expected
Systems/Services Affected
All Grid services provided at JCU
Staff Responsible
WillHsu
Detailed Instructions
Shutdown the existing ng2 and nggums virtual machines. Rename the new ng2 and nggums installations and test the grid workflow.
Restart INCA to make sure gridftp tests can be run from ng2.
Testing Procedures
Using local variants of the tests on http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite , test that job submission to cluster is successful.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated time, then these installations will be turned off, and the previous production installations will be brought back online.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-013 | 25 Jul 2009 - 20:51 | ---+!! Upgrade of grid virtual machines at NCI NF
Description
Migrate the existing NCI NF ng2 and nggums virtual machines to CentOS? 5.3 and VDT 1.10.1.
Estimated Duration
4 hours allocated, 2 hours expected.
Systems/Services Affected
NCI NF Grid Services
Staff Responsible
PaulWarren
Detailed Instructions
Shutdown the existing ng2 and nggums virtual machines. Rename the new ng2 and nggums installations and test the grid workflow.
Testing Procedures
Tests 1 through 4 of the Globus 4 TestSuite? located here http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite will be performed. If they all succeed, this installation will be put into production.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated time, then these installations will be turned off, and the previous production installations will be brought back online.
Review
Schedule/Notification
Notifications are to be sent to the Community, Developers, and Techstaff mailing lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200907-014 | 01 Aug 2009 - 19:49 | ---+!! Grisu backend fixes
Description
Two changes to the Grisu backend web service are to be implemented to improve client performance.
Estimated Duration
30 minutes.
Systems/Services Affected
Grisu.
Staff Responsible
MarkusBinsteiner, SeanFleming
Detailed Instructions
An updated version of the webservice backend has been built. The tomcat server on grisu.vpac.org needs to be shutdown, the old war file needs to be replaced with the new one and tomcat needs to be restarted. That's all.
This adds better gridftp file listing speed and a force-mpi tag processing capability to the default ARCS Grisu backend.
Testing Procedures
Once the changes are in place and the web service has been restarted, a series of jobs will be submitted to various sites.
Back-out Procedures
If any issues arise during the testing procedures, the changes will be backed out by restoring the web service to it previous configuration and restarting the service. The testing phase will be rerun to ensure the service is operational.
Review
Schedule/Notification
Notification to be sent to the Community, Developers and Techstaff mailing lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-001 | 07 Aug 2009 - 15:52 | ---+!! TITLE
Description
Moving Datafabric ICAT from arcs-df.ivec.org to arcs-df.vpac.org
Estimated Duration
1 Hour
Systems/Services Affected
All Datafabric Servers
- 1. arcs-df.vpac.org
- 2. arcs-df.ivec.org
- 3. arcs-df.ac3.edu.au
- 4. arcs-df.sf.utas.edu.au
- 5. arcs-df.hpcu.uq.edu.au
- 6. arcs-df.eresearchsa.edu.au
Staff Responsible
Sridhar Reddapani
Detailed Instructions
Stopping required services
- On arcs-df.ivec.org
- service httpd stop
- service davis stop
- service irods stop
- On arcs-df.vpac.org, arcs-df.ac3.edu.au, arcs-df.sf.utas.edu.au, arcs-df.hpcu.uq.edu.au and arcs-df.eresearchsa.edu.au
Take ICAT backup on arcs-df.ivec.org with pg_dump
- mkdir /tmp/Backups
- pg_dump ICAT > /tmp/Backups/ICAT_DUMP_IVEC
Take Full ICAT backup on arcs-df.ivec.org
- service postgres stop
- su - rods
- cd ~/DB/pgsql
- tar -cf /tmp/Backups/data.tar data/
Burn both ICAT_DUMP_IVEC and data.tar files to DVD and compare md5 sum of original files with copy on DVD, as additional backup
Copy ICAT_DUMP_IVEC from arcs-df.ivec.org to arcs-df.vpac.org
- scp /tmp/Backups/ICAT_DUMP_IVEC arcs-df.vpac.org:/tmp/
Rebuild the iRODS on arcs-df.vpac.org with ICAT enabled
Tuning performence of postgres
- Edit postgresql.conf as
- listen_addresses = '*'
- max_connections = 250
- shared_buffers = 2048MB
- effective_cache_size = 4194MB
- autovacuum = on
Allow access to postgres from all other DF servers
- Edit pg_hba.conf as
- host ICAT rods 127.0.0.1/32 trust
- # arcs-df.ivec.org
- host ICAT rods 192.65.130.247/32 trust
- # arcs-df.vpac.org
- host ICAT rods 202.158.218.58/32 trust
- # arcs-df.ac3.edu.au
- host ICAT rods 203.202.30.74/32 trust
- # arcs-df.sf.utas.edu.au
- host ICAT rods 202.6.77.50/32 trust
- # arcs-df.hpcu.uq.edu.au
- host ICAT rods 130.102.163.138/32 trust
- # arcs-df.eresearchsa.edu.au
- host ICAT rods 129.127.96.93/32 trust
Configure Postgres WAL archiving on arcs-df.vpac.org
- Enable password less ssh between arcs-df.vpac.org and arcs-df.ac3.edu.au
- Edit /opt/iRODS-2.0v/Postgres/pgsql/data/postgresql.conf and add
- archive_mode = on
- archive_command = 'ssh arcs-df.ac3.edu.au test ! -f /data/DataFabric_Backups/Current_Wal_Archives/%f && rsync -az %p arcs-df.ac3.edu.au:/data/DataFabric_Backups/Current_Wal_Archives/%f'
- checkpoint_timeout = 1h
- archive_timeout = 12h
Start postgres on arcs-df.vpac.org and restore ICAT with backup file
- cp -r /opt/iRODS-2.0v/Postgres/pgsql/data /tmp/data_bkp
- service postgres start
- dropdb ICAT
- createdb ICAT
- psql ICAT < /tmp/ICAT_DUMP_IVEC
Configure iRODS on all servers talk to arcs-df.vpac.org
- edit ../server/config/server.config as
- icatHost arcs-df.vpac.org
Start all required services
- On arcs-df.vpac.org,arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.sf.utas.edu.au, arcs-df.hpcu.uq.edu.au and arcs-df.eresearchsa.edu.au*
Continue with Changenote http://wiki.arcs.org.au/bin/view/Main/ChangeNote200907-003 to move davis from arcs-df.ivec.org to arcs-df.vpac.org
Testing Procedures
- Try accessing DF with icommands
- Test auto user creation is working on arcs-df.vpac.org
- Test all rules are functional on arcs-df.vpac.org
Back-out Procedures
Stopping required services
- On arcs-df.vpac.org
- service irods stop
- service postgres stop
- On arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.sf.utas.edu.au, arcs-df.hpcu.uq.edu.au and arcs-df.eresearchsa.edu.au
Configure all irod servers talk to arcs-df.ivec.org for ICAT
- edit ../server/config/server.config as
- icatHost arcs-df.ivec.org
Start all required services
- On arcs-df.ivec.org
- service postgres start
- service irods start
- On arcs-df.vpac.org, arcs-df.ac3.edu.au, arcs-df.sf.utas.edu.au, arcs-df.hpcu.uq.edu.au and arcs-df.eresearchsa.edu.au*
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-002 | 06 Aug 2009 - 18:27 | ---+!! Upgrade of grid virtual machines at UQ/UQ ESSCC
Description
Replace the existing ng2, nggums, and ngdata virtual machines with newer versions based on CentOS 5.3 and VDT 1.10.1
Estimated Duration
8 hours allocated, 4 hours expected, 2 hours per each site.
Systems/Services Affected
All Grid services provided at UQ and UQ ESSCC
Staff Responsible
WillHsu
Detailed Instructions
Shutdown the existing ng2, nggums, and ngdata virtual machines. Rename the new installations and test the grid workflow.
Testing Procedures
Using local variants of the tests on http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite , test that job submission to cluster is successful.
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated time, then these installations will be turned off, and the previous production installations will be brought back online.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-003 | 04 Sep 2009 - 10:32 | ---+!! Update arcs.org.au Plone site to update products/features
Description
The ARCS Plone website requires a configuration update in order to introduce several new features to the site (forum improvements [email notification, improved post/thread management], Google Apps integration) and to, at the same time, update a number of existing 3rd party add-on products for stability and security. This outage affects the ARCS site located at http://www.arcs.org.au/
Estimated Duration
Two hours, in which the configuration will be updated and products reinstalled on the site.
Systems/Services Affected
ARCS Plone Site: http://www.arcs.org.au/
Staff Responsible
DavidBreitkreutz
Detailed Instructions
- Pull updated ARCS buildout configuration onto the server
- Re-run the buildout process to pull in the updated products/changes
- Pull down the latest version of the ARCS customisations product
- Stop the ARCS Plone server process
- Start the ARCS Plone server process in the foreground to make sure it is working after the update
- Stop the ARCS Plone server process
- Start the ARCS Plone server process normally
- On the website front end, reinstall each relevant product that required updating
Testing Procedures
- Verify site process is operational again
- Verify ARCS site is accessible via the web interface
Back-out Procedures
- Restore previous configuration
- Re-run buildout process to restore previous products
- Undo any changes that were carried out on the site.
Review
Schedule/Notification
6:30am 11th September 2009
Tracking completion
RT Ticket
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-004 | 14 Sep 2009 - 11:10 | ---+!! Upgrading iRODS from 2.0.1 to 2.1 on DataFabric
Upgrading iRODS from 2.0.1 to 2.1 on DataFabric
Description
Upgrading iRODS from 2.0.1 to 2.1 on DataFabric
Estimated Duration
3 hours
Systems/Services Affected
All DF Servers
- arcs-df.vpac.org
- arcs-df.ivec.org
- arcs-df.ac3.edu.au
- arcs-df.sf.utas.edu.au
- arcs-df.hpcu.uq.edu.au
- arcs-df.eresearchsa.edu.au
- arcs-df.hpsc.csiro.au
Staff Responsible
Sridhar Reddapani, Graham Jenkins, Pauline Mak
Detailed Instructions
Note: This documentation is on assumption, that you have installed iRODS2.0.1v at /opt/iRODS-2.0v/iRODS and postgre at /opt/iRODS-2.0v/Postgres
On arcs-df.ivec.org irods is installed at /opt/rods/iRODS and postgres is at /opt/rods/DB
On arcs-df.eresearchsa.edu.au irods is installed at /data/irods/iRODS and postgres is at /data/irods/Postgres
Preliminary Filesystem Reorganisation on arcs-df.vpac.org
As user root run
- chkconfig --add httpd
- init 2
- fuser -c /opt # Ensure that no processes are shown
- mv /opt /opt-20090912
- ln -s /data /opt
- cd /opt-20090912
- find . -print | cpio -pdm /opt
- init 3 # Check iRODS/Davis functionality
VDT Upgrade [all servers]
Ref: http://vdt.cs.wisc.edu/releases/1.10.1/release-p24.html
As user root run
- . /opt/vdt/setup.sh
- cd $VDT_LOCATION
- vdt-control --off
- pacman -get http://vdt.cs.wisc.edu/vdt_1101_cache:VDT-Updater
- pacman -update VDT-Updater
- mkdir /opt/vdt-20080912
- cp -pr $VDT_LOCATION /opt/vdt-20080912
- vdt/update/vdt-updater
- .. Absolute path to backup: /opt/vdt-20080912/vdt
- vdt-control --on
- service irods restart # Check iRODS/Davis functionality
Upgrading ICAT enabled servers[arcs-df.vpac.org]
As user root run
- service davis stop
- service httpd stop
- service irods stop
- mkdir /opt/iRODS-2.1v
- chown -R rods:rods /opt/iRODS-2.1v
- Install openssl-devel [if not already installed]
- yum install openssl-devel
- su - rods
As user rods run
- Download irods2.1.tgz to /tmp
- cd /opt/iRODS-2.1v
- tar -xvf /tmp/irods2.1.tgz
- mkdir /tmp/patch
- cd /tmp/patch
- wget ftp://ftp.sdsc.edu/pub/outgoing/mwan/irods/patchFor2.1.tar
- tar -xvf patchFor2.1.tar
- cp -r patchFor2.1/* /opt/iRODS-2.1v/iRODS
- unset IRODS_HOME
- export PATH=`echo $PATH | sed -e 's/iRODS/xRODS/g'`
- cd /opt/iRODS-2.1v/iRODS/
- cp /opt/iRODS-2.0v/iRODS/config/irods.config config/
- /opt/iRODS-2.0v/Postgres/pgsql/bin/psql ICAT < server/icat/src/psg-patch-v2.0.1tov2.1.sql
- ./irodssetup --upgrade
- cd /opt/iRODS-2.1v/iRODS
- mkdir -p server/bin/local
- cd server/bin/local
- wget http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/Rules/updateRules.sh
- chmod a+xr updateRules.sh
- edit crontab [crontab -e] as below to point to new iRODS_HOME=/opt/iRODS/iRODS
- 0 6 * * * /usr/local/bin/irodsLogPrune.sh /opt/iRODS/iRODS >/dev/null 2>&1 || :
- 0 * * * * /opt/iRODS/iRODS/server/bin/local/updateRules.sh YOUR_DEFAULT_RESOURCE >/dev/null 2>&1 || :
- Run "/opt/iRODS/iRODS/server/bin/local/updateRules.sh YOUR_DEFAULT_RESOURCE" to download rule files.
As user root run
Upgrading ICAT less servers
As user root run
- service irods stop
- mkdir /opt/iRODS-2.1v
- chown -R rods:rods /opt/iRODS-2.1v
- Install openssl-devel [if not already installed]
- yum install openssl-devel
- su - rods
As user rods run
- Download irods2.1.tgz to /tmp
- cd /opt/iRODS-2.1v
- tar -xvf /tmp/irods2.1.tgz
- mkdir /tmp/patch
- cd /tmp/patch
- wget ftp://ftp.sdsc.edu/pub/outgoing/mwan/irods/patchFor2.1.tar
- tar -xvf patchFor2.1.tar
- cp -r patchFor2.1/* /opt/iRODS-2.1v/iRODS
- unset IRODS_HOME
- export PATH=`echo $PATH | sed -e 's/iRODS/xRODS/g'`
- cd /opt/iRODS-2.1v/iRODS/
- cp /opt/iRODS-2.0v/iRODS/config/irods.config config/
- edit irods.config as
- $IRODS_ICAT_HOST = 'arcs-df.vpac.org';
- ./irodssetup --upgrade
- cd /opt/iRODS-2.1v/iRODS
- mkdir -p server/bin/local
- cd server/bin/local
- wget http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/Rules/updateRules.sh
- chmod a+xr updateRules.sh
- edit crontab [crontab -e] as below to point to new iRODS_HOME=/opt/iRODS/iRODS
- 0 6 * * * /usr/local/bin/irodsLogPrune.sh /opt/iRODS/iRODS >/dev/null 2>&1 || :
- 0 * * * * /opt/iRODS/iRODS/server/bin/local/updateRules.sh YOUR_DEFAULT_RESOURCE >/dev/null 2>&1 || :
- Run "/opt/iRODS/iRODS/server/bin/local/updateRules.sh YOUR_DEFAULT_RESOURCE" to download rule files.
As user root run
Enable Trash
- Add the following line in arcs.irb and check into subversion.
acTrashPolicy||nop|nop
This will be propagated to all data fabric node within the hour. arcs.irb should also be modified on arcs-df.vpac.org
Testing Procedures
Back-out Procedures
As root
- service irods stop
- rm -f /opt/iRODS
- ln -s /opt/iRODS-2.0v/ /opt/iRODS
- service irods start
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-005 | 25 Aug 2009 - 11:38 | ---+!! Inca Migration to Arcturus
Description
The Inca VM will be migrated as-is from grid-gateway to Arcturus as part of a multi-step migration.
Proposed Date
The migration will be performed on 19th August from 3pm AEST through 4pm AEST
Estimated Duration
1 hour
Systems/Services Affected
The Inca testing suite will be inactive during the migration. No real user impact.
Staff Responsible
- Garth Denley: System migration
- Will Hsu: Inca startup and testing
Detailed Instructions
The existing Inca VM will be shut down. The image file for the root filesystem will be copied (and compressed) from grid-gateway to Arcturus and placed into an preprepared VM shell. The filesystem will then be mounted on the host, expanded to the new size, fstab modified to reflect the new layout, console changed, and then unmounted. The VM will be brought up and tested.
The old Inca VM will be retained as a back-out option for this and future migration tasks.
All of these steps have been previously undertaken in a test environment.
Testing Procedures
The machine will be tested to ensure it is still working correctly.
The Inca suite will be initialised and tested to ensure that it is still behaving properly.
Back-out Procedures
The new VM will be shut down and disabled. The old VM will be brought back up again.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-006 | 20 Aug 2009 - 16:57 | ---+!! Adding rules for EM-XRay collections
Description
Files under the EM-XRay collection must be stored at UQ.
Estimated Duration
1 hour
Systems/Services Affected
Data Fabric on all production boxes:
arcs-df.hpcu.uq.edu.au, arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.sf.utas.edu.au, arcs-df.eresearchsa.edu.au, arcs-df.vpac.org, arcs-df.hpsc.csiro.au
Staff Responsible
Pauline Mak
Detailed Instructions
Add the EMXray rules into the server config $IRODS_HOME/server/config/server.config
reRuleSet emxray, imos,arcs,core
The updateRules.sh script should update itself automatically to grab the emxray.irb file from subversion.
Testing Procedures
Login as the rods user, and modify the ~/.irods/.irodsEnv file and comment out the default resource setting.
Put a file in the following directory on the data fabric. Create a dummy text file, then
icd /ARCS/projects/EMXRAY/
iput <some test file>
ils -L
The result should say the file is now on arcs-df.qcif.edu.au
Remove the file using irm and uncomment the default resource setting in ~/.irods/.irodsEnv.
Back-out Procedures
Remove imos from the server configuration file: $IRODS_HOME/server/config/server.config
reRuleSet imos, arcs,core
Remove the imos rules file:
rm $IRODS_HOME/server/config/reConfigs/emxray.irb
Review
Schedule/Notification
Enable EMXray rules after the outage on 20th August.
Tracking completion
VPAC, TPAC, iVEC, UQ - done.
AC3 - done
eRSA - done, but have to add "wget -o SCRATCH http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/Rules/updateRules.sh?q=$$" to stop proxy cache...
CSIRO - done.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-007 | 03 Nov 2009 - 17:00 | ---+!! Cron Job for Moving IMOS-ACORN Staging Data
Description
This note describes the steps necessary to create a cron job for moving IMOS-ACORN staging data to archive and opendap directories.
Estimated Duration
10 minutes
Systems/Services Affected
Only QCIF/UQ DF machine: arcs-df.hpcu.uq.edu.au
Staff Responsible
KaiLu
Detailed Instructions
As user rods:
run:
cd /opt/iRODS/iRODS/server/bin/local
wget http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/utils/ACORN_Data_Move.sh/?format=raw -O ACORN_Data_Move.sh
chmod +x ACORN_Data_Move.sh
crontab -e and adding the following line:
5,25,45 * * * * /opt/iRODS/iRODS/server/bin/local/ACORN_Data_Move.sh >> /opt/iRODS/iRODS/server/log/ACORN-Moving-Data 2>&1
The cron job will be run once every 20 minutes
Testing Procedures
As user rods:
run:
cd /opt/iRODS/iRODS/server/bin/local
wget http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/utils/filesChecking.sh/?format=raw -O filesChecking.sh
chmod +x filesChecking.sh
find /data/Vault/ARCS/projects/IMOS/staging/ACORN -daystart -type f \( -mtime 0 -or -mtime 1 \) > fileList
/opt/iRODS/iRODS/server/bin/local/ACORN_Data_Move.sh >> /opt/iRODS/iRODS/server/log/ACORN-Moving-Data 2>&1
./filesChecking.sh -f fileList
If nothing is printed, this means that the cron job script works properly.
rm -f fileList
When you visit the web site: opendap-qcif.arcs.org.au, you will also see those files that have just been moved to directory opendap from directory staging
Back-out Procedures
As user rods:
run:
crontab -e and remove the following line:
5,25,45 * * * * /opt/iRODS/iRODS/server/bin/local/ACORN_Data_Move.sh >> /opt/iRODS/iRODS/server/log/ACORN-Moving-Data 2>&1
rm -f /opt/iRODS/iRODS/server/bin/local/ACORN-Moving-Data.sh
rm -f /opt/iRODS/iRODS/server/log/ACORN-Moving-Data
rm -f /opt/iRODS/iRODS/server/bin/local/filesChecking.sh
Review
Schedule/Notification
Tracking completion
Done. -- KaiLu - 03 Nov 2009
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-008 | 26 Aug 2009 - 13:37 | ---+!! Hollowworld, CISM - New Trac Instances
Description
This outage is needed to implement:
- Hollowworld - a Trac site along the scheme used for Fullmoon
- CISM - a Trac site for the Community Ice Shelf Model
- Urgent kernel and apache security updates
Estimated Duration
5 minutes
Systems/Services Affected
http://projects.arcs.org.au
https://projects.arcs.org.au
Staff Responsible
Simon Yin
Detailed Instructions
The URLs for the new sites are:
https://projects.arcs.org.au/trac/hollowworld
https://projects.arcs.org.au/svn/hollowworld
https://projects.arcs.org.au/trac/cism
https://projects.arcs.org.au/svn/cism
Detailed instruction may be found at: https://projects.arcs.org.au/trac/systems/wiki/HowTo/UpgradeAndMigrateTracSubversion
Testing Procedures
For Hollowworld, the following scenarios are tested:
- Windows XP, IE8 user
- Connect anonymously to http://203.202.139.98/trac/hollowworld
- View tickets
- Register as "TestUser"
- (https only) Connect as "TestUser" with no extra permissions to https://203.202.139.98/trac/hollowworld
- Create a new ticket
- (https only) Connect as user with full admin permissions to https://203.202.139.98/trac/hollowworld
- Change the password of "TestUser" (and then login again as "TestUser" to check the password)
- Connect to http://203.202.139.98/svn/hollowworld
- Repeat the tests in 1. for a Mac OSX, Safari user
- Repeat the tests in 1. for a Linux, Firefox 3.5.x user
- (https only) Linux, Subversion from command line (if necessary, check out from a known sub-directory of the project-under-test to save time)
- svn co https://203.202.139.98/svn/hollowworld
Where tests listed above have used the http protocol, repeat them with the https protocol.
The new site is first configured at a standby site (IP address 203.202.139.98). After these tests are completed successfully, the configuration is dumped to the production server (projects.arcs.org.au). A restart of httpd is needed on projects.arcs.org.au.
* Important For Hollowworld *
Once running on projects.arcs.org.au, make a test ticket and ensure that emails are sent to the hollowworld-dev@arcs.org.au mailing-list
Back-out Procedures
On projects.arcs.org.au, before changing them to enable Hollowworld and CISM, make copies of
/etc/httpd/conf.d/trac.conf as /etc/httpd/conf.d/trac.conf.YYYYMMDD
/etc/httpd/conf.d/subversion.conf as /etc/httpd/conf.d/subversion.conf .YYYYMMDD
Revert to the above config settings if the changes fail.
Also, the postgreSQL database used by Hollowworld and CISM must be dropped.
Finally, delete the directories:
- /var/www/trac/hollowworld
- /var/www/svn/hollowworld
- /var/www/trac/cism
- /var/www/svn/cism
Review
Schedule/Notification
Notification by email:
| For ARCS internal projects: |
|
| |
techstaff@arcs.org.au, developers@arcs.org.au |
| For Fullmoon |
|
| |
simon.cox@jrc.ec.europa.eu, ryan.fraser@csiro.au, Alexandre.Robin@spotimage.fr, bryan.lawrence@stfc.ac.uk, eboisver@nrcan.gc.ca, Francois.Letourneau@RNCan-NRCan.gc.ca, Jacek.Radajewski@csiro.au, johannes.echterhoff@igsi.eu, Jonathan.Yu@csiro.au, mggr@pml.ac.uk, pavel.golodoniuc@csiro.au, Peter.Taylor@csiro.au, pcd@ecosystem.com, Rick.Meng@csiro.au, rob.atkinson@csiro.au, robert.woodcock@csiro.au, rocky@cc.gatech.edu, stefan.hansen@lisasoft.com |
| For podd |
|
| |
gkam08@gmail.com, liyf@itee.uq.edu.au, james.eddes@adelaide.edu.au, jane@itee.uq.edu.au, ben.joseph@utas.edu.au, xueqin.wang@csiro.au, paul.coddington@arcs.org.au, andrew.treloar@ands.org.au, Adrienne.McKenzie@anu.edu.au, Donald.Hobern@csiro.au, Robert.Furbank@csiro.au |
| For pht3d |
|
| |
Janek.Greskowiak@csiro.au, aaron.mcdonough@csiro.au, vincent.post@falw.vu.nl |
| For aodaac |
|
| |
ken.suber@csiro.au, Edward.King@csiro.au, matt.paget@csiro.au |
| For common-grid-libs |
|
| |
aron.abrook@qut.edu.au |
| For commons-vfs-grid |
|
| |
manish.saroha@its.monash.edu.au |
| For CAPSnav |
|
| |
Edward.King@csiro.au, Leon.Majewski@bom.gov.au, Peter.Turner@csiro.au |
| For DataMINX? |
|
| |
p.turner@chem.usyd.edu.au, alexa@intersect.org.au, andrey@intersect.org.au, carlos@intersect.org.au, Doug.duBoulay@gmail.com, xiao.wang@stfc.ac.uk, nguyen_h@chem.usyd.edu.au, gerson.galang@versi.edu.au |
| For Generic Grid Client |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org, vladimir.mencl@canterbury.ac.nz |
| For grisu |
|
| |
y.halytskyy@auckland.ac.nz |
| For grix |
|
| |
aron.abrook@qut.edu.au |
| For gsub |
|
| |
sam.moskwa@sapac.edu.au |
| For jmoleditor |
|
| |
Joachim.Mai@ac3.edu.au, vvv900@gmail.com, Martin.Thompson@unsw.edu.au |
| For macddap |
|
| |
jason.lohrey@arcitecta.com, R.Williams@utas.edu.au, paolap@utas.edu.au, pblain@postoffice.utas.edu.au |
| For access |
|
| |
martin.dix@csiro.au, l.logan@bom.gov.au, mrezny@sgi.com |
| For mem |
|
| |
chris.jackett@csiro.au, peter.turner@csiro.au, jenny.lovell@csiro.au, r.williams@utas.edu.au |
| For mgplot |
|
| |
frank.colberg@utas.edu.au |
| For pht3d |
|
| |
Henning.Prommer@csiro.au, Aaron.McDonough@csiro.au |
| For portal |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org |
| For sggc |
|
| |
Terry.Rankine@csiro.au |
| For slcs-client |
|
| |
chi.nguyen@melco.mq.edu.au |
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200908-009 | 25 Aug 2009 - 18:08 | ---+!! Inca Grid Monitor software upgrade at eRSA
Description
Upgrade current Inca VM to CentOS 5.3 and Inca 2.5
Estimated Duration
6 hours allocated, 3 hours expected.
Systems/Services Affected
Grid Monitoring/Testing.
Staff Responsible
WillHsu
Detailed Instructions
Shutdown Inca processes, yum update, then upgrade Inca software.
Reboot Inca VM, start Inca processes.
Testing Procedures
Starts manual Inca tests via incat java control application, check test results via http://inca.arcs.org.au/
Back-out Procedures
Shutdown Inca processes
mv /home/inca /home/inca.new
mv /home/inca.old /home/inca
Starts previous version of Inca processes
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-001 | 08 Sep 2009 - 13:05 | ---+!! ARCS Sakai Upgrade to Version 2.5.5
Description
The ARCS Sakai server is being upgraded to version 2.5.5 which includes security fixes. In addition to the upgrade, additional bug fixes to WebDAV? (Windows Vista issue) and Search tool are included.
Proposed Date
Friday 11th September 12:00pm AEST
Estimated Duration
One hour
Systems/Services Affected
All sites on the ARCS Sakai server.
Staff Responsible
- Denny (via ARCS email, Jabber)- contact for Sakai-level issues.
- Garth Denley (via ARCS email, Jabber)- contact for system-level issues.
Detailed Instructions
- The tomcat service will be stopped.
- The database will be dumped.
- The filestore used by Sakai will be backed up to the same mountpoint in a different directory (plenty of space).
- The symbolic link /opt/tomcat will be updated to point to the new version of Sakai.
- The tomcat service will be started.
Testing Procedures
- The Sakai site will be tested to see if any problems arise.
Back-out Procedures
- The tomcat service will be stopped.
- The database will be dropped and restored from backup.
- The filestore backup will be copied back to the live filestore.
- The symbolic link /opt/tomcat will be updated to point to the old version of Sakai.
- The tomcat service will be started.
UPDATE 8/9:
- A slight change in the back-out procedures: The modified database will not be dropped. Instead, we will restore the database backup to a new table name, and adjust sakai.properties to use that. This gains us the ability to use the modified database as a last resort should database restoration fail for some reason.
Review
Schedule/Notification
Notifications are to be sent to the Community, Developers, and Techstaff mailing lists. Sakai users will also be notified by email.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-002 | 18 Feb 2010 - 10:55 | ---+!! Davis change for iRODS 2.1
Description
Update Davis to a version that works with iRODS 2.1 (due to Jargon changes...)
Estimated Duration
30 Minutes
Systems/Services Affected
arcs-df.vpac.org
Staff Responsible
Pauline Mak
Detailed Instructions
Stop davis and httpd if running:
service davis stop
service httpd stop
Then move the current deployed version of davis:
cd /opt/davis
mv davis-0.7.3 davis-0.7.3_OLD
rm davis
As the davis user, download webavis from GoogleCode? :
wget http://webdavis.googlecode.com/files/davis-0.7.3.tar.gz
tar zxvf davis-0.7.3.tar.gz
mv /tmp/davis-0.7.3 /opt/davis/
Then copy over config files:
cp davis-0.7.3_OLD/webapps/root/WEB-INF/web.xml davis-0.7.3/webapps/root/WEB-INF/
cp davis-0.7.3_OLD/etc/*.xml davis-0.7.3/etc/
Then start httpd and davis:
service davis start
service httpd start
Testing Procedures
Test with litmus on irodsdev.vpac.org
Back-out Procedures
Stop davis Remove the new version of davis and rename davis-0.7.3_OLD to davis-0.7.3 Start davis
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-003 | 09 Sep 2009 - 19:42 | ---+!! Migrate LDAP server for IdP?
Description
Migrate the LDAP server off idp.arcs.org.au onto dedicated host ldap.arcs.org.au
Estimated Duration
30 minutes
Systems/Services Affected
Directly: idp.arcs.org.au, openidp.arcs.org.au
Indirectly: Users of the ARCS IdP will be unable to login to shibboleth services during this time. (DataFabric being the main one)
Staff Responsible
Sam Morrison
Detailed Instructions
Dump LDIF from idp.arcs.org.au then load it into ldap.arcs.org.au
Change configuration on:
IdP? registration app
idp.arcs.org.au login handler
idp.arcs.org.au resolver handler
openidp.arcs.org.au login handler
openidp.arcs.org.au resolver handler
Testing Procedures
Testing has been done using a copy of the data and the unused idp.arcs.org.au
Back-out Procedures
Change configuration changes back.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-004 | 15 Sep 2009 - 13:02 | ---+!! VDT Upgrade
Description
VDT will be upgraded from version 1.10.1y to 1.10.1p24. This is primarily for security updates.
Estimated Duration
15 minutes.
Systems/Services Affected
All ARCS Grid services.
Staff Responsible
All Systems Services staff.
Detailed Instructions
All ARCS Grid virtual machines will be upgraded from VDT 1.10.1y to 1.10.1p24 via the vdt-updater script.
Testing Procedures
Once the upgrade has been performed, tests 1 and 2 from the test suite will be run. If any fail, the upgrade will be rolled back.
Back-out Procedures
The vdt-updater script forces you to make a backup of the existing VDT installation. If backout is required, the backup copy of VDT will be moved back into production.
Review
Schedule/Notification
Notification to be sent to community, developers, and techstaff lists.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-005 | 18 Sep 2009 - 17:51 | ---+!! Fix to ibun Command
Fix to ibun Command
Description
This fix allow us to extract tar files using "ibun -x" command in any collection, if you have write permission to it.
Estimated Duration
1 Hour
Systems/Services Affected
All DF Server
Staff Responsible
Sridhar Reddapani
Detailed Instructions
Testing Procedures
Test using ibun to extarct tar files to any collection other than home collection
Back-out Procedures
- service httpd stop [if necessary]
- service davis stop [if necessary]
- service irods stop
- service postgres stop [if necessary]
- cd /opt/iRODS/iRODS/server/api/src/
- rm rsStructFileExtAndReg.c
- mv /tmp/rsStructFileExtAndReg.c
- cd /opt/iRODS/iRODS
- make
- service postgres start [if necessary]
- service irods start
- service davis start [if necessary]
- service httpd start [if necessary]
Review
Schedule/Notification
Tracking completion
Patch applied to
iVEC
eRSA
UQ
Intersect
TPAC
VPAC
iVEC
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-006 | 17 Sep 2009 - 17:23 | ---+!! Upgrade Davis to 0.8.0
Description
Upgrade Davis to 0.8.0
Estimated Duration
30mins
Systems/Services Affected
ARCS DF
Staff Responsible
Shunde Zhang
Detailed Instructions
Stop davis.
Make a copy of current davis instance. Change the owner of that directory properly.
cp -r davis-0.7.3 davis-0.8.0
replace davis.jar in webapps/root/WEB-INF/lib
Modify web.xml
<init-param>
<description>
organisation name - default is Davis
</description>
<param-name>organisation-name</param-name>
<param-value>Australian Research Collaboration Service</param-value>
</init-param>
<init-param>
<description>
authentication realm - default is Davis
</description>
<param-name>authentication-realm</param-name>
<param-value>ARCS Data Fabric</param-value>
</init-param>
<init-param>
<description>
organisation logo
</description>
<param-name>organisation-logo</param-name>
<param-value>/images/logo.jpg</param-value>
</init-param>
<init-param>
<description>
organisation logo geometry - widthxheight
</description>
<param-name>organisation-logo-geometry</param-name>
<param-value>32x32</param-value>
</init-param>
<init-param>
<description>
favicon for browser title bar
</description>
<param-name>favicon</param-name>
<param-value>http://www.arcs.org.au/favicon.ico</param-value>
</init-param>
<init-param>
<description>
myproxy server where certs are stored
</description>
<param-name>myproxy-server</param-name>
<param-value>myproxy.arcs.org.au</param-value>
</init-param>
Copy images files from tar ball to /var/www/html
Change the 'davis' soft link to 0.8.0, and restart davis.
Testing Procedures
Litmus, WebDAV? clients, etc.
Back-out Procedures
Change the 'davis' soft link back to 0.7.3, and restart davis.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-007 | 17 Sep 2009 - 17:12 | ---+!! slcs-server - New Trac Instances
Description
This outage is needed to implement:
- slcs-server - a Trac site for the SLCS Proxy project
- ciomp - (was CISM) renamed a Trac site
- Urgent kernel and apache security updates
Estimated Duration
5 minutes
Systems/Services Affected
http://projects.arcs.org.au
https://projects.arcs.org.au
Staff Responsible
Simon Yin
Detailed Instructions
The URLs for the new sites are:
https://projects.arcs.org.au/trac/slcs-server
https://projects.arcs.org.au/svn/slcs-server
https://projects.arcs.org.au/trac/ciomp
https://projects.arcs.org.au/svn/ciomp
Detailed instruction may be found at: https://projects.arcs.org.au/trac/systems/wiki/HowTo/UpgradeAndMigrateTracSubversion
Testing Procedures
Since ciomp is just a renamed site, no further tests are needed.
Since slcs-server is for internal ARCS use, no detailed tests are needed
Back-out Procedures
On projects.arcs.org.au, before changing them to enable Hollowworld and CISM, make copies of
/etc/httpd/conf.d/trac.conf as /etc/httpd/conf.d/trac.conf.YYYYMMDD
/etc/httpd/conf.d/subversion.conf as /etc/httpd/conf.d/subversion.conf .YYYYMMDD
Revert to the above config settings if the changes fail.
Also, the postgreSQL database used by ciomp and slcs-server must be dropped.
Finally, delete the directories:
- /var/www/trac/ciomp
- /var/www/svn/ciomp
- /var/www/trac/slcs-server
- /var/www/svn/slcs-server
Review
Schedule/Notification
Notification by email:
| For ARCS internal projects: |
|
| |
techstaff@arcs.org.au, developers@arcs.org.au |
| For Fullmoon |
|
| |
simon.cox@jrc.ec.europa.eu, ryan.fraser@csiro.au, Alexandre.Robin@spotimage.fr, bryan.lawrence@stfc.ac.uk, eboisver@nrcan.gc.ca, Francois.Letourneau@RNCan-NRCan.gc.ca, Jacek.Radajewski@csiro.au, johannes.echterhoff@igsi.eu, Jonathan.Yu@csiro.au, mggr@pml.ac.uk, pavel.golodoniuc@csiro.au, Peter.Taylor@csiro.au, pcd@ecosystem.com, Rick.Meng@csiro.au, rob.atkinson@csiro.au, robert.woodcock@csiro.au, rocky@cc.gatech.edu, stefan.hansen@lisasoft.com |
| For podd |
|
| |
gkam08@gmail.com, liyf@itee.uq.edu.au, james.eddes@adelaide.edu.au, jane@itee.uq.edu.au, ben.joseph@utas.edu.au, xueqin.wang@csiro.au, paul.coddington@arcs.org.au, andrew.treloar@ands.org.au, Adrienne.McKenzie@anu.edu.au, Donald.Hobern@csiro.au, Robert.Furbank@csiro.au, joanne.evans@unimelb.edu.au |
| For pht3d |
|
| |
Janek.Greskowiak@csiro.au, aaron.mcdonough@csiro.au, vincent.post@falw.vu.nl |
| For aodaac |
|
| |
ken.suber@csiro.au, Edward.King@csiro.au, matt.paget@csiro.au |
| For common-grid-libs |
|
| |
aron.abrook@qut.edu.au |
| For commons-vfs-grid |
|
| |
manish.saroha@its.monash.edu.au |
| For CAPSnav |
|
| |
Edward.King@csiro.au, Leon.Majewski@bom.gov.au, Peter.Turner@csiro.au |
| For DataMINX? |
|
| |
p.turner@chem.usyd.edu.au, alexa@intersect.org.au, andrey@intersect.org.au, carlos@intersect.org.au, Doug.duBoulay@gmail.com, xiao.wang@stfc.ac.uk, nguyen_h@chem.usyd.edu.au, gerson.galang@versi.edu.au |
| For Generic Grid Client |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org, vladimir.mencl@canterbury.ac.nz |
| For grisu |
|
| |
y.halytskyy@auckland.ac.nz |
| For grix |
|
| |
aron.abrook@qut.edu.au |
| For gsub |
|
| |
sam.moskwa@sapac.edu.au |
| For jmoleditor |
|
| |
Joachim.Mai@ac3.edu.au, vvv900@gmail.com, Martin.Thompson@unsw.edu.au |
| For macddap |
|
| |
jason.lohrey@arcitecta.com, R.Williams@utas.edu.au, paolap@utas.edu.au, pblain@postoffice.utas.edu.au |
| For access |
|
| |
martin.dix@csiro.au, l.logan@bom.gov.au, mrezny@sgi.com |
| For mem |
|
| |
chris.jackett@csiro.au, peter.turner@csiro.au, jenny.lovell@csiro.au, r.williams@utas.edu.au |
| For mgplot |
|
| |
frank.colberg@utas.edu.au |
| For portal |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org |
| For sggc |
|
| |
Terry.Rankine@csiro.au |
| For slcs-client |
|
| |
chi.nguyen@melco.mq.edu.au |
| For ciomp |
|
| |
bkgalton@utas.edu.au |
| For holllowworld |
|
| |
See Fullmoon |
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-008 | 29 Sep 2009 - 17:36 | ---+!! Intersect - Short Term Outages (<5 min) Due To Network Upgrade
Description
| Intersect is upgrading to a pair of 1Gb/s switches at Global Switch Data Centre |
|
|
| The reason for this change is to |
|
|
| * Improve data transfer speed for the ARCS Data Fabric |
|
| * Provide failover for high-availability needs |
|
Estimated Duration
< 5 min
Systems/Services Affected
| All ARCS systems and services hosted by Intersect |
|
|
| * projects.arcs.org.au |
|
| * arcs-df.ac3.edu.au |
|
| * ng2.ac3.edu.au |
|
| * nggums.ac3.edu.au |
|
| * idp.ac3.edu.au |
|
| * grid.ac3.edu.au |
|
| * auth86.ac3.edu.au |
|
| * auth01.ac3.edu.au - auth16.ac3.edu.au |
|
| * ARCS piwik server |
|
Staff Responsible
Simon Yin - simon.yin@arcs.org.au
Detailed Instructions
This is an initial population of 2 new 1Gb/s switches. The interruption will occur when the uplink to AARNET is transferred from an existing switch to the new switches. It will be a short duration outage but will affect all public services.
Testing Procedures
Prior to installation at Global Switch, the new switches have been power cycled and preconfigured by AC3.
Back-out Procedures
Move the uplink to AARNET back to the old Cisco switch.
Review
Schedule/Notification
Notification by email:
| For ARCS internal projects: |
|
| |
techstaff@arcs.org.au, developers@arcs.org.au |
| For Fullmoon |
|
| |
simon.cox@jrc.ec.europa.eu, ryan.fraser@csiro.au, Alexandre.Robin@spotimage.fr, bryan.lawrence@stfc.ac.uk, eboisver@nrcan.gc.ca, Francois.Letourneau@RNCan-NRCan.gc.ca, Jacek.Radajewski@csiro.au, johannes.echterhoff@igsi.eu, Jonathan.Yu@csiro.au, mggr@pml.ac.uk, pavel.golodoniuc@csiro.au, Peter.Taylor@csiro.au, pcd@ecosystem.com, Rick.Meng@csiro.au, rob.atkinson@csiro.au, robert.woodcock@csiro.au, rocky@cc.gatech.edu, stefan.hansen@lisasoft.com |
| For podd |
|
| |
gkam08@gmail.com, liyf@itee.uq.edu.au, james.eddes@adelaide.edu.au, jane@itee.uq.edu.au, ben.joseph@utas.edu.au, xueqin.wang@csiro.au, paul.coddington@arcs.org.au, andrew.treloar@ands.org.au, Adrienne.McKenzie@anu.edu.au, Donald.Hobern@csiro.au, Robert.Furbank@csiro.au, joanne.evans@unimelb.edu.au |
| For pht3d |
|
| |
Janek.Greskowiak@csiro.au, aaron.mcdonough@csiro.au, vincent.post@falw.vu.nl |
| For aodaac |
|
| |
ken.suber@csiro.au, Edward.King@csiro.au, matt.paget@csiro.au |
| For common-grid-libs |
|
| |
aron.abrook@qut.edu.au (Defunct) |
| For commons-vfs-grid |
|
| |
manish.saroha@its.monash.edu.au |
| For CAPSnav |
|
| |
Edward.King@csiro.au, Leon.Majewski@bom.gov.au, Peter.Turner@csiro.au |
| For DataMINX? |
|
| |
p.turner@chem.usyd.edu.au, alexa@intersect.org.au, andrey@intersect.org.au, carlos@intersect.org.au, Doug.duBoulay@gmail.com, xiao.wang@stfc.ac.uk, nguyen_h@chem.usyd.edu.au, gerson.galang@versi.edu.au |
| For Generic Grid Client |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org, vladimir.mencl@canterbury.ac.nz |
| For grisu |
|
| |
y.halytskyy@auckland.ac.nz |
| For grix |
|
| |
aron.abrook@qut.edu.au |
| For gsub |
|
| |
sam.moskwa@sapac.edu.au |
| For jmoleditor |
|
| |
Joachim.Mai@ac3.edu.au, vvv900@gmail.com, Martin.Thompson@unsw.edu.au |
| For macddap |
|
| |
jason.lohrey@arcitecta.com, R.Williams@utas.edu.au, paolap@utas.edu.au, pblain@postoffice.utas.edu.au |
| For access |
|
| |
martin.dix@csiro.au, l.logan@bom.gov.au, mrezny@sgi.com |
| For mem |
|
| |
chris.jackett@csiro.au, peter.turner@csiro.au, jenny.lovell@csiro.au, r.williams@utas.edu.au |
| For mgplot |
|
| |
frank.colberg@utas.edu.au |
| For portal |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org |
| For sggc |
|
| |
Terry.Rankine@csiro.au |
| For slcs-client |
|
| |
chi.nguyen@melco.mq.edu.au (Defunct) |
| For ciomp |
|
| |
bkgalton@utas.edu.au |
| For holllowworld |
|
| |
See Fullmoon |
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-009 | 01 Oct 2009 - 11:28 | ---+!! Network changes at VPAC
Description
Network changes at VPAC
Estimated Duration
All day
Systems/Services Affected
All systems hosted at VPAC
Staff Responsible
Andy Botting, Sam Morrison
Detailed Instructions
Network changes at VPAC will cause ARCS services to be unavailable.
- arcs.org.au
- arcs-df.vpac.org
- status.arcs.org.au
- plonedev1.arcs.org.au
- voms.arcs.org.au
- wikitest.arcs.org.au
- codedev.arcs.org.au
- shib-mp.arcs.org.au
- code.arcs.org.au
- static.arcs.org.au
- services.arcs.org.au
- rrdev.arcs.org.au
- plone.arcs.org.au
- fedmgr0.arcs.org.au
- arcs-vh01.vpac.org
- arcs-vh02.vpac.org
- arcs-vh03.vpac.org
- idp.arcs.org.au
- chat.arcs.org.au
- www.arcs.org.au
- plonedev.arcs.org.au
- rt.arcs.org.au
- arcs-db.vpac.org
- crm.arcs.org.au
- support.arcs.org.au
- myproxy2.arcs.org.au
- slcs1.arcs.org.au
- slcstest.arcs.org.au
- slcsca.arcs.org.au
- evo.arcs.org.au
- test.arcs.org.au
- dreamwidth.arcs.org.au
- agbridge-melb.arcs.org.au
- odl.arcs.org.au
- openidp.arcs.org.au
- nagios.arcs.org.au
- hosting.arcs.org.au
- ds.arcs.org.au
- ds-test.arcs.org.au
- ldap.arcs.org.au
- vomrs.arcs.org.au
Testing Procedures
All the hostnames listed will be tested once the migration is complete.
Back-out Procedures
The old hardware will still be available if the migration is not successful. Reverting to the old infrastructure will be possible if necessary.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-010 | 29 Sep 2009 - 10:17 | ---+!! Fix to ibun to extract files to any resource on Data Fabric
Fix to ibun to extract files to any resource on DataFabric?
Description
This fix allow us to extract tar files using "ibun -x" command in any resource, if you have write permission to it.
Estimated Duration
1 Hour
Systems/Services Affected
All DF Server
Staff Responsible
Sridhar Reddapani
Detailed Instructions
- service httpd stop [if necessary]
- service davis stop [if necessary]
- service irods stop
- service postgres stop [if necessary]
- su - rods
- cd /opt/iRODS/
- tar cvf pre_ibun_second_patch_iRODS.tar iRODS/
- cd /opt/iRODS/iRODS/server/api/src/
- mv rsPhyPathReg.c /tmp/
- wget http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/BugFix/rsPhyPathReg.c
- cd /opt/iRODS/iRODS
- make
- exit
- service postgres start [if necessary]
- service irods start
- service davis start [if necessary]
- service httpd start [if necessary]
Testing Procedures
Test using ibun to extarct tar files to different resources.
Back-out Procedures
- service httpd stop [if necessary]
- service davis stop [if necessary]
- service irods stop
- service postgres stop [if necessary]
- su - rods
- cd /opt/iRODS
- mv iRODS /tmp/
- tar -xvf pre_ibun_second_patch_iRODS.tar
- exit
- service postgres start [if necessary]
- service irods start
- service davis start [if necessary]
- service httpd start [if necessary]
Review
Schedule/Notification
Tracking completion
iVEC ---Done
eRSA ---
UQ ---Done
Intersect ---Done
TPAC ---
VPAC ---Done
CISRO --Done
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200909-011 | 28 Sep 2009 - 12:04 | ---+!! Davis and SP integration
Description
Configure Apache to get Davis shibbolized
Estimated Duration
10 mins
Systems/Services Affected
ARCS DF (no downtime is needed)
Staff Responsible
Shunde Zhang
Detailed Instructions
Prerequisites: SP should be installed and working properly.
Make sure the Davis configuration file, web.xml, has:
<init-param>
<param-name>insecureConnection</param-name>
<param-value>shib</param-value>
</init-param>
<init-param>
<param-name>shared-token-header-name</param-name>
<param-value>auEduPersonSharedToken</param-value>
</init-param>
<init-param>
<param-name>cn-header-name</param-name>
<param-value>cn</param-value>
</init-param>
<init-param>
<param-name>admin-cert-file</param-name>
<param-value>/etc/grid-security/daviscert.pem</param-value>
</init-param>
<init-param>
<param-name>admin-key-file</param-name>
<param-value>/etc/grid-security/daviskey.pem</param-value>
</init-param>
And rods has DN of the above cert mapped.
rods@arcs-df ~ $ iadmin lu rods
user_id: 10007
user_name: rods
user_type_name: rodsadmin
zone_name: ARCS
user_distin_name: /C=AU/O=APACGrid/OU=VPAC/CN=arcs-df.vpac.org
user_info:
r_comment:
create_ts: 01234939508 : 2009-02-18.17:45:08
modify_ts: 01253773433 : 2009-09-24.16:23:53
Then add a conf file to httpd conf.d.
vi /etc/httpd/conf.d/arcs.conf
<VirtualHost *:80>
ServerName df.arcs.org.au
DocumentRoot "/var/www/html"
ProxyPass /ARCS/ ajp://localhost:8009/ARCS/
<Location /ARCS>
AuthType shibboleth
ShibRequireSession On
ShibUseHeaders On
require valid-user
</Location>
<Location /secure>
AuthType shibboleth
ShibRequireSession On
ShibUseHeaders On
require valid-user
</Location>
</VirtualHost>
Restart shibd and RELOAD httpd.
/etc/init.d/shibd restart
/etc/init.d/httpd reload
Testing Procedures
Go to http://df.arcs.org.au/ARCS/home
Back-out Procedures
delete arcs.conf from /etc/httpd/conf.d
reload httpd
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-001 | 06 Oct 2009 - 13:48 | ---+!! TITLE
Upgrading PGPOOL on arcs-01.ivec.org
Description
Upgrading PGPOOL to latest version Pgpool-2.2.5 on arcs-01.ivec.org
Estimated Duration
30 Mins
Systems/Services Affected
arcs-01.ivec.org
Staff Responsible
Sridhar Reddapani
Detailed Instructions
pgpool stop
As Root
cd /data/pgsql/
mkdir pgpool-2.2.5
chown -R postgres:postgres pgpool-2.2.5/
su -postgres
cd /data/pgsql/pgpool-2.2.5/
mkdir pgpool
mkdir logs
Download latest pgpool to /tmp
cd /tmp/
tar -xvf pgpool-II-2.2.5.tar.gz
cd /tmp/pgpool-II-2.2.5
./configure --prefix=/data/pgsql/pgpool-2.2.5/pgpool/
make
make install
cd /data/pgsql/pgpool-2.2.5/pgpool/etc
cp /data/pgsql/pgpool/pgpool/etc/pcp.conf ./
cp /data/pgsql/pgpool/pgpool/etc/pgpool.conf ./
cp /data/pgsql/pgpool/pgpool/etc/pool_hba.conf ./
exit
cd /data/pgsql/
mv pgpool/ pgpool-2.2.1/
ln -s pgpool-2.2.5/ pgpool
su - postgres
pgpool -version
pgpool -d -n 2>&1 | /usr/local/sbin/cronolog --hardlink=/data/pgsql/pgpool/logs/pgpool.log '/data/pgsql/pgpool/logs/%Y-%m-%d-pgpool.log' &
tail -f /data/pgsql/pgpool/logs/pgpool.log
Testing Procedures
Test failover is working by simulating failure node
Back-out Procedures
pgpool stop
As Root
cd /data/pgsql/
rm pgpool
ln -s pgpoo-2.2.1/ pgpool
su - postgres
pgpool -version
pgpool -d -n 2>&1 | /usr/local/sbin/cronolog --hardlink=/data/pgsql/pgpool/logs/pgpool.log '/data/pgsql/pgpool/logs/%Y-%m-%d-pgpool.log' &
tail -f /data/pgsql/pgpool/logs/pgpool.log
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-002 | 20 Oct 2009 - 10:19 | ---+!! Intersect - Move Data Fabric to 1Gb/s link, add Redundant PSU and NIC to Projects machine, Connect ng2/nggums host to 1Gb/s link
Description
| Intersect now has 1Gb/s switches at Global Switch Data Centre |
|
|
| This change is to |
|
|
| * Improve data transfer speed for the ARCS Data Fabric by moving uplinks to the 1Gb/s switch |
|
| * Install extra PSU and NIC to allow Highly-Available service from the Projects Xen Host |
|
| * Move Grid ng2/nggums services to 1Gb/s switch |
|
Estimated Duration
1 Hour
Systems/Services Affected
| The following ARCS systems and services hosted by Intersect will be intermittent during this period |
|
|
| * projects.arcs.org.au - Trac and Subversion sites |
|
| * arcs-df.ac3.edu.au |
|
| * ng2.ac3.edu.au |
|
| * nggums.ac3.edu.au |
|
| * idp.ac3.edu.au |
|
Staff Responsible
Simon Yin - simon.yin@arcs.org.au
Detailed Instructions
| projects.arcs.org.au (RU21) - Trac and Subversion sites |
|
|
| * Shutdown the projects.arcs.org.au VM |
|
| * Activate the standby projects.arcs.org.au VM |
|
| * Power down the host located at RU21 |
|
| * Disconnect the power-cord to RU21 |
|
| * Connect the new NIC card to PCIe Slot #2 |
|
| * Connect the second PSU to the empty power bay |
|
| * Restart the host and disable the standby projects.arcs.org.au |
|
| * Restart the production projects.arcs.org.au VM |
|
| ng2/nggums/idp (RU22) |
|
|
| Public VLAN |
|
|
| * Identify the existing Ethernet connection to Public (HPCC_FRONT) VLAN |
|
| * Connect the above to RU27 (ARCS SW1 IA069B? ) Port 5 |
|
| Private VLAN |
|
|
| * Identify an unused Ethernet port |
|
| * Connect the above to RU27 (ARCS SW1 IA069B? ) Port 16 |
|
| arcs-df.ac3.edu.au - Data Fabric (RU20) |
|
|
| Public VLAN |
|
|
| * Identify the existing Ethernet connection to Public (HPCC_FRONT) VLAN |
|
| * Connect the above to RU27 (ARCS SW1 IA069B? ) Port 4 |
|
| Private VLAN |
|
|
| * Identify the existing Ethernet connection to Private (NODE) VLAN |
|
| * Connect the above to RU27 (ARCS SW1 IA069B? ) Port 15 |
|
| mclarenfs VLAN |
|
|
| * Identify the existing Ethernet connection to Mclarenfs (DATA) VLAN |
|
| * Connect the above to RU27 (ARCS SW1 IA069B? ) Port 22 |
|
Testing Procedures
- Complete the installation of the second PDU and NIC on the Projects host machine first.
- After disabling the standby and restarting the production VM, point a web-browser at https://projects.arcs.org.au/trac/systems and confirm the production version has restarted and is accessible
- Next, complete the connection of the ng2/nggums/idp to the 1Gb/s switch. Confirm the functionality with a grid job submission
- Once Step 3 is successful, perform the connection upgrade of the arcs-df machine
Back-out Procedures
If a machine does not restart correctly after the installation of new hardware, remove that hardware and return to prior configuration. Contact IBM for warranty replacement of the component.
If a network connection does not return upon moving to the 1Gb/s switch, first try a different port (on the same VLAN). If this is unsuccessful then return to the old Cisco switch connection.
Review
Schedule/Notification
Notification by email:
| For ARCS internal projects: |
|
| |
techstaff@arcs.org.au, developers@arcs.org.au |
| For Fullmoon |
|
| |
simon.cox@jrc.ec.europa.eu, ryan.fraser@csiro.au, Alexandre.Robin@spotimage.fr, bryan.lawrence@stfc.ac.uk, eboisver@nrcan.gc.ca, Francois.Letourneau@RNCan-NRCan.gc.ca, Jacek.Radajewski@csiro.au, johannes.echterhoff@igsi.eu, Jonathan.Yu@csiro.au, mggr@pml.ac.uk, pavel.golodoniuc@csiro.au, Peter.Taylor@csiro.au, pcd@ecosystem.com, Rick.Meng@csiro.au, rob.atkinson@csiro.au, robert.woodcock@csiro.au, rocky@cc.gatech.edu, stefan.hansen@lisasoft.com |
| For podd |
|
| |
gkam08@gmail.com, liyf@itee.uq.edu.au, james.eddes@adelaide.edu.au, jane@itee.uq.edu.au, ben.joseph@utas.edu.au, f.davies@uq.edu.au, kxkevin@gmail.com, xueqin.wang@csiro.au, paul.coddington@arcs.org.au, andrew.treloar@ands.org.au, Adrienne.McKenzie@anu.edu.au, Donald.Hobern@csiro.au, Robert.Furbank@csiro.au, joanne.evans@unimelb.edu.au |
| For pht3d |
|
| |
Janek.Greskowiak@csiro.au, aaron.mcdonough@csiro.au, vincent.post@falw.vu.nl |
| For aodaac |
|
| |
ken.suber@csiro.au, Edward.King@csiro.au, matt.paget@csiro.au |
| For common-grid-libs |
|
| |
aron.abrook@qut.edu.au (Defunct) |
| For commons-vfs-grid |
|
| |
manish.saroha@its.monash.edu.au |
| For CAPSnav |
|
| |
Edward.King@csiro.au, Leon.Majewski@bom.gov.au, Peter.Turner@csiro.au |
| For DataMINX? |
|
| |
p.turner@chem.usyd.edu.au, alexa@intersect.org.au, andrey@intersect.org.au, carlos@intersect.org.au, Doug.duBoulay@gmail.com, xiao.wang@stfc.ac.uk, nguyen_h@chem.usyd.edu.au, gerson.galang@versi.edu.au |
| For Generic Grid Client |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, ronald@ivec.org, vladimir.mencl@canterbury.ac.nz |
| For grisu |
|
| |
y.halytskyy@auckland.ac.nz |
| For grix |
|
| |
aron.abrook@qut.edu.au (Defunct) |
| For gsub |
|
| |
sam.moskwa@sapac.edu.au |
| For jmoleditor |
|
| |
Joachim.Mai@ac3.edu.au, vvv900@gmail.com, Martin.Thompson@unsw.edu.au |
| For macddap |
|
| |
jason.lohrey@arcitecta.com, R.Williams@utas.edu.au, paolap@utas.edu.au, pblain@postoffice.utas.edu.au |
| For access |
|
| |
martin.dix@csiro.au, l.logan@bom.gov.au, mrezny@sgi.com |
| For mem |
|
| |
chris.jackett@csiro.au, peter.turner@csiro.au, jenny.lovell@csiro.au, r.williams@utas.edu.au |
| For mgplot |
|
| |
frank.colberg@utas.edu.au |
| For portal |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au, ronald@ivec.org |
| For sggc |
|
| |
Terry.Rankine@csiro.au |
| For slcs-client |
|
| |
chi.nguyen@melco.mq.edu.au (Defunct) |
| For ciomp |
|
| |
bkgalton@utas.edu.au, msd@ccpo.odu.edu |
| For hollowworld |
|
| |
See Fullmoon |
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-003 | 02 Nov 2009 - 11:23 | ---+!! MDS VM upgrade at UQ
Description
Upgrade current MDS VM to CentOS 5.3 and latest vdt release.
Estimated Duration
4 hours allocated, 2 hours expected
Systems/Services Affected
all ng2 publishing MDS info to UQ MDS server.
Staff Responsible
WillHsu
Detailed Instructions
Shut down MDS VM. Boot CentOS 5 MDS VM, install vdt and MDS rpms according to the following doco:
http://projects.arcs.org.au/trac/systems/wiki/InfoSystems/CentralIndexServiceInstall
Testing Procedures
Check log output /opt/vdt/globus/var/container-real.log
Manual check with wsrf-query:
wsrf-query -s https://mds1.arcs.org.au:8443/wsrf/services/DefaultIndexService "//*[local-name()='Site']"|less
Back-out Procedures
If any of the tests fail, and cannot be remedied within the allocated time, then the new VM will be turned off, and the previous production VM will be brought back online.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-004 | 23 Oct 2009 - 09:09 | ---+!! SLCS moving to AAF Federation
Description
The host slcs1.arcs.org.au will be moving from the MAMS Level-2 federation to the AAF Pilot Production Federation
Estimated Duration
Allow 5 mins to include testing etc. Actual down time will be less than 3 seconds (the time it takes to restart shibd)
Systems/Services Affected
All users that request short lived certificates
The two AuScope portals that use the SLCS Delgation Service
Grix and Grisu - They use the SLCS Client
DataFabric? ?? - Do they still use the slcs client?
Staff Responsible
SamMorrison
Detailed Instructions
Modify the shibboleth configuration file on slcs1 to point to the AAF Discovery Service instead of the MAMS Where Are You From Service.
Restart shibd
Testing Procedures
Use slick-init command line tool to request a certificate.
Test Grix and Grisu
Test Datafabric
Back-out Procedures
Revert to old shibboleth configuration
Review
Schedule/Notification
Plan to do this on Thursday the 29th Oct at 9:00am
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-005 | 02 Nov 2009 - 11:20 | ---+!! EVO Server Migration
Description
EVO server will be migrated from it's current physical host to a virtual machine.
Estimated Duration
Less than 2 hours
Systems/Services Affected
EVO Web-start and registrations for http://evo.arcs.org.au
Staff Responsible
Andy Botting, Sam Morrison
Detailed Instructions
The EVO server filesystem will be rsync'd across to a newly created filesystem on an ARCS virtual machine host. Once it has been rsync'd, the EVO services will be shut down, and a final rsync will occur to ensure that the new virtual machine has all the latest configuration.
The EVO server will then be shut down, and the virtual machine will be brought up.
Testing Procedures
Test that EVO can be started by the web-start link and new user registrations are successful.
Back-out Procedures
The old physical server can be booted if the migration is unsuccessful.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-006 | 02 Nov 2009 - 11:37 | ---+!! Migrating ARCS Wiki to another host
Description
The ARCS Wiki will be moved from its current virtual machine (hosted on VPAC infrastructure) to a new ARCS virtual machine.
Estimated Duration
Less than 2 hours
Systems/Services Affected
ARCS TWiki, and all included webs.
Staff Responsible
Andy Botting, Lev Lafayette
Detailed Instructions
The TWiki will be rsync'd from it's current host to a new host. When the change window begins, the current TWiki will be made unavailable and a final rsync will occur.
The TWiki will then be started again on the new host.
Cron jobs for TWiki management will also be moved.
Testing Procedures
Testing each of the webs, and doing some page write tests.
Back-out Procedures
Enable the IP address of the TWiki on the old host. Re-enable the TWiki virtual host on the old virtual machine.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-007 | 22 Oct 2009 - 12:18 | ---+!! TITLE
Attaching New Storage to arcs-df.ac3.edu.au
Description
Attaching 4TB storage to Intesct DataFabric? node[arcs-df.ac3.edu.au]
Estimated Duration
1 Hour
Systems/Services Affected
arcs-df.ac3.edu.au
Staff Responsible
Sridhar Reddapani
Detailed Instructions
As Root
service irods stop
cp -r /DataFabric /DataFabric1
umount /DataFabric
unmount /DataFabric1
mount /dev/DfVolGroup/Datafabric /DataFabric/
Add below line to /etc/fstab
/dev/DfVolGroup/Datafabric /DataFabric1/ ext3 defaults,acl 0 0
service irods start
Testing Procedures
check all files are there with icommands
Back-out Procedures
As Root
service irods stop
umount /DataFabric
mount /dev/etherd/e0.0 /DataFabric
service irods start
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-008 | 28 Oct 2009 - 09:24 | ---+!! TITLE
Adding New Index to ICAT Database
Description
Adding New Index [idx_coll_main3] to ICAT database on arcs-df.vpac.org
Estimated Duration
10 Mins
Systems/Services Affected
arcs-df.vpac.org
Staff Responsible
Sridhar Reddapani
Detailed Instructions
As user rods
psql ICAT
create unique index idx_coll_main3 on R_COLL_MAIN (coll_name);
\q
Testing Procedures
psql ICAT
\di
see there are 19 indexes and idx_coll_main3 is listed there
Back-out Procedures
As user rods
psql ICAT
drop index idx_coll_main3;
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200910-009 | 28 Oct 2009 - 10:36 | ---+!! Adding TDS to arcs-df.vpac.org
Description
Installing a THREDDS Data Server on the VPAC data box to serve data through OPeNDAP? .
Estimated Duration
1 hour
Systems/Services Affected
arcs-df.vpac.org and df.arcs.org.au (need a httpd reload/restart)
Staff Responsible
Pauline Mak
Detailed Instructions
Detailed installation notes here: https://projects.arcs.org.au/trac/systems/wiki/DataServices/OPeNDAP-TDS-Install-Apache
Testing Procedures
Back-out Procedures
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-001 | 06 Nov 2009 - 17:03 | ---+!! New Davis update (0.8.1)
Description
Updating davis, which contains a bug fix for trash, etc. See release notes for more detials:
http://projects.arcs.org.au/trac/davis/wiki/ReleaseNotes/0.8.1
Estimated Duration
30 minutes
Systems/Services Affected
ARCS Data Fabric, arcs-df.arcs.org.au
Staff Responsible
Pauline Mak
Detailed Instructions
As root, stop davis and remove symlink:
service davis stop
rm /opt/davis/davis
Download new davis, unpack and recreate the symlink:
cd /opt/davis
wget http://webdavis.googlecode.com/files/davis-0.8.1.tar.gz
tar -zxvf davis-0.8.1.tar.gz
rm davis-0.8.1.tar.gz
rm davis
ln -s davis-0.8.1 davis
chown -R davis:davis davis-0.8.1
Make a new directory for persistent config files and copy config over
mkdir /opt/davis/etc
cp /opt/davis/davis/webapps/root/WEB-INF/host-dfvpac.properties /opt/davis/etc/host-local.properties
And run the new configuration script:
sh davis-configure.sh
Then modify settings in /opt/davis/davis/webapps/root/WEB-INF/davis-host.properties
server-name=localhost
zone-name=ARCS
default-domain=arcs-df.vpac.org
default-resource=
anonymousCollections=/ARCS/worldview,/ARCS/projects/IMOS/public
Copy davis images across to root directory:
cp /opt/davis/davis/webapps/images/* /var/www/html/images
Restart davis (service davis start)
Testing Procedures
Test with litmus, then check with web interface
Make sure that permissions and metadata can be applied to gz files.
Also test other web interface function
Back-out Procedures
Link back to older version of davis
As root:
service davis stop
cd /opt/davis rm davis
ln -s davis-0.8.0 davis
service davis start
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-002 | 19 Nov 2009 - 15:31 | ---+!! Migrate Plone Storage to blobs
Description
Migrate ZODB to Blob based storage. This will improve the speed of plone and allow larger files to be stored more easily.
Estimated Duration
3 Hours
Systems/Services Affected
www.arcs.org.au
Staff Responsible
Russell Sim
David Breitkreutz
Detailed Instructions
extend buildout to include
[instance]
...
eggs +=
plone.app.blob
Products.contentmigration
zcml +=
plone.app.blob
Products.contentmigration
Visit: http://<site>/@@blob-image-migration and http://<site>/@@blob-file-migration to migrate content over to blobs.
Testing Procedures
Test downloading a file and viewing an image.
Back-out Procedures
Backup before begining remove migration section from buildout and revert to backup of database.
Review
Schedule/Notification
6am 1/12/09.
Notification should go to the TechStaff? mailing list
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-003 | 28 Nov 2009 - 10:57 | ---+!! Installing new SLCS client library, Davis 0.8.2
Description
Installing the latest SLCS library to cope with the new AAF Pilot WAYF page as well as updating Davis. Also allowing core dumps to be written to track iRODS crashes.
Estimated Duration
1 hour
Systems/Services Affected
df.arcs.org.au
Staff Responsible
Pauline Mak, Graham Jenkins
Pre-Change
Update /etc/init.d/irods by replacing the first runuser line with:
runuser -s /bin/bash - $IRODS_USER -c \
"ulimit -c 8192; $Env $PORTS $IRODS_HOME/irodsctl $_P"
Detailed Instructions
Stop service, download and install Davis 0.8.2, as root:
service davis stop
cd /opt/davis/
wget http://webdavis.googlecode.com/files/davis-0.8.2.tar.gz
tar -zxvf davis-0.8.2
cd /opt/davis
rm davis
ln -s /opt/davis/davis-0.8.2/ davis
chown -R davis:davis /opt/davis/davis-0.8.2
cd /opt/davis/davis/bin
sh davis-configure.sh
Modify slcs-client properites to point to the slcs1 server:
cd /opt/davis/davis/webapps/root/WEB-INF/classes/
slcs.server=https://slcs1.arcs.org.au/SLCS/login
ssl.blindtrust=false
Then modify the davis config like so (/opt/davis/etc/host-local.properties)
server-name=localhost
zone-name=ARCS
default-domain=arcs-df.vpac.org
default-resource=
anonymousCollections=/ARCS/worldview,/ARCS/projects/IMOS/public
insecureConnection=shib
admin-cert-file=/etc/grid-security/daviscert.pem
admin-key-file=/etc/grid-security/daviskey.pem
shared-token-header-name=auEduPersonSharedToken
cn-header-name=cn
methods=LOCK PROPFIND
organisation-name=Australian Research Collaboration Service
authentication-realm=ARCS Data Fabric
organisation-logo=/images/logo.jpg
organisation-logo-geometry=32x32
favicon=http://www.arcs.org.au/favicon.ico
myproxy-server=myproxy.arcs.org.au
server-type=irods
server-port=1247
default-idp=arcs idp
Save the file and restart Davis
Testing Procedures
Litmus test
Also test using IdP? login with the new AAF Pilot enabled SLCS server.
Back-out Procedures
As the ARCS SLCS server will be using the Pilot AAF, there will be no back out procedure, unless SLCS is also moved back into MAMS.
For some reason, setting blindtrust to false disabled webdav and web based access to the DF. Will investigate issue as it was working on test system.
Review
Schedule/Notification
Tracking completion
Change note applied 28th Nov at 9:15am. Checked with litmus and have passed tests as expected, and also checked on web interface with the TPAC IdP? and Mac Finder mount.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-004 | 15 Dec 2009 - 15:34 | ---+!! Authorisation Service VM domain and IP address change
Description
Intersect will begin using it's own allocation of public IP addresses - 203.161.182.0/24
The Authorisation Team has 16 IP addresses in a borrowed (from AC3) range - migration of these to the Intersect range is a priority because AC3 will soon revoke the borrowed range.
Estimated Duration
Phase 1: Initial test on 1 VM - 1 hour
Phase 2: TBD
Systems/Services Affected
Phase 1: auth13.ac3.edu.au
Staff Responsible
Simon Yin
Detailed Instructions
Step 1: Create an alias IP address on auth13.ac3.edu.au
This involves the following:
ssh root@auth13.ac3.edu.au
cat - <<EOF >/etc/sysconfig/network-scripts/ifcfg-eth0:1
DEVICE=eth0:1
BOOTPROTO=static
IPADDR=203.161.182.113
NETMASK=255.255.255.0
EOF
Step 2: On auth13.ac3.edu.au, edit /etc/resolv.conf and change the nameserver (existing should be 203.202.139.100) to 203.161.182.100
Step 3: On auth13.ac3.edu.au, edit /etc/sysconfig/network and add GATEWAY=203.161.182.253
Step 4: ifup eth0:1
Perform the testing procedures below before moving to the next step.
Step 5: Contact ac3 to change DNS (a-name) to
auth13.ac3.edu.au 203.161.182.113
Repeat testing procedures below but now use auth13.ac3.edu.au instead of the IP address.
Testing Procedures
After Step 4: * While on a command shell in auth13.ac3.edu.au, test the DNS by trying: nslookup www.google.com and confirm that a response is given.
* From another ARCS site, telnet 203.161.182.113 443 and confirm that a response is given.
* From another ARCS site, telnet 203.161.182.113 8443 and confirm that a response is given.
* From another ARCS site, telnet 203.161.182.113 80 and confirm that a response is given.
Back-out Procedures
If the services do not function as expected once the DNS is changed, then it will be necessary to change the hostname from auth13.ac3.edu.au to auth13.intersect.org.au. DNS changes at ac3 (from Step 5) must be undone. auth13.intersect.org.au should now be registered using Sirca's server DNS records in the intersect.org.au domain (both forward and reverse lookups).
Review
Schedule/Notification
cuong.hoang@arcs.org.au
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-005 | 26 Nov 2009 - 16:39 | ---+!! Migrate from MAMS to AAF Pilot
Description
Moving all ARCS federated services from the MAMS federation to the AAF Pilot Federation. This is due to MAMS federation being shutdown on the 30th November
Estimated Duration
5 Mins
Systems/Services Affected
Directly: slcs1.arcs.org.au and services.arcs.org.au hosts
Indirectly: AAF Shibboleth access to the following ARCS Services
* Data Fabric
* ARCS Compute Grid
Staff Responsible
Sam Morrison <sam.morrison at arcs.org.au>
Detailed Instructions
Telling puppet to move all shibboleth SP's in MAMS to AAF. Puppet will handle all configuration changes.
Testing Procedures
Test the following:
* Shibboleth Access to Data Fabric
* Grix Shibboleth Auth
* Grisu Shibboleth Auth
* services.arcs.org.au registration portal
* Use slick command line client
Back-out Procedures
Tell puppet to move it back to MAMS. NOTE: MAMS is being shut down so this is not an option.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-006 | 26 Nov 2009 - 22:10 | ---+!! Minor Grisu & Grix update
Description
Because of the switch to AAF Grix & Grisu need to be updated. In addition to providing new binaries there will also be a small update on the Grisu backend on grisu.vpac.org which adds support for pooled accounts.
Estimated Duration
10 seconds
Systems/Services Affected
Grisu, Grix
Staff Responsible
Markus Binsteiner
Detailed Instructions
Uploading of new Grix & Grisu binaries, exchanging of old grisu war with new one.
Testing Procedures
Manually testing of Grix & Grisu client applications.
Back-out Procedures
Re-instating old binaries
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-007 | 28 Nov 2009 - 10:58 | ---+!! Modifying memory usage on arcs-df.vpac.org and other minor updates
Description
Modifying the amount of memory used by various services on arcs-df.vpac.org
Estimated Duration
20 minutes
Systems/Services Affected
ARCS Data Fabric (arcs-df.vpac.org)
Staff Responsible
Pauline Mak
Detailed Instructions
Modify Postgres to use 1G of memory:
As root, stop iRODS services:
service davis stop
service irods stop
service postgres stop
vim /opt/iRODS/Postgres/pgsql/data/postgresql.conf
Change the shared_buffer value size to 1G:
shared_buffers = 1024MB
Save the file and restart postgres
Modify the Davis setting in:
vim /etc/default/davis
-server -Xms768m -Xmx1024m -XX:+AggressiveHeap -XX:PermSize=192m -XX:MaxPermSize=192m -XX:+UseParallelGC
Then start iRODS and Davis
service irods start
service davis start
Modify the TDS settings:
vim /etc/defaults/tds
JAVA_OPTIONS="-server -Xmx256m -Xms256m -Djava.awt.headless=true"
Testing Procedures
Test that iRODS is functioning, and that davis is working by running litmus test.
Back-out Procedures
Revert davis to previous settings:
-server -Xms1024m -Xmx1562m -XX:+AggressiveHeap -XX:PermSize=192m -XX:MaxPermSize=192m -XX:+UseParallelGC
And postgres back to
shared_buffer=2048MB
Review
Schedule/Notification
Part of the outage on 28th November betwee 9-11 AEDT.
Note that TDS change has already applied 27th Nov at 2:30 AEDT.
Also modifying auto replication rule to use less retries:
acPostProcForPut|"$objPath" not like "/ARCS/trash/*"|delayExec(<PLUSET>1s</PLUSET><EF>30s DOUBLE UNTIL 5 TIMES</EF>,msiSysReplDataObj(ARCS-FABRIC,null),nop)|nop
acPostProcForCopy|"$objPath" not like "/ARCS/trash/*"|delayExec(<PLUSET>1s</PLUSET><EF>30s DOUBLE UNTIL 5 TIMES</EF>,msiSysReplDataObj(ARCS-FABRIC,null),nop)|nop
Tracking completion
Completed 28th November 2009, 10:58.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-008 | 22 Dec 2009 - 08:45 | ---+!! Upgrade iRODS Servers to Centos 5.4
Description
This Change is intended to improve the resilience of iRODS servers under heavy load conditions, and to reduce the likelihood of a kernel panic when writing to an XFS filesystem (particularly on a 32-bit system) under such conditions.
A standard set of 'yum' operations is used to download and install the necessary operating system patches and updated kernel. Two additional lines are then added to the file /etc/rc.local to ensure that the console doesn't clear during reboots; this is intended an an aid in crash diagnosis.
A reboot is then performed.
Estimated Duration
Sixty minutes.
Systems/Services Affected
Phase 1: srbdev.vpac.org, irodsdev.vpac.org and other development machines as appropriate .. no Production service outage.
Phase 2: arcs-df.vpac.org .. Production service outage required!
Phase 3: arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.qcif.edu.au, arcs-df.tpac.org.au, emii.resource.tpac.org.au, arcs-df.eresearchsa.edu.au, arcs-df.hpsc.csiro.au, datastore.hpsc.csiro.au .. Production service outage required!
Staff Responsible
VPAC: Graham Jenkins
TPAC: Pauline Mak
Pre-Change Tests
As the user rods issue the following commands:
iput /etc/group
ils -l group
irm -f /etc/group
Also execute (as yourself) the iput/ils/irm tests from your workstation.
Create a myproxy identity using slix.jar and use it to upload, list and delete some files using a browser and a mounted filesystem.
Pre-Change Certificate Update (Phase 2 only)
As the root user, perform the following operations:
cp /etc/httpd/conf.d/ssl.conf /var/tmp
vi /etc/httpd/conf.d/ssl.conf
Comment the 3 lines starting at line 109
SSLCertificateFile /etc/grid-security/df-hostcert.pem .. etc.
Uncomment the 3 lines starting at line 113
#SSLCertificateFile /etc/httpd/ssl/hostcert.pem
Then:service httpd restart
And finally .. start a new browser session and test Davis operation as above.
Detailed Instructions
As the root user, add the following lines to the end of the file: /etc/rc.local
# Set the screen blank timeout to 0 mins
echo -e "\033[9;0]" >/dev/console
# Unblank the screen
echo -e "\033[13]" >/dev/console
Then perform the following steps:
init 2 # Should shut down irods, postgres, etc.)
yum clean all
yum update glibc\*
yum update yum\* rpm\* python\*
yum clean all
yum --disablerepo=rpmforge update # As required
init 6
If the last 'yum update' directive fails with a dependency issue, you will need to add to the file /etc/yum.conf lines like the following so as to circumvent attempted updates of RPMs not sourced from standard repositories.
# Local exclusions
exclude shibboleth\*
exclude log4shib\*
Then re-execute:
yum update
init 6
Testing Procedures
Perform the following test (as the rods user)
lsb_release -a # Should show: Release: 5.4
Then repeat the Pre-Change Tests as shown in an earlier paragraph.
Back-out Procedures
None available
Review
Schedule/Notification
Phase 2: 0800 - 0900 Tues. Dec. 22
Tracking completion
Phase 2: Completed!
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200911-009 | 30 Nov 2009 - 14:00 | ---+!! ARCS IdP? - Software upgrade
Description
Update the software the ARCS IdP? uses.
* Upgrade shibboleth from 2.1.2 -> 2.1.5
* Do general OS update
Estimated Duration
30 minutes
Systems/Services Affected
All users that use the ARCS IdP?
Staff Responsible
SamMorrison
Detailed Instructions
https://spaces.internet2.edu/display/SHIB2/IdP2Upgrade
Testing Procedures
Log into a shibboleth protected website with the ARCS IdP?
Back-out Procedures
Save old code base and revert to old WAR if needed
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200912-001 | 04 Dec 2009 - 09:06 | ---+!! Enabling permission cronjob on VPAC
Description
Installing a cron job that will change file permissions to allow the THREDDS Data Server to read files.
Estimated Duration
30 minutes
Systems/Services Affected
arcs-df.vpac.org. No outage required
Staff Responsible
Pauline Mak
Detailed Instructions
Download script twice from SVN: cd $IRODS_HOME/server/bin/local
wget http://projects.arcs.org.au/trac/systems/export/1250/trunk/dataFabricScripts/iRODS/utils/opendap_acl.sh
Modify one script for TCMIP (opendap_acl_tcmip.sh) and another for the PUBLUMW (opendap_acl_publumw.sh) group.
vaultDir=/data/Vault
user=jetty
Modify the tree variable like so:
tree="ARCS projects TCMIP"
tree="ARCS projects PUBLUMW"
The add the scripts to the rod user's crontab:
#OPeNDAP Rules
0 * * * * /opt/iRODS/iRODS/server/bin/local/opendap_acl_tcmip.sh >/dev/null 2>&1 || :
0 * * * * /opt/iRODS/iRODS/server/bin/local/opendap_acl_publumw.sh >/dev/null 2>&1 || :
Testing Procedures
Run the cron job and check the permissions by using getfacl
Back-out Procedures
Remove scripts from the iRODS home directory and also remove the scripts from the rods user's crontab.
Review
Schedule/Notification
Tracking completion
Completed 4th December 2009.
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200912-002 | 15 Dec 2009 - 16:31 | ---+!! Updating TDS to version 4.1 on iVEC
Description
Updating TDS to the next version because of caching issue and also WMS does not correctly handle fill values for colour scale range (which are fixed in 4.1). Note that this is still an alpha release.
Estimated Duration
30 minutes
Systems/Services Affected
opendap-ivec.arcs.org.au
Staff Responsible
Pauline Mak
Detailed Instructions
Download the latest version from Unidata:
cd /tmp
wget ftp://ftp.unidata.ucar.edu/pub/thredds/4.1/thredds.war
Install new version:
service tds stop
unzip thredds.war tds
mv /opt/tds/webapps/thredds ~/thredds_4.0
mv tds /opt/tds/webapps/thredds
Change all of the configured catalogs to not use cache, i.e. add the folllowing attribute to each datasetScan element:
cache="false"
Restart the server
Testing Procedures
Load up thredds in the browser to make sure data can be retrieved. Check the version number in the footer to make sure it is now 4.1+
Back-out Procedures
Stop the server
Remove new thredds webapp
Move ~/thredds_4.0 back into /opt/tds/webapps
Start the server
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200912-003 | 26 Feb 2010 - 15:59 | ---+!! NCI NGGUMS Upgrade
Description
NGGums upgrade for NCI NF. Involves update to VDT 1.10.1 and Centos 5.4
Estimated Duration
2 hours
Systems/Services Affected
Grid Gateway for terrawulf.anu.edu.au
Staff Responsible
Paul Warren
Detailed Instructions
Set up a new VM image as detailed at http://projects.arcs.org.au/trac/systems/wiki/HowTo/InstallNgGums
test, switch over ip addresses, turn off old machine.
Testing Procedures
Use a selection of Grid Certificates and VOs to test that mappings work correctly
Back-out Procedures
Reinstate old VM image
Review
Schedule/Notification
2009-12-14
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200912-004 | 04 Jan 2010 - 15:55 | ---+!! Installing TDS 4.1 stable and awstats for opendap-*.arcs.org.au
Description
Updating all TDS instances to the 4.1 stable and install awstat to keep usage statstics
Estimated Duration
2 hours
Systems/Services Affected
opendap-vpac.arcs.org.au, opendap-ivec.arcs.org.au, opendap-tpac.arcs.org.au, opendap-ersa.edu.au, opendap-intersect.edu.au, opendap-qcif.edu.au
The ARCS Data Fabric - as awstat requires a restart of the httpd server.
Staff Responsible
Pauline Mak
Detailed Instructions
Modify the Apache log in /etc/logrotate.d/httpd with the following text:
/var/log/httpd/*log {
compress
missingok
notifempty
rotate 2
sharedscripts
size=1M
postrotate
/sbin/service httpd reload > /dev/null 2>/dev/null || true
endscript
}
Then as root, install awstats:
yum install awstats
This will install awstat to /usr/share/awstats
cd /usr/share/awstats/
tools/awstats_configure.pl
Then answer the quesiton as follows, substituion irodsdev.arcs.org.au with the host name of the server
Enter full config file path of your Web server.
Example: /etc/httpd/httpd.conf
Example: /usr/local/apache2/conf/httpd.conf
Example: c:\Program files\apache group\apache\conf\httpd.conf
Config file path ('none' to skip web server setup):
> /etc/httpd/conf/httpd.conf
-----> Need to create a new config file ?
Do you want me to build a new AWStats config/profile
file (required if first install) [y/N] ? y
-----> Define config file name to create
What is the name of your web site or profile analysis ?
Example: www.mysite.com
Example: demo
Your web site, virtual server or profile name:
> irodsdev.arcs.org.au
-----> Define config file path
In which directory do you plan to store your config file(s) ?
Default: /etc/awstats
Directory path to store config file(s) (Enter for default):
>
-----> Create config file '/etc/awstats/awstats.irodsdev.arcs.org.au.conf'
Config file /etc/awstats/awstats.irodsdev.arcs.org.au.conf created.
Then add the following to root's crontab:
#update all statstics on a web server, which may have more than one virtual host
* 0 * * * /usr/bin/perl /usr/share/awstats/wwwroot/cgi-bin/awstats.pl/awstats_updateall.pl now
Download the new TDS 4.1
Testing Procedures
Back-out Procedures
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200912-005 | 18 Dec 2009 - 11:36 | ---+!! Enabling PUBLIC folders for all users
Description
Creating new public folders that will allow users of the ARCS Data Fabric to share files with anyone.
Estimated Duration
1 hour
Systems/Services Affected
df.arcs.org.au, arcs-df.vpac.org
Staff Responsible
Pauline Mak, Graham Jenkins
Detailed Instructions
Stop davis
Install new createUser script
cd /opt/iRODS/iRODS/server/bin/cmd
wget http://projects.arcs.org.au/trac/systems/export/1274/trunk/dataFabricScripts/iRODS/utils/createUser.pl-v3.08
Install createInbox script under /usr/local/bin
wget http://projects.arcs.org.au/trac/systems/export/1274/trunk/dataFabricScripts/iRODS/utils/createInbox.sh
Run createInbox.sh -a to create a INBOX and PUBLIC box for every existing user
Download inbox rules script from subversion
Update arcs.irb to include new inbox rules
Modify Davis to make __PUBLIC a public directory
Restart davis
Testing Procedures
Tested on ngdata-dev.
Login through davis and upload a file in your public directory. Then start a new browser session and try and download the file without logging into the data fabric.
Test putting in a file in another user's inbox, and check that once uploaded that you can no longer see it and that the other user can read and modify the file.
Back-out Procedures
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote200912-006 | 22 Dec 2009 - 13:16 | ---+!! UPDATE CERTIFICATES ON ERSA.EDU.AU HOSTED SERVICES
Description
Update certificates on the systems listed below to ensure continuity of service.
Estimated Duration
1 hour
Systems/Services Affected
arcs-df.ersa.edu.au sakai.arcs.org.au idp.ersa.edu.au authsvc.ersa.edu.au
Staff Responsible
David Logan
Detailed Instructions
On arcs-df.ersa.edu.au
Backup existing certificates and keys
/etc/grid-security/hostcert.pem
/etc/grid-security/hostkey.pem
/etc/grid-security/IPS-IPSCABUNDLE.CRT
Copy the following files to the following locations
/root/startssl_root_certs/arcs-df.ersa.edu.au_req.pem /etc/grid-security/hostcert.pem
/root/startssl_root_certs/hostkey.pem /etc/grid-security/hostkey.pem
/root/startssl_root_certs/sub.class1.server.ca.pem /etc/grid-security/sub.class1.server.ca.pem
/root/startssl_root_certs/ca.pem /etc/grid-security/ca.pem
Modify the following parameters in the /etc/httpd/conf.d/ssl.conf file
SSLCertificateChainFile? /etc/grid-security/sub.class1.server.ca.pem
SSLCACertificateFile? /etc/grid-security/ca.pem
# service httpd restart
***************************************************************************
on server authsvc.ersa.edu.au
Backup existing certificates and keys
/usr/local/ssl/private/authsvc.eresearchsa.edu.au.crt
/usr/local/ssl/certs/authsvc.eresearch.edu.au.key
/usr/local/ssl/private/IPS-IPSCABUNDLE.CRT
Copy the following files to the following locations
/root/startssl_root_certs/authsvc.ersa.edu.au_req.pem /usr/local/ssl/private/authsvc.eresearchsa.edu.au.crt
/root/startssl_root_certs/hostkey.pem /usr/local/ssl/certs/authsvc.eresearchsa.edu.au.key
/root/startssl_root_certs/sub.class1.server.ca.pem /usr/local/ssl/private/sub.class1.server.ca.pem
/root/startssl_root_certs/ca.pem /usr/local/ssl/private//ca.pem
Modify the following parameters in the /etc/httpd/conf.d/ssl.conf file
SSLCertificateChainFile? /usr/local/ssl/private/sub.class1.server.ca.pem
SSLCACertificateFile? /usr/local/ssl/private/ca.pem
# service httpd restart
***************************************************************************
on server idp.ersa.edu.au
Backup existing certificates and keys
/etc/pki/tls/certs/localhost.crt
/etc/pki/tls/private/localhost.key
/etc/pki/tls/certs/IPS-IPSCABUNDLE.crt
Copy the following files to the following locations
/root/startssl_root_certs/idp.ersa.edu.au_req.pem /etc/pki/tls/certs/localhost.crt
/root/startssl_root_certs/hostkey.pem /etc/pki/tls/private/localhost.key
/root/startssl_root_certs/sub.class1.server.ca.pem /etc/pki/tls/certs/sub.class1.server.ca.pem
/root/startssl_root_certs/ca.pem /etc/pki/tls/certs/ca.pem
Modify the following parameters in the /etc/httpd/conf.d/ssl.conf file
SSLCertificateChainFile? /etc/pki/tls/certs/sub.class1.server.ca.pem
SSLCACertificateFile? /etc/pki/tls/certs/ca.pem
Modify the following parameters in the /etc/httpd/conf.d/ssl-federation.conf file
Remove line SSLCACertificatePath?
Replace with SSLCertificateChainFile? /etc/pki/tls/certs/sub.class1.server.ca.pem
Add SSLCACertificateFile? /etc/pki/tls/certs/ca.pem
# service httpd restart
***************************************************************************
on server sakai.arcs.org.au
Backup existing certificates and keys
/etc/httpd/ssl/sakai.arcs.org.au_20090325_100958.crt
/etc/httpd/ssl/sakai.arcs.org.au_20090325_100958.key
/etc/httpd/ssl/IPS-IPSCABUNDLE.CRT
Copy the following files to the following locations
/root/startssl_root_certs/sakai.ersa.edu.au_req.pem /etc/httpd/ssl/sakai.arcs.org.au.crt
/root/startssl_root_certs/hostkey.pem /etc/httpd/ssl/sakai.arcs.org.au.key
/root/startssl_root_certs/sub.class1.server.ca.pem /etc/httpd/ssl/sub.class1.server.ca.pem
/root/startssl_root_certs/ca.pem /etc/httpd/sslca.pem
Modify the following parameters in the /etc/httpd/conf.d/ssl.conf file
SSLCertificateFile? /etc/httpd/ssl/sakai.arcs.org.au.crt
SSLCertificateKeyFile? /etc/httpd/ssl/sakai.arcs.org.au.key
SSLCertificateChainFile? /etc/httpd/ssl/sub.class1.server.ca.pem
SSLCACertificateFile? /etc/httpd/ssl/ca.pem
# service httpd restart
Testing Procedures
Ensure web services can be accessed without issue
Back-out Procedures
Copy back keys and certificates backed up in above procedure
Review
Log to each service and ensure it is working as planned.
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201001-001 | 20 Jan 2010 - 17:23 | ---+!! Migrate ESSCC VMs to Dell Xen host
Description
Migrating ESSCC production VMs to Dell Xen host that is under maintenance contract, and eventually free up the rack space taken up by IBM x346 server.
Estimated Duration
Less than 2 hours to migrate VMs and reboot Dell Xen host. Allocate 4 hours to allow for any unexpected scenarios.
Systems/Services Affected
All UQ and UQ ESSCC VMs:
ng2.hpcu.uq.edu.au
nggums.hpcu.uq.edu.au
ngdata.hpcu.uq.edu.au
ngmds.hpcu.uq.edu.au
myproxy1.arcs.org.au
ng2.esscc.uq.edu.au
nggums.esscc.uq.edu.au
cg-gw.nravs.uq.edu.au will not be able to accept jobs due to nggums.hpcu.uq.edu.au downtime.
Staff Responsible
WillHsu
Detailed Instructions
Shutdown all ESSCC VMs, copy/migrate VM files to Dell Xen host.
Relocate ESSCC network connection from IBM Xen host to Dell Xen host.
Configure Dell Xen host for extra network connection, shutdown all UQ VMs and reboot.
Starts UQ and ESSCC VMs and test with job submissions.
Testing Procedures
submit some test jobs from globus client to ng2
http://wiki.arcs.org.au/bin/view/APACgrid/TestSuite
Back-out Procedures
Relocate ESSCC network connection from Dell Xen host back to IBM Xen host.
Start ESSCC VMs from IBM Xen host.
Revert changes to Dell Xen host and reboot.
Review
Schedule/Notification
Notification sent on 12/Jan/2010 to community, developers, techstaff
Tracking completion
100% complete
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201001-002 | 13 Jan 2010 - 12:53 | ---+!! Migrate BCEES VM to an LVM Based VM
Description
Migrating the BCEES VM from a file base to an LVM base. Also increasing the disk space to 16GB due to growing SQL database
Estimated Duration
1 Hour
Systems/Services Affected
bcees.hpsc.csiro.au, https://bcees.org.au
Staff Responsible
Joel Ludbey-Bruhwel
Detailed Instructions
Shutdown bcees VM
Create same size logical volume to migrate to
sudo lvcreate -L 8G -n bcees vg0
DD filesystem to logical volume
sudo dd if=/xen_guests/bcees.img of=/dev/vg0/bcees
Make relevant change to xen config file /etc/xen/bcees.cfg
Start up VM and test
Shutdown VM again and increase disk space by 8GB
sudo lvextend -L +8G /dev/vg0/bcees
Start up VM, test and archive old disk images if everything works
Testing Procedures
SSH to Host
Check https://bcees.org.au
Upload some test files to msql database via https://bcees.org.au
Back-out Procedures
Restore backed up /etc/xen/bcees.cfg and reboot file based VM
Review
Schedule/Notification
Have already notified Troy Sadkowsky of this and he is notifying the relevant users of bcees.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201001-003 | 05 Feb 2010 - 11:39 | ---+ Deploying Access Service Integration to the ARCS Data Fabric
Description
Estimated Duration
3 hour
Systems/Services Affected
df.arcs.org.au
Staff Responsible
Pauline Mak
Detailed Instructions
__It is assumed that the Access Service has been installed at this point.
Backup the Database
(Based on previous change note: http://wiki.arcs.org.au/bin/view/Main/ChangeNote200908-001)
On df.arcs.org.au, shut down services:
service httpd stop
service davis stop
service irods stop
On all ARCS DF hosts, shut down iRODS:
df.arcs.org.au, arcs-df.sf.utas.edu.au, arcs-df.ivec.org, arcs-df.ac3.edu.au, arcs-df.eresearchsa.edu.au, arcs-df.hpcu.uq.edu.au, arcs-df.hpsc.csiro.au
service irods stop
Backup ICAT with pgdump on df.arcs.org.au
mkdir /tmp/Backups pg_dump ICAT > /tmp/Backups/ICAT_DUMP_BEFORE_AS_DF
Take full ICAT backup on df.arcs.org.au
service postgres stop su - rods cd ~/DB/pgsql tar -cf /tmp/Backups/data.tar data/
Start iRODSon all hosts
service irods start
Ingest Users into the Access Service
Export existing users details:
for U in `iadmin lu` ; do iadmin lu $U; echo; done > df_users.txt
Get the list of ARCS IdP? users, extract data, then run match script.
Double check that the number of users in df_users.txt matches the number of users in the CSV (minus users like rods)
Remove Anh and Pauline from export file
Go to AS admin interface and upload the CSV file.
Update createUser script
Update the arcs Rules file in subversion to no longer call the createUser script, by removing line 4. Commit the change.
Add sync user script
As the root user, Check that perl-XML-XPath amd perl-Crypt-SSLeay are installed, if not:
yum install perl-Crypt-SSLeay perl-XML-XPath
Then as the rods user, download the script:
cd /usr/local/bin
wget -N http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/utils/syncUsers.pl
Update the crontab to run this once every 5 minutes:
*/5 * * * * ( . /etc/profile.d/irods.sh; /usr/local/bin/syncUsers.pl arcs-data@lists.arcs.org.au ) >/dev/null 2>&1 || :
DO NOT RESTART IRODS SERVERS YET.
Update certificate on iVEC
The host certificate (and iRODS host certificate) will need to be updated on iVEC.
Restart iRODS servers
Update Davis to 0.8.3
service davis stop cd /tmp
wget http://webdavis.googlecode.com/files/davis-0.8.3.tar.gz
tar -zxvf davis-0.8.3.tar.gz
mv davis-0.8.3 /opt/davis
chown -R davis:davis /opt/davis/davis-0.8.3
cd /opt/davis
rm davis
ln -s /opt/davis/davis-0.8.3 davis
cd /opt/davis/davis/bin sh davis-configure.sh
Then modify the davis configuration file under /opt/davis/etc/host-local.configuration: default-idp=arcs
arcs-myproxy-server=mp-ca.arcs.org.au
authClass=au.org.arcs.davis.ARCSAuthorizationProcessor
Update Davis to use the slcs1 server. Modify the file:
vi /opt/davis/davis/webapps/root/WEB-INF/classes/slcs-client.properties
The change:
slcs.server=https://slcs1.arcs.org.au/SLCS/login
Make sure davis-organisation does not contain settings for the dynamic objects Add in MyProxy? CA bundle to arcs-df.vpac.org. The certificate and signing policies should go into /etc/grid-security/certificates
Get the certificates from ??? Update vdt update script to include this new CA:
vim /opt/vdt/vdt/etc/vdt-update-certs.conf
And add in the following lines:
include=/etc/grid-security/157c0362.0
include /etc/grid-security/157c0362.signing_policy
Then restart Davis
service davis start
service httpd start
Testing Procedures
*This is very important* Test that existing login (using SLCS) continues to work. Run through items 2-4 from the Data Fabric/Access Service Integration plan: https://projects.arcs.org.au/trac/systems/wiki/AuthServices/DataFabricIntegration
Back-out Procedures
Restore Data Fabric database:
cp -r /opt/iRODS-2.0v/Postgres/pgsql/data /tmp/data_bkp
service postgres start dropdb ICAT createdb ICAT psql ICAT < /tmp/ICAT_DUMP_BEFORE_AS_DF
Revert Davis to version 0.8.2
rm /opt/davis/davis cd /opt/davis/ ln -s /opt/davis/davis-0.8.2 davis
Use old createUser script
cd /opt/iRODS-2.1v/iRODS/server/bin/cmd
mv createUser-beforeASDF createUser
Remove syncUsers.pl from rod user's cron job Insert the createUser rule in arcs.irb:
acGetUserByDN(*arg,*OUT)||msiExecCmd(createUser,'"*arg"',null,null,null,*OUT)|nop
Commit changes back to subversion
Review
Schedule/Notification
Scheduled for Friday evening at 5pm Eastern time.
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201001-004 | 20 Jan 2010 - 14:30 | ---+!! TITLE
Enhancement of Activity Report Program on ARCS IDP Server
Description
User activity on idp.arcs.org.au is recorded in a database on db-2.arcs.org.au for later analysis by ARCS managment staff. This information is used in securing on-going funding.
Information is gathered through periodic (hourly) execution of the 'ShibReport.sh' program located in '/usr/local/bin'. This program parses the most recent log files, and generates usage records in MySQL tables which contain username and source information.
It was observed that log files can roll at indeterminate times, and to cover this case, earlier incarnations of the program were modified so as to parse all lines in the three most recent log files. The usage level has now grown to such an extent that a single invocation of the program can run for several minutes.
This change involves replacement of that program by one which parses only the lines in log files which changed in the previous 125 minutes.
Estimated Duration
This Change involves replacement and testing of a single program. It is of 60 minutes duration.
Systems/Services Affected
idp.arc.org.au
Staff Responsible
Graham Jenkins
Detailed Instructions
On idp.arc.org.au as the 'root' user, do:
cd /usr/local/bin
cp ShibReport.sh ShibReport.sh.20100120
Replace it with attached version.
Testing Procedures
Execute the script using the command shown in 'root' crontab.
On a workstation with appropriate query access. do:
select * from idp_stats_ip where id like "20100120%";
select * from idp_stats_user where id like "20100120%"
Ensure that recent activity is shown by both queries.
Back-out Procedures
Restore and test the orginal version of the program.
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201002-001 | 22 Feb 2010 - 13:06 | ---+!! Intersect - 2 Hour Network Outages Starting 9:00am on Sunday 21 February 2010 and Sunday 28 February 2010
Description
This change is to allow Intersect's hosting company AC3 to perform Fiber Link upgrades.
Estimated Duration
2 hours (21 Feb) + 2 hours (28 Feb)
Systems/Services Affected
All services provided by Intersect will be unavailable:
| projects.arcs.org.au - Trac and Subversion sites. The standby will also be unavailable |
|
| arcs-df.ac3.edu.au |
All Authorisation services (auth01-auth16).ac3.edu.au |
| ng2.ac3.edu.au |
hpc-ra.intersect.org.au |
| idp.ac3.edu.au |
idp.intersect.org.au |
| nggums.ac3.edu.au |
|
| |
|
Staff Responsible
Simon Yin - simon.yin@arcs.org.au
Detailed Instructions
No physical work on the machines - all activity is conducted by 3rd parties through AC3.
Testing Procedures
A sanity check of the affected sites will be performed following the completion of the work.
Back-out Procedures
None
Review
Schedule/Notification
Notification by email:
| For ARCS Internal Projects |
|
| |
techstaff@arcs.org.au, developers@arcs.org.au |
| For cawcr |
|
| |
A.Moise@bom.gov.au, Arnold.Sullivan@csiro.au, b.hu@bom.gov.au, Craig.Heady@csiro.au, Damien.Irving@csiro.au, david.kent@csiro.au, Didier.Monselesan@csiro.au, i.macadam@unsw.edu.au, J.Arblaster@bom.gov.au, j.sisson@bom.gov.au, Janice.Bathols@csiro.au, John.Clarke@csiro.au, Julian.O'grady@csiro.au, l.hanson@bom.gov.au, L.Rikus@bom.gov.au, Lauren.Stevens@csiro.au, Les.Muir@csiro.au, Martin.Dix@csiro.au, Peter.Vanrensch@csiro.au, Sarah.Perkins@csiro.au, Skye.Platten@csiro.au, Stacey.Osbrough@csiro.au, tim.erwin@csiro.au, Tony.Rafter@csiro.au, Will.Thurston@csiro.au |
| For geosciml |
|
| |
a.tellez-arenas@brgm.fr, ben.caradoc-davies@csiro.au, eric.boisvert@rncan-nrcan.gc.ca, ryan.fraser@csiro.au, trd@bgs.ac.uk |
| For ciomp |
|
| |
bkgalton@utas.edu.au, frank.colberg@utas.edu.au, john.hunter@utas.edu.au, kate@arsc.edu, msd@ccpo.odu.edu, tore.hattermann@npolar.no |
| For hollowworld |
|
| |
See Fullmoon |
| For sggc |
|
| |
Terry.Rankine@csiro.au |
| For portal |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, aron.abrook@qut.edu.au |
| For mgplot |
|
| |
frank.colberg@utas.edu.au |
| For mem |
|
| |
chris.jackett@csiro.au, peter.turner@csiro.au, jenny.lovell@csiro.au, r.williams@utas.edu.au |
| For access |
|
| |
martin.dix@csiro.au, l.logan@bom.gov.au, mrezny@sgi.com |
| For macddap |
|
| |
jason.lohrey@arcitecta.com, R.Williams@utas.edu.au, paolap@utas.edu.au, pblain@postoffice.utas.edu.au |
| For jmoleditor |
|
| |
joachim.mai@intersect.org.au, Martin.Thompson@unsw.edu.au |
| For gsub |
|
| |
sam.moskwa@sapac.edu.au (defunct) |
| For grisu |
|
| |
y.halytskyy@auckland.ac.nz |
| For Generic Grid Client |
|
| |
ryan.fraser@csiro.au, Terry.Rankine@csiro.au, ronald@ivec.org, vladimir.mencl@canterbury.ac.nz |
| For DataMINX? |
|
| |
p.turner@chem.usyd.edu.au, alexa@intersect.org.au, andrey@intersect.org.au, carlos@intersect.org.au, Doug.duBoulay@gmail.com, xiao.wang@stfc.ac.uk, nguyen_h@chem.usyd.edu.au, gerson.galang@versi.edu.au |
| For CAPSnav |
|
| |
Edward.King@csiro.au, Leon.Majewski@bom.gov.au, Peter.Turner@csiro.au |
| For aodaac |
|
| |
ken.suber@csiro.au, Edward.King@csiro.au, matt.paget@csiro.au |
| For pht3d |
|
| |
Janek.Greskowiak@csiro.au, aaron.mcdonough@csiro.au, vincent.post@falw.vu.nl |
| For podd |
|
| |
gkam08@gmail.com, liyf@itee.uq.edu.au, james.eddes@adelaide.edu.au, jane@itee.uq.edu.au, ben.joseph@utas.edu.au, f.davies@uq.edu.au, kai.xu@anu.edu.au, xueqin.wang@csiro.au, paul.coddington@arcs.org.au, andrew.treloar@ands.org.au, Adrienne.McKenzie@anu.edu.au, Donald.Hobern@csiro.au, Robert.Furbank@csiro.au, joanne.evans@unimelb.edu.au, xavier.sirault@csiro.au |
| For Fullmoon |
|
| |
simon.cox@jrc.ec.europa.eu, ryan.fraser@csiro.au, Alexandre.Robin@spotimage.fr, bryan.lawrence@stfc.ac.uk, eboisver@nrcan.gc.ca, Francois.Letourneau@RNCan-NRCan.gc.ca, Jacek.Radajewski@csiro.au, johannes.echterhoff@igsi.eu, Jonathan.Yu@csiro.au, mggr@pml.ac.uk, pavel.golodoniuc@csiro.au, Peter.Taylor@csiro.au, pcd@ecosystem.com, Rick.Meng@csiro.au, rob.atkinson@csiro.au, robert.woodcock@csiro.au, rocky@cc.gatech.edu, stefan.hansen@lisasoft.com, oliver.raymond@ga.gov.au, Bruce.Simons@dpi.vic.gov.au |
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201002-002 | 11 Feb 2010 - 15:30 | ---+!! Cron Job for Moving IMOS-ANFOG Staging Data
Description
This note describes the steps necessary to create a cron job for moving IMOS-ANFOG staging data to archive, opendap and public directories.
Estimated Duration
10 minutes
Systems/Services Affected
Only iVEC DF machine: arcs-df.ivec.org
Staff Responsible
KaiLu
Detailed Instructions
As user rods:
run:
cd /opt/iRODS/iRODS/server/bin/local
wget http://projects.arcs.org.au/svn/systems/trunk/dataFabricScripts/iRODS/utils/ANFOG_Data_Moving.sh/?format=raw -O ANFOG_Data_Moving.sh
chmod +x ANFOG_Data_Moving.sh
crontab -e and adding the following line:
0 23 * * * /opt/iRODS/iRODS/server/bin/local/ANFOG_Data_Moving.sh >> /opt/iRODS/iRODS/server/log/ANFOG-Moving-Data 2>&1
The cron job will be run once per day
Testing Procedures
As user rods:
run:
/opt/iRODS/iRODS/server/bin/local/ANFOG_Data_Moving.sh >> /opt/iRODS/iRODS/server/log/ANFOG-Moving-Data 2>&1
and check if:
the data files has been moved from directory staging into the directoties: archive, opendap and public
Back-out Procedures
As user rods:
run:
crontab -e and remove the following line:
0 23 * * * /opt/iRODS/iRODS/server/bin/local/ANFOG_Data_Moving.sh >> /opt/iRODS/iRODS/server/log/ANFOG-Moving-Data 2>&1
rm -f /opt/iRODS/iRODS/server/bin/local/ANFOG_Data_Moving.sh
rm -f /opt/iRODS/iRODS/server/log/ANFOG-Moving-Data
Review
Schedule/Notification
Tracking completion
Done. -- KaiLu - 10 Feb 2010
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201002-003 | 19 Feb 2010 - 15:05 | ---+!! VPAC Database service downtime
Description
VPAC DB server is being hardened
Estimated Duration
1 hour although actual time should be approx. 10 mins
Systems/Services Affected
RT web interface (note all rt emails will be queued on mail server so no data lost)
ARCS IdP? Registration
services.arcs.org.au website
status.arcs.org.au website
code.arcs.org.au - Some services will be unavailable (hudson and gitorious will be unaffected)
VTiger
Staff Responsible
SamMorrison - VPAC
Detailed Instructions
This email is to notify you of a scheduled VPAC system change:
System affected: VPAC Database Server; VPAC website; RT (helpdesk software); Jabber (chat software); Mailing Lists; Placard; Portals: AutoCRC? , BushfireCRC? , Co2CRC? , Ewater, RMIT, Project Management Portal; Timesheet.
Reason: Upgrading Database Server for increased stability and redundancy
Date of change: 26 February 2010
Time of change: 17:00 AEDT
Duration of outage: 1 hour
Impact: No functionality changes, no-data will be lost, transactions will be locked for the duration of the change over.
Notes: See Laszlo Kun or Brett Pemberton for details.
For further information email help@vpac.org or telephone (03) 9925 4410
Regards,
VPAC Systems Department
Testing Procedures
Visit websites
Back-out Procedures
Can't as rely on VPAC DB
Review
Schedule/Notification
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201003-001 | 10 Mar 2010 - 12:36 | ---+!! Reboot ARCS Sakai Server
Description
Reboot the ARCS Sakai VM on arcturus.ersa.edu.au to allow an lvextend operation and resize2fs of the root partition as it has now reached 83% capacity and is generating warnings on an ongoing basis.
Estimated Duration
30 minutes
Systems/Services Affected
ARCS Sakai server arcssakai.ersa.edu.au
Staff Responsible
David Logan
Detailed Instructions
Take a copy of the arcssakai-root lv prior to anything changing
On arcssakai
# /etc/shutdown
On Arcturus
# lvcreate -L 6G -n arcssakai-root-backup VolGroup00?
# dd if=/dev/VolGroup00/arcssakai-root of=/dev/VolGroup00/arcssakai-root-backup
On Arcturus (this extends the lv)
# lvextend +2G /dev/VolGroup00/arcssakai-root
# xm create arcssakai (This will recreate the filesystem at the new size and allow the resize2fs to take place)
On arcssakai (this extends the actual ext3 filesystem to the maximum size of the )
# resize2fs /dev/sda1 lvcreate -L 6G -n arcssakai-root-backup VolGroup00?
Testing Procedures
Use df to ensure new / partition is correct size
Back-out Procedures
Recreate the old boot logical volume by using the dd command
On Arcturus
# xm shutdown arcssakai 1010
# lvremove VolGroup00? /arcssakai-root
# lvcreate -L 6G -n arcssakai-root VolGroup00?
# dd if=/dev/VolGroup00/arcssakai-root-backup of=/dev/VolGroup00/arcssakai-root
# xm create arcssakai
Review
Schedule/Notification
All users of sakai need to be notified . Scheduled outage to be for 30 minutes starting at 5:00pm ACDT 12 March 201. This will be done just before the University of Adelaide's scheduled power outage (refer to ChangeNote? 201003-002).
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNote201003-002 | 10 Mar 2010 - 12:41 | ---+!! Power outage at eRSA
Power Outage at eRSA - 12th - 15th of March 2010
Description
The power will be disconnected to the equipment room in the Physics Building at the University of Adelaide over the period described above for maintenance to the power systems.
Estimated Duration
63.5 hours
Systems/Services Affected
idp
mds
imos0
imos1
imos2
arcturus
arcssakai
ng2
nggums
ngdata
inca
imos
datadev
arcs-df
Staff Responsible
David Logan
Detailed Instructions
Shut down all VM's and then hosts to ensure system integrity
Testing Procedures
Back-out Procedures
N/A
Review
Schedule/Notification
From 5:00pm 12th of March to 8:30am 15th of March 2010, all times Australian Central Daylight Time
Tracking completion
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
| ChangeNotes | 30 Jun 2008 - 16:59 | ---+ Change Notes
Dynamic list of change notes for ChangeControl policy.
This does not show current state yet. Testing in ChangeNotesTest without much luck. Will probably split the search into an APPROVED/not-completed search and a non-APPROVED search.
Number of topics: 113
See: VarSEARCH, FormattedSearch, TablePlugin
- Hint: use
?raw=debug in URL to see raw topic text including metadata.
-- DanielCox - 23 Jun 2008 |
Number of topics: 113
| ChangeNote200806-001 | 01 Jul 2008 - 11:30 | Testing for ChangeNotesTest - trying to work out why WorkflowPlugin variables are not being expanded correctly in search results.
WF:
ABC: %ABC%
Change Control
This document used for ARCS ChangeControl Policy. The contents of this section comes from ChangeNoteInclude.
State: -
History:
|
Number of topics: 1
Hint: use ?raw=debug in URL to see raw topic text including metadata.
Try form label field?
-- DanielCox - 30 Jun 2008