Wednesday, April 27, 2011

Brief Network Outage GOC Services at Indianapolis

Today beginning at 2:30PM EDT network connectivity was lost for a period of approximately 13 minutes. This network outage caused all OSG services based solely at the Indianapolis Campus to be unavailable. This includes OIM and the OSG TWiki. Network connectivity was returned to normal at 2:43PM EDT. Services hosted at both campuses or solely on the Bloomington campus were not affected.

Tuesday, April 19, 2011

ESNet Root CA Service Issue

As some of you may have noticed, the ESNet-Root-CA has had service problems since yesterday (April 18, 2011). The CRL file published by the ESNet Root CA (not the DOEGrids CA) was not available at it's expected coordinates. The ESnet Operations Team has notified us that they put a fix in place to redirect to the new location. OSG Security along with a few OSG Site Administrators have tested the fix successfully. Please let us know if you still have any connection problems to the CRL server.

Please also remember that even if you receive (or had already received) error/failure messages, this situation will NOT affect your site's availability. The CRL file has a lifetime of a year and if you ever have downloaded a copy previously, you will continue your operations as usual. The error messages on RSV probes also does not affect your site availability metrics.

We are also aware that this problem may have affected WLCG users access to CERN Single-Sign-On service via their certificates. We believe the fix in place may take care of this problem. However, we have not verified this with CERN IT staff. We will inform you when we hear from them.

Wednesday, April 6, 2011

DOEGrids Maintenance Notice - April 13 - 19:00 - 23:00 UTC

DOEGrids CA services will undergo planned maintenance on Wednesday, April 13, 2011 between 19:00 - 23:00 UTC. CRLs will not be affected. However, all other services on pki1.doegrids.org will be affected.

OSG users access to the grid resources will continue as normal. However, requests for new certificates and renewal of expiring certificates will not be processed during maintenance.


GOC Service Update - Tuesday, April 12th at 14:00 UTC

The GOC will upgrade the following services beginning at Tuesday, April 12th, 2010 at 14:00 UTC. The GOC reserves four hours (14:00 - 18:00 UTC) in the unlikely event that unexpected problems are encountered. We encourage users to test affected services before the production release.

OIM 2.32 (https://oim.grid.iu.edu)

ITB version is now available for testing at https://oim-itb.grid.iu.edu
Made Site Form’s State field optional
Installed Capacity Report Script / Updated to include VO ownership information for more detail please see https://ticket.grid.iu.edu/goc/viewer?id=10131

MyOSG / Gip Validator Consolidator

Made changes to the top level wlcg bdii monitor script so that BDII entries are loaded one at a time in order to minimize timeout issue possibly caused during low throughput events of IU network.

Monday, April 4, 2011

DOEGrids Services Outage - Update - GOC Ticket # 10187

The DOEGrids services were restored late Friday evening. We have received confirmation that they have resumed normal operations as of Monday morning.

If you tried to submit a new certificate request or renew your certificate on Friday and received an error, please resubmit your request now.

Please see ticket 10187 at:
https://ticket.grid.iu.edu/goc/viewer?id=10187

Friday, April 1, 2011

DOEGrids Services Outage - Update - GOC Ticket # 10187

OSG Operations has been informed the DOEGrid CA Services will remain offline for an unspecified period of time, this is likely to last through the weekend of April 2nd and 3rd. During this time all requests for new certificates and certificate renewals will not be processed by DOEGrids. We will alert you on Monday April 4th how users can obtain new certificates or renew their existing certificates. We are currently working to find a back up service.

This downtime will NOT affect the grid access for the users who already have unexpired certificates. OSG Resources will continue providing access and working as usual. Resources will NOT experience any critical RSV failures due to this down time (because the CA certificate and CRL files are still available). We will continue updating you as the situation develops. If the downtime continues over 72 hours, sites may experience a NON-CRITICAL failure of RSV probes. We will inform you promptly if that happens.

Please see ticket 10187 at:
https://ticket.grid.iu.edu/goc/viewer?id=10187

DOEGrid services unavailable - Friday April 1, 2011 - GOC Ticket # 10187

DOEGrids Agent Services (https://pki1.doegrids.org:8100/ca/) is currently unavailable. A CA administrator is investigating the root cause of the issue. No further information is available as to when the service will restored.


Please see ticket 10187 at:
https://ticket.grid.iu.edu/goc/viewer?id=10187