General information

GEANT TCS certificate service interruption

  • At the beginning of January 2025 it will not be possible to request/renew GEANT TCS certificates any longer
  • Please renew all the host/personal TCS certificate in the coming few weeks
  • New solutions are under investigations, but finalising them will take time.

Middleware


UMD

  • UMD5 released: https://repository.egi.eu/umd/distribution.html?id=5#5
    • APEL 2.1.0, APEL SSM 3.4.1
    • Arc 6.20.1
    • BDII 6.0.3, 
    • WN 5.1.0
    • UI 7.0.0
    • Dcache 9.2.25
    • Gfal2 2.23.0
    • Frontier-squid 5.9.2
    • Voms 2.1.0, voms-api 3.3.3, voms-client-java 3.3.3, voms-client-cpp 2.1.0
    • xroot 5.7.1
    • htcondor-ce 23.0
    • cvmfs 2.11.5
    • config-egi 2.6.1
    • egi-cvmfs 6.7.28
    • Davix 0.8.7

Migration to EL9

Following PROC16 Decommissioning of unsupported software

Broadcast circulated in June.

Requested to enable the metric to detect CentOS7 endpoints:

The NGIs can open tickets against sites to track the migration

Operations

Accounting Repository

Pub/Sync system taken offline for a security issue. Accounting Repository operation unaffected, but Repository test is provided via the pub/sync hosts.

We receive weekly reports by email about the publication of the accounting records.

ARGO/SAM

  • Waiting for the new version of the HTCondorCE probe
    • for the moment the endpoints are tested with the host certificate validity metric
  • Several sites with HTCondorCE are failing the tests:
    • They still have HTCondor 9 (on CentOS 7) which doesn't work correctly with the new HTCondor client (v23) on EL9
    • Those sites are requested to upgrade to HTCondor 23.0.x as soon as possible
  • Monitoring issue with ARC-CE 6.20.1 version

FedCloud

A/R numbers report sent on 5th/Dec. had to be re-calculated. There are still some issues now being fixed by Emir (ARGO).

Feedback from DMSU

From July 1st the second level support is provided by UKIM:

  • the partner representing the Macedonian Academic Research Grid Initiative (MARGI) in the EGI Council, is now a full member of the EGI Federation

New Known Error Database (KEDB)

The KEDB has been moved to Jira+Confluence: https://confluence.egi.eu/display/EGIKEDB/EGI+Federation+KEDB+Home

  • problems are tracked with Jira tickets to better follow-up their evolution
  • problems can be registered by DMSU staff and EGI Operations team

Monthly Availability/Reliability

Under-performed sites in the past A/R reports with issues not yet fixed:

Under-performed sites after 3 consecutive months, under-performed NGIs, QoS violations: (November 2024):

sites suspended: 

  • GR-07-UOI-HEPLAB (NGI_GRNET)

Using YAIM to configure Site and Top BDII on EL9

IPv6 readiness plans

VOMS upgrade campaign to EL9

  • VOMS released on EL9:
  • The sites can now upgrade their VOMS endpoints to either to EL8 or EL9
  • EL9 package was also released in UMD5
  • Optionally you could keep the current server to work as the database (not exposed to the outside), while you expose externally the new server with voms and voms-admin
    • This should shorten the downtime when doing the switch 
  • Note: it was noticed a dependency of voms-admin on Python 2 that makes it difficulty the installation on EL9 (EL9 removed the support to python 2)
    • the voms team is working to fix this
    • as alternative, the sites can install voms-admin on EL8 where Python 2 is still supported 

Currently there are 28 VOMS endpoints in production. We are also starting to decommission about 100 inactive VOs, so the number of VOMS endpoints could also decrease.

Tickets tracked here: 2024 VOMS upgrade campaign

StoRM upgrade campaign to EL9

  • INFN is working to release StoRM on EL9
    • StoRM WebDAV v1.4.2 (the latest released on CentOS 7) is available also for el9 in their stable repository
    • The other components will be soon ready
  • 31 StoRM endpoints published in the BDII
  • We can track the migration in 2024 StoRM upgrade campaign

New benchmark HEPscore23

The benchmark HEPscore23 is replacing the old Hep-SPEC06

Recent activities:

  • APEL client 2.1.0 released and included in UMD 5
  • Testing ongoing, with data sent from some sites to the accounting repository and published into the staging accounting portal
  • Last week the Accounting Repository was upgrade to the new version supporting the new benchmark.

HEPSCORE application:

WLCG Operations Coordination meeting (Oct 2024)

New helpdesk

  • Pilot production instance was released in October
  • The new GGUS implementation is based on Zammad
  • You can login and explore the new look
    • the supporter role that you have in the old GGUS will be assigned to you automatically after a few days from the first login
  • First Steps Guide for New GGUS Users ( start here)
  • Test emails to all support units will be sent
  • The current GGUS implementation will be put in read-only mode on Feb 1st
  • In January all the open tickets will be imported by the new helpdesk implementation
    • a downtime will be required
  • All SUs should use the new helpdesk by the end of Janaury (you can already start)

AOB

Next meeting

January

  • No labels