Document entry

Title	CMS experience of running glideinWMS in High Availability mode
Authors	I. Sfiligoi, J. Letts, S. Belforte, A. McCrea, K. Larson, M. Zvada, B. Holzman, P. Mhashilkar, D. C. Bradley, M. D. Saiz Santos, F. Fanzago, O. Gutsche, T. Martin, F. Würthwein
Abstract	The CMS experiment at the Large Hadron Collider relies on the HTCondor-based glideinWMS batch system to handle most of its distributed computing needs. To minimize the risk of disruptions due to software and hardware problems, and to simplify maintenance procedures, CMS has set up its glideinWMS instance to use most of the available High Availability (HA) features. The setup runs services distributed over multiple nodes, which in turn are located in several physical locations, including Geneva (Switzerland), Chicago (Illinois, USA) and San Diego (California, USA). This paper describes the setup used by CMS, the HA limits of this setup, and the actual operational experience gathered over many months.
Publisher	IOP Publishing
Type	Conference Paper
Confidentiality	PUBLIC
Tags	document wg2 workload_provisioning computing_architecture
Date	01 Jan 2014
URL	https://iopscience.iop.org/article/10.1088/1742-6596/513/3/032086
DOI	10.1088/1742-6596/513/3/032086
Relevant for	WG2
Added by	Tommaso Boccali
Notes
