Welcome to the Biomed support team wiki pages. These pages are dedicated to the Biomed technical support team. They provide organisational information as to duty shifts, and technical information about common support tasks as well as best practices.

Aug. 2017: A biomed ARGO Mon instance replaces the biomed Nagios box, which is no longer supported. The new instance is available here: https://argo-mon-biomed.cro-ngi.hr/nagios

For the record, the page on Nagios configuration is still available here.

Former news.

See this wiki page.

Mailing list: biomed-technical-support [no spam-AT] googlegroups [no spam-DOT] com

The participants are listed in the order of the shifts:

  • CNRS-I3S, FR (Franck Michel)
  • CNRS-Creatis, FR (Sorina Pop)
  • BME-IIT, HU (Ákos Szlavecz, backup: Gábor Hesz)
  • INSERM, FR (Gilles Mathieu)

Past (teams that contributed in the past, but had to leave us)

  • CNRS-IPHC, FR (Patrick Guterl)
  • CNRS-Creatis, FR (Tristan Glatard)
  • CNRS-LPC, FR (Paul de Vlieger)
  • CNRS-ISC-PIF, FR (Romain Reuillon)
  • INFN-BA, Libi, Bari, IT (Giacinto Donvito)
  • IsraGrid (Arad Alper)
  • UPV, ES (Abel Antonio Carrión)
  • IFI - Institut de la Francophonie pour l'Informatique, VN (Bui The Quang)

Start date    End date      Team on duty
03-08-2020    14-08-2020    I3S
17-08-2020    28-08-2020    INSERM
31-08-2020    11-09-2020    CREATIS
14-09-2020    25-09-2020    BME-IIT
28-09-2020    09-10-2020    I3S
12-10-2020    23-10-2020    INSERM

CNRS-I3S, CNRS-IPHC, CNRS-Creatis, BME-IIT, UPV

See the past schedule.

Minutes of the shift take-over conferences.

When an SE is planned for decommissioning, launch the specific SE decommissioning procedure.

The following monitoring tools are currently under development or being tested. They can give information on the VO status, but they cannot yet be assumed fully reliable. In any case, problems should be reproduced manually before a GGUS ticket is submitted.

Note: ARGO probes are launched using a robot certificate with DN /DC=EU/DC=EGI/C=HR/O=Robots/O=SRCE/CN=Robot:argo-egi@cro-ngi.hr, which must be a Biomed member. We need to check regularly that the membership has not expired and, if it has, sign the Biomed AUP on behalf of the user (this is possible as VOMS admin) on https://cclcgvomsli01.in2p3.fr:8443/voms/biomed/ (last renewed early March 2020).

VO Support Tools: a collection of CLI tools, some of which are integrated into VAPOR.

VAPOR provides several operational features related to the monitoring of computing and storage resources (status of biomed resources, unavailable and faulty resources, etc.). It is complementary to Nagios.

This list is consolidated from two sources:

  • the GOCDB provides the statuses "downtime", "not in production" and "not monitored", as well as "site uncertified",
  • the BDII provides the statuses "draining", "closed", "unknown", etc.

VO Admin Dashboard: this portal provides an integrated view over several other portals based on the selected VO. Integrated portals: Ops Portal, CIC, GGUS, GStat, GOCDB, VOMS Admin, Apps DB, RT. It is still under development; Nagios should later be included in that view.

Top BDII:
Make sure the lcg-infosites tool used to query the BDII has a version later than 2.6.9, which comes with gLite 3.2.
To check the installed version, use the command: rpm -qa | grep infosites
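
The version check can also be scripted. The sketch below is only illustrative: it assumes that the RPM package is named lcg-infosites and that GNU sort with the -V option is available. The helper version_gt is a hypothetical name introduced here, not part of any gLite tool.

```shell
#!/bin/bash
# Return success (0) when version $1 is strictly greater than version $2.
# Relies on the -V (version sort) option of GNU sort.
version_gt() {
    [ "$1" != "$2" ] && \
        [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | tail -n 1)" = "$1" ]
}

required="2.6.9"

if command -v rpm >/dev/null 2>&1; then
    # Assumption: the package name is "lcg-infosites".
    installed=$(rpm -q --queryformat '%{VERSION}' lcg-infosites 2>/dev/null) || installed=""
    if [ -z "$installed" ]; then
        echo "lcg-infosites does not appear to be installed as an RPM" >&2
    elif version_gt "$installed" "$required"; then
        echo "lcg-infosites $installed is newer than $required"
    else
        echo "lcg-infosites $installed is not newer than $required" >&2
    fi
else
    echo "rpm command not found, cannot check the lcg-infosites version" >&2
fi
```

A plain string comparison would wrongly rank 2.6.10 below 2.6.9, which is why the sketch uses a version-aware sort.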

For Biomed, it is advised to use the top BDII at IN2P3:
export LCG_GFAL_INFOSYS=cclcgtopbdii02.in2p3.fr:2170
The script below may help you detect inconsistencies between top BDIIs:

#!/bin/bash
# This script compares the list of BIOMED SEs returned by 2 top BDIIs: CERN and IN2P3
lcg-infosites --vo biomed --is cclcgtopbdii02.in2p3.fr se | cut -f2 | sort > /tmp/list_se_bdi_in2p3
lcg-infosites --vo biomed --is lcg-bdii.cern.ch se | cut -f2 | sort > /tmp/list_se_bdi_cern
diff /tmp/list_se_bdi_in2p3 /tmp/list_se_bdi_cern
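
The diff output of the script above can be hard to read when the lists differ in many places. As a possible complement, the sketch below uses comm to print the SEs that appear in only one of the two lists. It assumes that the two files written by the script above already exist and are sorted; the helper only_in_first is a hypothetical name introduced for this example.

```shell
#!/bin/bash
# Print the lines that appear in the first sorted file but not in the second.
only_in_first() {
    comm -23 "$1" "$2"
}

# Files written by the comparison script above.
in2p3_list=/tmp/list_se_bdi_in2p3
cern_list=/tmp/list_se_bdi_cern

if [ -r "$in2p3_list" ] && [ -r "$cern_list" ]; then
    echo "SEs published only by the IN2P3 top BDII:"
    only_in_first "$in2p3_list" "$cern_list"
    echo "SEs published only by the CERN top BDII:"
    only_in_first "$cern_list" "$in2p3_list"
else
    echo "Run the comparison script first to create the two list files." >&2
fi
```

As stated in the monitoring section, any SE reported as missing from one top BDII should still be checked manually before a GGUS ticket is submitted.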

Team coordination usual tasks: this page describes the coordination of the support team and the VO management tasks.

  • Last modified: 2020/09/01 16:30
  • by sorina