Differences
This shows you the differences between two versions of the page.
biomed-shifts:practices [2021/10/13 12:01] – [Identify the problems] sorina | biomed-shifts:practices [2022/05/19 14:32] (current) – [Reproduce the problem] sorina | ||
---|---|---|---|
Line 57: | Line 57: | ||
====== | ====== | ||
+ | Link to Biomed ARGO page: [[https:// | ||
===== VOMS server | ===== VOMS server | ||
The proxy certificate creation should work: | The proxy certificate creation should work: | ||
Line 63: | Line 64: | ||
The VOMS administration interface should be available. From a UI, run the command: | The VOMS administration interface should be available. From a UI, run the command: | ||
< | < | ||
- | |||
- | ===== LFC server | ||
- | Command "'' | ||
===== Monitoring SEs ===== | ===== Monitoring SEs ===== | ||
Line 72: | Line 70: | ||
SRM probes used by ARGO box | SRM probes used by ARGO box | ||
- | - https:// | + | |
- | - based on the gfal2 library for the storage operations (gfal-copy, etc) | + | |
- | - queries the BDII service in order to build the Storage URL to test given the host-name and the VO name | + | - based on the gfal2 library for the storage operations (gfal-copy, etc) |
- | - a valid X509 proxy certificate is needed to execute the probe (configured via the X509_USER_PROXY variable). | + | - queries the BDII service in order to build the Storage URL to test given the host-name and the VO name |
+ | - a valid X509 proxy certificate is needed to execute the probe (configured via the X509_USER_PROXY variable). | ||
Line 81: | Line 80: | ||
==== Reproduce the problem | ==== Reproduce the problem | ||
- | < | ||
- | The [[https://github.com/frmichel/vo-support-tools/blob/master/SE/lcg-cr.sh|lcg-cr.sh]] script automates | + | Manual SRM testing (copy file to SE) |
+ | |||
+ | From the biomed-ui.fedcloud.fr VM, where gfal2 is already installed:
+ | |||
+ | 1. Build the Storage URL following the model < | ||
+ | |||
+ | NOTE 1: the model works for DPM SEs; it has not been verified for StoRM or dCache (a StoRM example is srm://// |
+ | |||
+ | NOTE 2: it would be interesting to use the probe to build this URL
+ | |||
+ | 2. Use gfal-ls to check that we can list the folder | ||
+ | < | ||
+ | 3. Use gfal-copy to copy a file (in this case, job.jdl) to the above URL | ||
+ | < | ||
+ | Copying file:/// | ||
+ | 4. Check that the file was copied
+ | < | ||
+ | job.jdl </ | ||
+ | |||
+ | Note that in some cases, gfal-ls may work (as well as gfal-mkdir), |
+ | < | ||
+ | gfal-ls srm:// | ||
+ | gfal-copy dirac/ | ||
+ | gfal-copy error: 70 (Communication error on send) - Could not open destination: | ||
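The manual steps above (list, copy, list again) could be wrapped in a small shell helper so that a shifter sees at once which step fails, for example when gfal-ls succeeds but gfal-copy does not. This is only a minimal sketch: the names `srm_smoke_test`, `run_step` and the `DRY_RUN` variable are hypothetical and not part of gfal2 or of the ARGO probe. The Storage URL is passed in by the caller, since the URL model is not reproduced here, and copying to an explicit `<Storage URL>/<file name>` target is an assumption of this sketch.

```shell
#!/bin/sh
# Sketch of the manual SRM test: list the folder, copy a file, list the copy.
# Assumes gfal-ls and gfal-copy are in PATH and a valid proxy is configured.

# run_step NAME CMD...: print the command, then run it unless DRY_RUN=1.
run_step() {
    step_name=$1
    shift
    echo "[$step_name] $*"
    if [ "${DRY_RUN:-0}" = "1" ]; then
        return 0
    fi
    "$@" || { echo "FAILED at step: $step_name" >&2; return 1; }
}

# srm_smoke_test SURL LOCAL_FILE
# SURL is the Storage URL of the target folder, built by the caller.
srm_smoke_test() {
    surl=$1
    local_file=$2
    if [ -z "$surl" ] || [ -z "$local_file" ]; then
        echo "usage: srm_smoke_test SURL LOCAL_FILE" >&2
        return 2
    fi
    # Assumption: the copy goes to <SURL>/<file name>.
    target="${surl%/}/$(basename "$local_file")"
    run_step list gfal-ls "$surl" || return 1
    run_step copy gfal-copy "$local_file" "$target" || return 1
    run_step verify gfal-ls "$target" || return 1
}
```

After sourcing the file, `DRY_RUN=1 srm_smoke_test <Storage URL> job.jdl` prints the three commands without contacting the SE; without `DRY_RUN`, the first failing step is reported on stderr, which helps tell a listing problem from a write problem.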
==== Ignored alarms ==== | ==== Ignored alarms ==== | ||
Line 99: | Line 120: | ||
When a SE is planned for decommissioning, | When a SE is planned for decommissioning, | ||
+ | An older decommissioning page is available here: [[Biomed-Shifts: |
===== Monitoring CEs ===== | ===== Monitoring CEs ===== | ||
==== Identify the problems | ==== Identify the problems | ||
- | The ARGO box is the best way to identify faulty resources. | + | The ARGO box is the best way to identify faulty resources. |
+ | ==== | ||
- | Probes documentation is available at https:// | + | 1. Manual ARC CE submission |
- | ==== Reproduce the problem | + | - see https:// |
- | Reproduce the problem by one of the two methods below. | + | |
+ | - submit with " | ||
+ | |||
+ | Further ARC CE documentation is available in French: https:// |
- | Download this {{:biomed-shifts:test.jdl|test JDL}} (or {{:biomed-shifts: | + | and DIRAC: https://grand-est.fr/support-utilisateurs/ |
- | < | + | |
- | Then check that the status and the output when the submit command has completed: | + | |
- | < | + | |
- | Reminder: __before submitting a ticket make sure one is not open yet__. | + | 2. Manual HTCondorCE submission |
+ | TO BE DONE | ||
==== Ignored alarms ==== | ==== Ignored alarms ==== | ||
Shifters shall focus on failed job submissions in priority: probes '' | Shifters shall focus on failed job submissions in priority: probes '' |