Compute¶

The compute command ask the web service for available jobs that need to be run. To run some jobs you need to type in the terminal:

ceibacli compute -i input_compute.yml

Where the input_compute.yml is an file in YAML format containing the Compute Input File metadata.

The compute command takes the user’s input, request some available job and Job Scheduling those jobs using the information provided by the user.

Compute Input File¶

The input file contains the following mandatory keywords:

# Web service URL
web: "http://YourCeibaInstance:8080/graphql"

# Name of the collection to compute
collection_name: "simulation_name"

# Command use to run the workflow
command: compute_properties

Other optional keywords are:

# Configuration of the job scheduler
scheduler:
   "none"


# Path to the directory where the calculations are going to run (default: workdir_ceibacli)
workdir:
   /path/to/workdir

# Number of jobs to request and run (default: 10)
max_jobs:
   5

Job Scheduling¶

Most of the scientific simulation are usually perform in supercomputers that use a job scheduler. ceiba-cli supports two of the most popular ones: SLURM. If you choose a scheduler different from none, ceiba-cli will automatically contact the job scheduler with the options that you have provided. Below you can find a description of the available options:

# Job scheduler. Of of "none" or "slurm" (default: none)
scheduler:
   slurm

# Number of computing nodes to request (default: 1)
nodes:
   1

# Number of CPUs per task (default: None)
cpus_per_task:
   48

# Total time to request ind "days:hours:minutes" format (default: 1day)
walltime:
  "01:00:00"

# Partion name (queue's name) where the job is going to run (default: None)
partion_name:
  "short"

You can alternatively provide a string with all the options for the queue system like,

scheduler:
  slurm

# String with user's Configuration
free_format: "#!/bin/bash
#SBATCH -N 1
#SBATCH -t 00:15:00
....
"

Job State¶

The user’s requested jobs are initially marked as RESERVERED, in the web service to avoid conflicts with other users. Then, if the jobs are sucessfully scheduled they are marked as RUNNING. If there is a problem during the scheduling or subsequent running step the job would be marked as FAILED.