Information for Group Managers

A group manager is a person authorized to manage group course or private cluster course.
By default, the group manager is set to the service course applicant. The applicant of the service course can add/change the group manager. Please apply for add/change via User Portal.

A group manager can use the commands for managing the group. The dedicated commands of group manager allow group managers to manage queues and disks allocated to their groups and group members.

Group members can check or add/delete members by logging into the User Portal.

You can make backup settings of the LARGE disk space by using the group_backup command. LARGE disk space consists of the /LARGE0/groupname directory and the /LARGE1/groupname directory, and you can set one of the following status.

Status /LARGE0/groupname /LARGE1/groupname
Backup Safe(Make backup) Backup (Backup location)
Not Backup Unsafe(Not make backup) Unsafe(Not make backup)

Of these, disks whose settings are Safe or Unsafe can be used.

Checking the backup settings

The target group can be specified with the -g option. If omitted, the current group when the command is executed is targeted.

$ group_backup -g gr19999 -l 
 Num  Filesystem             Status  Filesystem             Status
  1)  /LARGE0/gr19999  ... Safe    /LARGE1/gr19999    ... Backup <- バックアップ使用状態

Setting the status to “Not Backup”

$ group_backup -g gr19999 --unsafe 1
/LARGE0/gr19999: Safe   => UnSafe
/LARGE1/gr19999: Backup => UnSafe

Checking the backup settings(after changes)

$ group_backup -g gr19999 -l
 Num  Filesystem             Status  Filesystem             Status
  1)  /LARGE0/gr19999    ... UnSafe  /LARGE1/gr19999    ... UnSafe  <- バックアップ未使用状態

Return to the status to Backup

$ group_backup -g gr19999 --safe 1
/LARGE0/gr19999: Unsafe => Safe
/LARGE1/gr19999: Unsafe => Backup

The group_trash command allows users to delete files(move to trash) in the LARGE disk space. It can delete the data files of users who are no longer enrolled due to graduation, etc. If you accidentally delete a file, you can recover it from the trash, but please note that the trash is emptied every Monday.

Deleting files by the group_trash command

Specify the target group with the -g option. If omitted, the group manager authority are determined by the current group when the command is executed.

$ group_trash -g gr19999 /LARGE0/gr19999/file1 
file1 to Trash (/LARGE0/gr19999/.DpcTrash/b59999/2009-04-10_1010)

There are two types of units of management of Slurm queue privileges: users and groups. Users are empty by default, Groups are initially registered with the group corresponding to the queue name as the default setting.

If you wish to use a queue with multiple groups, or if you wish to grant queue access to a single user who does not belong to a group, please contact us using the Inquiry Form.

You can select the job scheduling policy from the following three types. If you are using an individual queue (grXXXXXx), you can change your preferred scheduling policy by contacting us from the applicant at Inquiry Form. (We cannot accept requests for shared queues such as entry course or personal course.)

Settings Operation
pass If there are sufficient computing resources to execute a job, it will overtake jobs waiting to be executed that are in line before it.
You can use computing resources efficiently, but large jobs may not be executed indefinitely.【Default Value】
wait It will not overtake jobs even if there is a enough computing resources.
backfill Based on the calculations of the execution time limit (-t) of each job, it will overtake jobs only if it does not affect the execution start time of other jobs. For example, you can use resources effectively by executing small jobs that can complete execution before a large job is started.

You can confirm the job execution status of queues registered as a group manager and cancel the jobs with the spadmin command.

  • Confirmation of job execution status

    $ spadmin list -p gr19999b ## Please change the "gr19999b" part to the queue name you wish to confirm.
             JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)
              4781  gr19999b run_cpu2   b59999  R    1:26:09      1 nb0001
  • Cancellation of the job

    $ spadmin cancel 123
    scancel: Terminating job 123

If anyone other than the group manager execute the spadmin command, the following error message will be displayed.

$ spadmin list -p gr19999g
Authorization Failure