Prioritization

Automatic Prioritization

Motivation

Different processing workflows or the processing of different datasets can have different importance. We should support this on the fine enough level.

iBrain2's current solution

iBrain2 has a very simple prioritization solution. Processes have a priority value which is set 5 by default, but can be changed via rest. The process with higher priority value will be scheduled earlier.

Possible improvements

We can define priority values for each important factors of the submission process:

User
Workflow
Dataset (or maybe more fine grained, assay, project, dataset,…)

Of coures each item can have a default priority.
The priority values can be additive, so the priority of the running proces would be the sum of the above listed priorities.

NOTES

Prioritization should affect the priority of the submitted cluster job too
As the dataset related information most probably would come from the storage this would mean the priority of the datasets would be stored there. (as an optional attribuite of the dataset)
Maybe the prioritization policy should be also configurable (maybe specialized implementation fits better in some cases).
There was an idea that the priority of the process should also be dependent on the size of the input data to prioritize the shorter processing. This could be also feasible, but much complicated. What's the size of the dataset (files or bytes)? It isn't too expensive to compute the size of the dataset? How to implement this policy not to lead to starvation of large datasets (with some randomness)?

Allow manual prioritization

Allow setting initial priority for each processing
Allow dynamic changing of priority for each processing

Prioritize processing by Ressource Bottlenecks

Prefer processings for which more datasets are already staged
Prefer processing for which smaller amounts of data need to be staged
Prefer processings after which larger amounts of data can be deleted
Prefer processings of workflows with fewer remaining steps
Prefer processings which are quicker

screeningBee Data Analysis Tools

Sidebar

Table of Contents

Prioritization

Automatic Prioritization

Motivation

iBrain2's current solution

Possible improvements

NOTES

Allow manual prioritization

Prioritize processing by Ressource Bottlenecks

screeningBee Data Analysis Tools

User Tools

Site Tools

Sidebar

Table of Contents

Prioritization

Automatic Prioritization

Motivation

iBrain2's current solution

Possible improvements

NOTES

Allow manual prioritization

Prioritize processing by Ressource Bottlenecks

Page Tools