pull down to refresh
Inference jobs will go to the most appropriate device (some combination of lowest time-to-first-token / highest tokens-per-second / best reputation etc.) but we're focusing first on distributed training runs: going to try giving pieces of training work to all possible devices that can run them, but on't know for sure which devices will be able to realistically contribute good work to the training run until we start gathering live data from our Pylon network next week as we start pushing our training code live
Will share more data in our territory as we get it
reply
How is the job assigned ?
Is it like a random thing or based on token response speed?