drm/scheduler: Add flag to hint the release of guilty job.

Problem:
Sched thread's cleanup function races against TO handler
and removes the guilty job from mirror list and we
have no way of differentiating if the job was removed from within the
TO handler or from the sched thread's clean-up function.

Fix:
Add a flag to scheduler to hint the TO handler that the guilty job needs
to be explicitly released.

v2: whitespace fix

Reviewed-by: Christian König <christian.koenig@amd.com>
Signed-off-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Link: https://patchwork.freedesktop.org/patch/msgid/1555599624-12285-5-git-send-email-andrey.grodzovsky@amd.com
This commit is contained in:
Andrey Grodzovsky
2019-04-18 11:00:23 -04:00
committed by Alex Deucher
parent 290764af7e
commit a5343b8a2c
2 changed files with 9 additions and 2 deletions

View File

@@ -293,8 +293,10 @@ static void drm_sched_job_timedout(struct work_struct *work)
* Guilty job did complete and hence needs to be manually removed
* See drm_sched_stop doc.
*/
if (list_empty(&job->node))
if (sched->free_guilty) {
job->sched->ops->free_job(job);
sched->free_guilty = false;
}
spin_lock_irqsave(&sched->job_list_lock, flags);
drm_sched_start_timeout(sched);
@@ -395,10 +397,13 @@ void drm_sched_stop(struct drm_gpu_scheduler *sched, struct drm_sched_job *bad)
/*
* We must keep bad job alive for later use during
* recovery by some of the drivers
* recovery by some of the drivers but leave a hint
* that the guilty job must be released.
*/
if (bad != s_job)
sched->ops->free_job(s_job);
else
sched->free_guilty = true;
}
}