IB/hfi1: Fix yield logic in send engine

xiaomi-sm8450/android_kernel_xiaomi_sm8450

When there are many RC QPs and an RDMA READ request
is sent, timeouts occur on the requester side because
of fairness among RC QPs on their relative SDMA engine
on the responder side.  This also hits write and send, but
to a lesser extent.

Complicating the issue is that the current code checks if workqueue
is congested before scheduling other QPs, however, this
check is based on the number of active entries in the
workqueue, which was found to be too big to for
workqueue_congested() to be effective.

Fix by reducing the number of active entries as revealed by
experimentation from the default of num_sdma to
HFI1_MAX_ACTIVE_WORKQUEUE_ENTRIES.  Retry counts were monitored
to determine the correct value.

Tracing to investigate any future issues is also added.

Reviewed-by: Mike Marciniszyn <mike.marciniszyn@intel.com>
Signed-off-by: Sebastian Sanchez <sebastian.sanchez@intel.com>
Signed-off-by: Dennis Dalessandro <dennis.dalessandro@intel.com>
Signed-off-by: Doug Ledford <dledford@redhat.com>

Šī revīzija ir iekļauta:

Mike Marciniszyn

2017-05-04 05:14:10 -07:00

revīziju iesūtīja

Doug Ledford

vecāks 688f21c0be

revīzija dd1ed10817

4 mainīti faili ar 90 papildinājumiem un 31 dzēšanām

									
										4

drivers/infiniband/hw/hfi1/verbs.h
									
												Parādīt failu
												
				@@ -139,6 +139,10 @@ struct hfi1_pkt_state {

					struct hfi1_pportdata *ppd;

					struct verbs_txreq *s_txreq;

					unsigned long flags;

					unsigned long timeout;

					unsigned long timeout_int;

					int cpu;

					bool in_thread;

				};

				#define HFI1_PSN_CREDIT  16

IB/hfi1: Fix yield logic in send engine

4 drivers/infiniband/hw/hfi1/verbs.h Atkodēt Kodēt Parādīt failu

4

drivers/infiniband/hw/hfi1/verbs.h

Parādīt failu