diff mbox

[v4,6/6] drm/i915: Add skl_check_nv12_surface for NV12

Message ID 6142ae0b-a9da-f553-2726-632d44d83168@linux.intel.com (mailing list archive)
State New, archived
Headers show

Commit Message

Maarten Lankhorst April 19, 2018, 11:30 a.m. UTC
Op 19-04-18 om 13:22 schreef Ville Syrjälä:
> On Thu, Apr 19, 2018 at 02:36:42AM +0000, Srinivas, Vidya wrote:
>>
>>
>>
>>> -----Original Message-----
>>> From: Ville Syrjälä [mailto:ville.syrjala@linux.intel.com]
>>> Sent: Thursday, April 19, 2018 12:06 AM
>>> To: Maarten Lankhorst <maarten.lankhorst@linux.intel.com>
>>> Cc: Srinivas, Vidya <vidya.srinivas@intel.com>; intel-
>>> gfx@lists.freedesktop.org
>>> Subject: Re: [Intel-gfx] [PATCH v4 6/6] drm/i915: Add
>>> skl_check_nv12_surface for NV12
>>> On Wed, Apr 18, 2018 at 08:06:57PM +0200, Maarten Lankhorst wrote:
>>>> Op 18-04-18 om 17:32 schreef Ville Syrjälä:
>>>>> On Wed, Apr 18, 2018 at 09:38:13AM +0530, Vidya Srinivas wrote:
>>>>>> From: Maarten Lankhorst <maarten.lankhorst@linux.intel.com<mailto:maarten.lankhorst@linux.intel.com>>
>>>>>> We skip src trunction/adjustments for
>>>>>> NV12 case and handle the sizes directly.
>>>>>> Without this, pipe fifo underruns are seen on APL/KBL.
>>>>>> v2: For NV12, making the src coordinates multiplier of 4
>>>>>> v3: Moving all the src coords handling code for NV12 to
>>>>>> skl_check_nv12_surface
>>>>>> Signed-off-by: Maarten Lankhorst
>>>>>> <maarten.lankhorst@linux.intel.com<mailto:maarten.lankhorst@linux.intel.com>>
>>>>>> Signed-off-by: Vidya Srinivas <vidya.srinivas@intel.com<mailto:vidya.srinivas@intel.com>>
>>>>>> ---
>>>>>>  drivers/gpu/drm/i915/intel_display.c | 39
>>>>>> ++++++++++++++++++++++++++++++++++++
>>>>>>  drivers/gpu/drm/i915/intel_sprite.c  | 15 ++++++++++----
>>>>>>  2 files changed, 50 insertions(+), 4 deletions(-)
>>>>>> diff --git a/drivers/gpu/drm/i915/intel_display.c
>>>>>> b/drivers/gpu/drm/i915/intel_display.c
>>>>>> index 925402e..b8dbaca 100644
>>>>>> --- a/drivers/gpu/drm/i915/intel_display.c
>>>>>> +++ b/drivers/gpu/drm/i915/intel_display.c
>>>>>> @@ -3118,6 +3118,42 @@ static int skl_check_main_surface(const
>>> struct intel_crtc_state *crtc_state,
>>>>>>  return 0;
>>>>>>  }
>>>>>> +static int
>>>>>> +skl_check_nv12_surface(const struct intel_crtc_state *crtc_state,
>>>>>> +                                       struct intel_plane_state *plane_state) {
>>>>>> +                int crtc_x2 = plane_state->base.crtc_x + plane_state->base.crtc_w;
>>>>>> +                int crtc_y2 = plane_state->base.crtc_y +
>>>>>> +plane_state->base.crtc_h;
>>>>>> +
>>>>>> +                if (((plane_state->base.src_x >> 16) % 4) != 0 ||
>>>>>> +                    ((plane_state->base.src_y >> 16) % 4) != 0 ||
>>>>>> +                    ((plane_state->base.src_w >> 16) % 4) != 0 ||
>>>>>> +                    ((plane_state->base.src_h >> 16) % 4) != 0) {
>>>>>> +                                DRM_DEBUG_KMS("src coords must be multiple of 4 for
>>> NV12\n");
>>>>>> +                                return -EINVAL;
>>>>>> +                }
>>>>> I don't really see why we should check these. The clipped
>>>>> coordinates are what matters.
>>>> To propagate our limits to the userspace. I think we should do it for
>>>> all formats, but NV12 is the first YUV format we have tests for. If we
>>>> could we should do something similar for the other YUV formats, but they
>>> have different requirements.
>>>> In case of NV12 we don't have existing userspace, there will be
>>>> nothing that breaks if we enforce limits from the start.
>>> But what about sub-pixel coordinates? You're totally ignoring them here.
>>> We need to come up with some proper rules for this stuff.
>>>>>> +
>>>>>> +                /* Clipping would cause a 1-3 pixel gap at the edge of the screen? */
>>>>>> +                if ((crtc_x2 > crtc_state->pipe_src_w && crtc_state->pipe_src_w %
>>> 4) ||
>>>>>> +                    (crtc_y2 > crtc_state->pipe_src_h && crtc_state->pipe_src_h % 4))
>>> {
>>>>>> +                                DRM_DEBUG_KMS("It's not possible to clip %u,%u to
>>> %u,%u\n",
>>>>>> +                                                      crtc_x2, crtc_y2,
>>>>>> +                                                      crtc_state->pipe_src_w, crtc_state->pipe_src_h);
>>>>>> +                                return -EINVAL;
>>>>>> +                }
>>>>> Why should we care? The current code already plays it fast and loose
>>>>> and allows the dst rectangle to shrink to accomodate the hw limits.
>>>>> If we want to change that we should change it universally.
>>>> Unfortunately for the other formats we already have an existing
>>>> userspace
>>>> (X.org) that doesn't perform any validation. We can't change it for
>>>> that, but we can prevent future mistakes.
>>> We should do it uniformly. Not per-format. That will make the code
>>> unmaintainable real quick.
>>>>>> +
>>>>>> +                plane_state->base.src.x1 =
>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.x1, 1 << 18) <<
>>> 18;
>>>>>> +                plane_state->base.src.x2 =
>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.x2, 1 << 18) <<
>>> 18;
>>>>>> +                plane_state->base.src.y1 =
>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.y1, 1 << 18) <<
>>> 18;
>>>>>> +                plane_state->base.src.y2 =
>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.y2, 1 << 18) <<
>>> 18;
>>>>> Since this can now increase the size of the source rectangle our
>>>>> scaling factor checks are no longer 100% valid. We might end up with
>>>>> a scaling factor that is too high.
>>>>> I don't really like any of these "let's make NV12 behave special"
>>>>> tricks. We should make the code behave the same way for all pixel
>>>>> formats instead of adding format specific hacks.
>>>> This is not nivalid because we restrict the original src coordinates
>>>> to be a multiple of 4, you can only clip to something smaller, not to
>>>> something bigger. :)
>>> The clipped coordinates can be whatever thanks to scaling/etc.
>>> Also why are we trying to make everything a multiple of four? I don't
>>> remember any hw restrictions like that.
>>
>>
>> Hi
>>
>>
>> As per WA1106, Display corruption/color shift observed when using NV12 with 270 rotation or 90 rotation + horizontal flip.
>>
>> WA: NV12 with 270 rotation or 90 rotation + horizontal flip requires the programmed plane height to be a multiple of 4.
> Does plane height here mean src height or dst height?
>
> Either way I don't see why we aren't just checking for the right thing
> instead of trying to mandate a four pixel alignment everywhere.
>
Agreed, what about the below diff, would this be acceptable to you? I deliberately ignore the last 16 bits as that is what we currently do anyway for all formats.

Comments

Ville Syrjälä April 19, 2018, 11:50 a.m. UTC | #1
On Thu, Apr 19, 2018 at 01:30:32PM +0200, Maarten Lankhorst wrote:
> Op 19-04-18 om 13:22 schreef Ville Syrjälä:
> > On Thu, Apr 19, 2018 at 02:36:42AM +0000, Srinivas, Vidya wrote:
> >>
> >>
> >>
> >>> -----Original Message-----
> >>> From: Ville Syrjälä [mailto:ville.syrjala@linux.intel.com]
> >>> Sent: Thursday, April 19, 2018 12:06 AM
> >>> To: Maarten Lankhorst <maarten.lankhorst@linux.intel.com>
> >>> Cc: Srinivas, Vidya <vidya.srinivas@intel.com>; intel-
> >>> gfx@lists.freedesktop.org
> >>> Subject: Re: [Intel-gfx] [PATCH v4 6/6] drm/i915: Add
> >>> skl_check_nv12_surface for NV12
> >>> On Wed, Apr 18, 2018 at 08:06:57PM +0200, Maarten Lankhorst wrote:
> >>>> Op 18-04-18 om 17:32 schreef Ville Syrjälä:
> >>>>> On Wed, Apr 18, 2018 at 09:38:13AM +0530, Vidya Srinivas wrote:
> >>>>>> From: Maarten Lankhorst <maarten.lankhorst@linux.intel.com<mailto:maarten.lankhorst@linux.intel.com>>
> >>>>>> We skip src trunction/adjustments for
> >>>>>> NV12 case and handle the sizes directly.
> >>>>>> Without this, pipe fifo underruns are seen on APL/KBL.
> >>>>>> v2: For NV12, making the src coordinates multiplier of 4
> >>>>>> v3: Moving all the src coords handling code for NV12 to
> >>>>>> skl_check_nv12_surface
> >>>>>> Signed-off-by: Maarten Lankhorst
> >>>>>> <maarten.lankhorst@linux.intel.com<mailto:maarten.lankhorst@linux.intel.com>>
> >>>>>> Signed-off-by: Vidya Srinivas <vidya.srinivas@intel.com<mailto:vidya.srinivas@intel.com>>
> >>>>>> ---
> >>>>>>  drivers/gpu/drm/i915/intel_display.c | 39
> >>>>>> ++++++++++++++++++++++++++++++++++++
> >>>>>>  drivers/gpu/drm/i915/intel_sprite.c  | 15 ++++++++++----
> >>>>>>  2 files changed, 50 insertions(+), 4 deletions(-)
> >>>>>> diff --git a/drivers/gpu/drm/i915/intel_display.c
> >>>>>> b/drivers/gpu/drm/i915/intel_display.c
> >>>>>> index 925402e..b8dbaca 100644
> >>>>>> --- a/drivers/gpu/drm/i915/intel_display.c
> >>>>>> +++ b/drivers/gpu/drm/i915/intel_display.c
> >>>>>> @@ -3118,6 +3118,42 @@ static int skl_check_main_surface(const
> >>> struct intel_crtc_state *crtc_state,
> >>>>>>  return 0;
> >>>>>>  }
> >>>>>> +static int
> >>>>>> +skl_check_nv12_surface(const struct intel_crtc_state *crtc_state,
> >>>>>> +                                       struct intel_plane_state *plane_state) {
> >>>>>> +                int crtc_x2 = plane_state->base.crtc_x + plane_state->base.crtc_w;
> >>>>>> +                int crtc_y2 = plane_state->base.crtc_y +
> >>>>>> +plane_state->base.crtc_h;
> >>>>>> +
> >>>>>> +                if (((plane_state->base.src_x >> 16) % 4) != 0 ||
> >>>>>> +                    ((plane_state->base.src_y >> 16) % 4) != 0 ||
> >>>>>> +                    ((plane_state->base.src_w >> 16) % 4) != 0 ||
> >>>>>> +                    ((plane_state->base.src_h >> 16) % 4) != 0) {
> >>>>>> +                                DRM_DEBUG_KMS("src coords must be multiple of 4 for
> >>> NV12\n");
> >>>>>> +                                return -EINVAL;
> >>>>>> +                }
> >>>>> I don't really see why we should check these. The clipped
> >>>>> coordinates are what matters.
> >>>> To propagate our limits to the userspace. I think we should do it for
> >>>> all formats, but NV12 is the first YUV format we have tests for. If we
> >>>> could we should do something similar for the other YUV formats, but they
> >>> have different requirements.
> >>>> In case of NV12 we don't have existing userspace, there will be
> >>>> nothing that breaks if we enforce limits from the start.
> >>> But what about sub-pixel coordinates? You're totally ignoring them here.
> >>> We need to come up with some proper rules for this stuff.
> >>>>>> +
> >>>>>> +                /* Clipping would cause a 1-3 pixel gap at the edge of the screen? */
> >>>>>> +                if ((crtc_x2 > crtc_state->pipe_src_w && crtc_state->pipe_src_w %
> >>> 4) ||
> >>>>>> +                    (crtc_y2 > crtc_state->pipe_src_h && crtc_state->pipe_src_h % 4))
> >>> {
> >>>>>> +                                DRM_DEBUG_KMS("It's not possible to clip %u,%u to
> >>> %u,%u\n",
> >>>>>> +                                                      crtc_x2, crtc_y2,
> >>>>>> +                                                      crtc_state->pipe_src_w, crtc_state->pipe_src_h);
> >>>>>> +                                return -EINVAL;
> >>>>>> +                }
> >>>>> Why should we care? The current code already plays it fast and loose
> >>>>> and allows the dst rectangle to shrink to accomodate the hw limits.
> >>>>> If we want to change that we should change it universally.
> >>>> Unfortunately for the other formats we already have an existing
> >>>> userspace
> >>>> (X.org) that doesn't perform any validation. We can't change it for
> >>>> that, but we can prevent future mistakes.
> >>> We should do it uniformly. Not per-format. That will make the code
> >>> unmaintainable real quick.
> >>>>>> +
> >>>>>> +                plane_state->base.src.x1 =
> >>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.x1, 1 << 18) <<
> >>> 18;
> >>>>>> +                plane_state->base.src.x2 =
> >>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.x2, 1 << 18) <<
> >>> 18;
> >>>>>> +                plane_state->base.src.y1 =
> >>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.y1, 1 << 18) <<
> >>> 18;
> >>>>>> +                plane_state->base.src.y2 =
> >>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.y2, 1 << 18) <<
> >>> 18;
> >>>>> Since this can now increase the size of the source rectangle our
> >>>>> scaling factor checks are no longer 100% valid. We might end up with
> >>>>> a scaling factor that is too high.
> >>>>> I don't really like any of these "let's make NV12 behave special"
> >>>>> tricks. We should make the code behave the same way for all pixel
> >>>>> formats instead of adding format specific hacks.
> >>>> This is not nivalid because we restrict the original src coordinates
> >>>> to be a multiple of 4, you can only clip to something smaller, not to
> >>>> something bigger. :)
> >>> The clipped coordinates can be whatever thanks to scaling/etc.
> >>> Also why are we trying to make everything a multiple of four? I don't
> >>> remember any hw restrictions like that.
> >>
> >>
> >> Hi
> >>
> >>
> >> As per WA1106, Display corruption/color shift observed when using NV12 with 270 rotation or 90 rotation + horizontal flip.
> >>
> >> WA: NV12 with 270 rotation or 90 rotation + horizontal flip requires the programmed plane height to be a multiple of 4.
> > Does plane height here mean src height or dst height?
> >
> > Either way I don't see why we aren't just checking for the right thing
> > instead of trying to mandate a four pixel alignment everywhere.
> >
> Agreed, what about the below diff, would this be acceptable to you? I deliberately ignore the last 16 bits as that is what we currently do anyway for all formats.
> 
> diff --git a/drivers/gpu/drm/i915/intel_display.c b/drivers/gpu/drm/i915/intel_display.c
> index 4b3735720fee..3ff7b5491446 100644
> --- a/drivers/gpu/drm/i915/intel_display.c
> +++ b/drivers/gpu/drm/i915/intel_display.c
> @@ -3090,6 +3090,31 @@ static int skl_check_main_surface(const struct intel_crtc_state *crtc_state,
>  	return 0;
>  }
>  
> +static int
> +skl_check_nv12_surface(const struct intel_crtc_state *crtc_state,
> +		       struct intel_plane_state *plane_state)
> +{
> +	/* Display WA #1106 */
> +	if (plane_state->base.rotation != (DRM_MODE_REFLECT_X | DRM_MODE_ROTATE_90) &&
> +	    plane_state->base.rotation != DRM_MODE_ROTATE_270)
> +		return 0;

Hmm. I wonder if that's what the spec actually means. The HSDs only
talk about 270 degree rotation. So I guess this interpretation could
be correct.

> +
> +	/* Because x/y are src coordinates will be rotated, we look at x/width here. */
> +	if (((plane_state->base.src_x >> 16) % 4) != 0 ||
> +	    ((plane_state->base.src_w >> 16) % 4) != 0) {
> +		DRM_DEBUG_KMS("src x/w must be multiple of 4 for rotated NV12\n");
> +		return -EINVAL;
> +	}
> +
> +	/* And round y here */
> +	plane_state->base.src.y1 =
> +		DIV_ROUND_CLOSEST(plane_state->base.src.y1, 1 << 18) << 18;
> +	plane_state->base.src.y2 =
> +		DIV_ROUND_CLOSEST(plane_state->base.src.y2, 1 << 18) << 18;

Why not just a simple

if (drm_rect_height(src) >> 16 % 4 != 0)
	return -EINVAL;

?

> +
> +	return 0;
> +}
> +
>  static int skl_check_nv12_aux_surface(struct intel_plane_state *plane_state)
>  {
>  	const struct drm_framebuffer *fb = plane_state->base.fb;
> @@ -3173,6 +3198,9 @@ int skl_check_plane_surface(const struct intel_crtc_state *crtc_state,
>  	 * the main surface setup depends on it.
>  	 */
>  	if (fb->format->format == DRM_FORMAT_NV12) {
> +		ret = skl_check_nv12_surface(crtc_state, plane_state);
> +		if (ret)
> +			return ret;
>  		ret = skl_check_nv12_aux_surface(plane_state);
>  		if (ret)
>  			return ret;
Maarten Lankhorst April 19, 2018, 2:19 p.m. UTC | #2
Op 19-04-18 om 13:50 schreef Ville Syrjälä:
> On Thu, Apr 19, 2018 at 01:30:32PM +0200, Maarten Lankhorst wrote:
>> Op 19-04-18 om 13:22 schreef Ville Syrjälä:
>>> On Thu, Apr 19, 2018 at 02:36:42AM +0000, Srinivas, Vidya wrote:
>>>>
>>>>
>>>>> -----Original Message-----
>>>>> From: Ville Syrjälä [mailto:ville.syrjala@linux.intel.com]
>>>>> Sent: Thursday, April 19, 2018 12:06 AM
>>>>> To: Maarten Lankhorst <maarten.lankhorst@linux.intel.com>
>>>>> Cc: Srinivas, Vidya <vidya.srinivas@intel.com>; intel-
>>>>> gfx@lists.freedesktop.org
>>>>> Subject: Re: [Intel-gfx] [PATCH v4 6/6] drm/i915: Add
>>>>> skl_check_nv12_surface for NV12
>>>>> On Wed, Apr 18, 2018 at 08:06:57PM +0200, Maarten Lankhorst wrote:
>>>>>> Op 18-04-18 om 17:32 schreef Ville Syrjälä:
>>>>>>> On Wed, Apr 18, 2018 at 09:38:13AM +0530, Vidya Srinivas wrote:
>>>>>>>> From: Maarten Lankhorst <maarten.lankhorst@linux.intel.com<mailto:maarten.lankhorst@linux.intel.com>>
>>>>>>>> We skip src trunction/adjustments for
>>>>>>>> NV12 case and handle the sizes directly.
>>>>>>>> Without this, pipe fifo underruns are seen on APL/KBL.
>>>>>>>> v2: For NV12, making the src coordinates multiplier of 4
>>>>>>>> v3: Moving all the src coords handling code for NV12 to
>>>>>>>> skl_check_nv12_surface
>>>>>>>> Signed-off-by: Maarten Lankhorst
>>>>>>>> <maarten.lankhorst@linux.intel.com<mailto:maarten.lankhorst@linux.intel.com>>
>>>>>>>> Signed-off-by: Vidya Srinivas <vidya.srinivas@intel.com<mailto:vidya.srinivas@intel.com>>
>>>>>>>> ---
>>>>>>>>  drivers/gpu/drm/i915/intel_display.c | 39
>>>>>>>> ++++++++++++++++++++++++++++++++++++
>>>>>>>>  drivers/gpu/drm/i915/intel_sprite.c  | 15 ++++++++++----
>>>>>>>>  2 files changed, 50 insertions(+), 4 deletions(-)
>>>>>>>> diff --git a/drivers/gpu/drm/i915/intel_display.c
>>>>>>>> b/drivers/gpu/drm/i915/intel_display.c
>>>>>>>> index 925402e..b8dbaca 100644
>>>>>>>> --- a/drivers/gpu/drm/i915/intel_display.c
>>>>>>>> +++ b/drivers/gpu/drm/i915/intel_display.c
>>>>>>>> @@ -3118,6 +3118,42 @@ static int skl_check_main_surface(const
>>>>> struct intel_crtc_state *crtc_state,
>>>>>>>>  return 0;
>>>>>>>>  }
>>>>>>>> +static int
>>>>>>>> +skl_check_nv12_surface(const struct intel_crtc_state *crtc_state,
>>>>>>>> +                                       struct intel_plane_state *plane_state) {
>>>>>>>> +                int crtc_x2 = plane_state->base.crtc_x + plane_state->base.crtc_w;
>>>>>>>> +                int crtc_y2 = plane_state->base.crtc_y +
>>>>>>>> +plane_state->base.crtc_h;
>>>>>>>> +
>>>>>>>> +                if (((plane_state->base.src_x >> 16) % 4) != 0 ||
>>>>>>>> +                    ((plane_state->base.src_y >> 16) % 4) != 0 ||
>>>>>>>> +                    ((plane_state->base.src_w >> 16) % 4) != 0 ||
>>>>>>>> +                    ((plane_state->base.src_h >> 16) % 4) != 0) {
>>>>>>>> +                                DRM_DEBUG_KMS("src coords must be multiple of 4 for
>>>>> NV12\n");
>>>>>>>> +                                return -EINVAL;
>>>>>>>> +                }
>>>>>>> I don't really see why we should check these. The clipped
>>>>>>> coordinates are what matters.
>>>>>> To propagate our limits to the userspace. I think we should do it for
>>>>>> all formats, but NV12 is the first YUV format we have tests for. If we
>>>>>> could we should do something similar for the other YUV formats, but they
>>>>> have different requirements.
>>>>>> In case of NV12 we don't have existing userspace, there will be
>>>>>> nothing that breaks if we enforce limits from the start.
>>>>> But what about sub-pixel coordinates? You're totally ignoring them here.
>>>>> We need to come up with some proper rules for this stuff.
>>>>>>>> +
>>>>>>>> +                /* Clipping would cause a 1-3 pixel gap at the edge of the screen? */
>>>>>>>> +                if ((crtc_x2 > crtc_state->pipe_src_w && crtc_state->pipe_src_w %
>>>>> 4) ||
>>>>>>>> +                    (crtc_y2 > crtc_state->pipe_src_h && crtc_state->pipe_src_h % 4))
>>>>> {
>>>>>>>> +                                DRM_DEBUG_KMS("It's not possible to clip %u,%u to
>>>>> %u,%u\n",
>>>>>>>> +                                                      crtc_x2, crtc_y2,
>>>>>>>> +                                                      crtc_state->pipe_src_w, crtc_state->pipe_src_h);
>>>>>>>> +                                return -EINVAL;
>>>>>>>> +                }
>>>>>>> Why should we care? The current code already plays it fast and loose
>>>>>>> and allows the dst rectangle to shrink to accomodate the hw limits.
>>>>>>> If we want to change that we should change it universally.
>>>>>> Unfortunately for the other formats we already have an existing
>>>>>> userspace
>>>>>> (X.org) that doesn't perform any validation. We can't change it for
>>>>>> that, but we can prevent future mistakes.
>>>>> We should do it uniformly. Not per-format. That will make the code
>>>>> unmaintainable real quick.
>>>>>>>> +
>>>>>>>> +                plane_state->base.src.x1 =
>>>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.x1, 1 << 18) <<
>>>>> 18;
>>>>>>>> +                plane_state->base.src.x2 =
>>>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.x2, 1 << 18) <<
>>>>> 18;
>>>>>>>> +                plane_state->base.src.y1 =
>>>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.y1, 1 << 18) <<
>>>>> 18;
>>>>>>>> +                plane_state->base.src.y2 =
>>>>>>>> +                                DIV_ROUND_CLOSEST(plane_state->base.src.y2, 1 << 18) <<
>>>>> 18;
>>>>>>> Since this can now increase the size of the source rectangle our
>>>>>>> scaling factor checks are no longer 100% valid. We might end up with
>>>>>>> a scaling factor that is too high.
>>>>>>> I don't really like any of these "let's make NV12 behave special"
>>>>>>> tricks. We should make the code behave the same way for all pixel
>>>>>>> formats instead of adding format specific hacks.
>>>>>> This is not nivalid because we restrict the original src coordinates
>>>>>> to be a multiple of 4, you can only clip to something smaller, not to
>>>>>> something bigger. :)
>>>>> The clipped coordinates can be whatever thanks to scaling/etc.
>>>>> Also why are we trying to make everything a multiple of four? I don't
>>>>> remember any hw restrictions like that.
>>>>
>>>> Hi
>>>>
>>>>
>>>> As per WA1106, Display corruption/color shift observed when using NV12 with 270 rotation or 90 rotation + horizontal flip.
>>>>
>>>> WA: NV12 with 270 rotation or 90 rotation + horizontal flip requires the programmed plane height to be a multiple of 4.
>>> Does plane height here mean src height or dst height?
>>>
>>> Either way I don't see why we aren't just checking for the right thing
>>> instead of trying to mandate a four pixel alignment everywhere.
>>>
>> Agreed, what about the below diff, would this be acceptable to you? I deliberately ignore the last 16 bits as that is what we currently do anyway for all formats.
>>
>> diff --git a/drivers/gpu/drm/i915/intel_display.c b/drivers/gpu/drm/i915/intel_display.c
>> index 4b3735720fee..3ff7b5491446 100644
>> --- a/drivers/gpu/drm/i915/intel_display.c
>> +++ b/drivers/gpu/drm/i915/intel_display.c
>> @@ -3090,6 +3090,31 @@ static int skl_check_main_surface(const struct intel_crtc_state *crtc_state,
>>  	return 0;
>>  }
>>  
>> +static int
>> +skl_check_nv12_surface(const struct intel_crtc_state *crtc_state,
>> +		       struct intel_plane_state *plane_state)
>> +{
>> +	/* Display WA #1106 */
>> +	if (plane_state->base.rotation != (DRM_MODE_REFLECT_X | DRM_MODE_ROTATE_90) &&
>> +	    plane_state->base.rotation != DRM_MODE_ROTATE_270)
>> +		return 0;
> Hmm. I wonder if that's what the spec actually means. The HSDs only
> talk about 270 degree rotation. So I guess this interpretation could
> be correct.
>
>> +
>> +	/* Because x/y are src coordinates will be rotated, we look at x/width here. */
>> +	if (((plane_state->base.src_x >> 16) % 4) != 0 ||
>> +	    ((plane_state->base.src_w >> 16) % 4) != 0) {
>> +		DRM_DEBUG_KMS("src x/w must be multiple of 4 for rotated NV12\n");
>> +		return -EINVAL;
>> +	}
>> +
>> +	/* And round y here */
>> +	plane_state->base.src.y1 =
>> +		DIV_ROUND_CLOSEST(plane_state->base.src.y1, 1 << 18) << 18;
>> +	plane_state->base.src.y2 =
>> +		DIV_ROUND_CLOSEST(plane_state->base.src.y2, 1 << 18) << 18;
> Why not just a simple
>
> if (drm_rect_height(src) >> 16 % 4 != 0)
> 	return -EINVAL;
>
So lets do that, but we have to fix the rounding errors in i915 to make it useful:

upscaling 16x16 to 2560x1440:
(kms_plane_scaling:1172) igt_kms-DEBUG: plane A.1: Setting property "SRC_W" to 0x100000/1048576
(kms_plane_scaling:1172) igt_kms-DEBUG: plane A.1: Setting property "SRC_H" to 0x100000/1048576
(kms_plane_scaling:1172) igt_kms-DEBUG: plane A.1: Setting property "CRTC_X" to 0x0/0
(kms_plane_scaling:1172) igt_kms-DEBUG: plane A.1: Setting property "CRTC_Y" to 0x0/0
(kms_plane_scaling:1172) igt_kms-DEBUG: plane A.1: Setting property "CRTC_W" to 0xa00/2560
(kms_plane_scaling:1172) igt_kms-DEBUG: plane A.1: Setting property "CRTC_H" to 0x5a0/1440

[  290.766555] [drm:drm_ioctl [drm]] pid=1172, dev=0xe200, auth=1, DRM_IOCTL_MODE_ATOMIC
...
[  290.767276] [drm:skl_update_scaler [i915]] NV12: src dimensions not met: 15 < 16
[  290.767284] [drm:drm_atomic_helper_check_planes [drm_kms_helper]] [PLANE:33:plane 2A] atomic driver check failed
[  290.767422] [drm:drm_ioctl [drm]] pid=1172, ret = -22

If we want to go with that patch, I'll commit it but let the tests fail since rounding is not a NV12 specific error.

~Maarten
diff mbox

Patch

diff --git a/drivers/gpu/drm/i915/intel_display.c b/drivers/gpu/drm/i915/intel_display.c
index 4b3735720fee..3ff7b5491446 100644
--- a/drivers/gpu/drm/i915/intel_display.c
+++ b/drivers/gpu/drm/i915/intel_display.c
@@ -3090,6 +3090,31 @@  static int skl_check_main_surface(const struct intel_crtc_state *crtc_state,
 	return 0;
 }
 
+static int
+skl_check_nv12_surface(const struct intel_crtc_state *crtc_state,
+		       struct intel_plane_state *plane_state)
+{
+	/* Display WA #1106 */
+	if (plane_state->base.rotation != (DRM_MODE_REFLECT_X | DRM_MODE_ROTATE_90) &&
+	    plane_state->base.rotation != DRM_MODE_ROTATE_270)
+		return 0;
+
+	/* Because x/y are src coordinates will be rotated, we look at x/width here. */
+	if (((plane_state->base.src_x >> 16) % 4) != 0 ||
+	    ((plane_state->base.src_w >> 16) % 4) != 0) {
+		DRM_DEBUG_KMS("src x/w must be multiple of 4 for rotated NV12\n");
+		return -EINVAL;
+	}
+
+	/* And round y here */
+	plane_state->base.src.y1 =
+		DIV_ROUND_CLOSEST(plane_state->base.src.y1, 1 << 18) << 18;
+	plane_state->base.src.y2 =
+		DIV_ROUND_CLOSEST(plane_state->base.src.y2, 1 << 18) << 18;
+
+	return 0;
+}
+
 static int skl_check_nv12_aux_surface(struct intel_plane_state *plane_state)
 {
 	const struct drm_framebuffer *fb = plane_state->base.fb;
@@ -3173,6 +3198,9 @@  int skl_check_plane_surface(const struct intel_crtc_state *crtc_state,
 	 * the main surface setup depends on it.
 	 */
 	if (fb->format->format == DRM_FORMAT_NV12) {
+		ret = skl_check_nv12_surface(crtc_state, plane_state);
+		if (ret)
+			return ret;
 		ret = skl_check_nv12_aux_surface(plane_state);
 		if (ret)
 			return ret;