From patchwork Fri May 13 20:29:02 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Nicolas Dufresne X-Patchwork-Id: 12849392 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id ECD96C433F5 for ; Fri, 13 May 2022 20:30:01 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1384351AbiEMU3t (ORCPT ); Fri, 13 May 2022 16:29:49 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34100 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1384305AbiEMU3r (ORCPT ); Fri, 13 May 2022 16:29:47 -0400 Received: from bhuna.collabora.co.uk (bhuna.collabora.co.uk [46.235.227.227]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7EA3F7981F; Fri, 13 May 2022 13:29:32 -0700 (PDT) Received: from [127.0.0.1] (localhost [127.0.0.1]) (Authenticated sender: nicolas) with ESMTPSA id 35F6D1F46480 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=collabora.com; s=mail; t=1652473769; bh=OWznsKmKtT3QluhG/yWwf11S8JLYLw82u9Ar9baljbk=; h=From:To:Cc:Subject:Date:From; b=WHr9KsupJFwUyDF9H+3pyha3nsZq6nOLxVuKLkyFwdSsDy0+SADYwunh0eO2I3LYq hOZMN2mrLxS9gdqE3rKvQZOlYmJAK5VoKr+bZfcZ/UL9viSj/bKjBYEjTW5RynM2Py K/8VINmoRBlMT/zsnMrGQm7U5tz7foCE3N4o7lzKJgLXKn3QyFOfsI1njTmL6SNAJG 8QOJnXLjcm0PEHOtKPkpk9v3uzFiIbiasKjmXxMvL/Gxju2OX2BVOLvNJFiAYk3wkX 826T5KxtTGGnZQ9fP5Ro+0PYrjaEj2yWXpbm9rS1HPoCtPqoqRdrm7EFXsBJ3vdjxA K2h4gfG5EVBhg== From: Nicolas Dufresne Cc: nicolas@ndufresne.ca, linux-media@vger.kernel.org, linux-kernel@vger.kernel.org Subject: [PATCH v5 00/20] H.264 Field Decoding Support for Frame-based Decoders Date: Fri, 13 May 2022 16:29:02 -0400 Message-Id: <20220513202922.13846-1-nicolas.dufresne@collabora.com> X-Mailer: git-send-email 2.34.3 MIME-Version: 1.0 To: unlisted-recipients:; (no To-header on input) Precedence: bulk List-ID: X-Mailing-List: linux-media@vger.kernel.org NOTE: This is a rebased series to solve merge conflicts, some patches have been skipped as they are already in media_stage tree Until now, only Cedrus (a slice base decoder) supported interlaced decoding. In order to support field decoding in our frame-based decoder, the v4l2-h264 library needed adaptation to produce the appropriate reference lists. This patch extends the v4l2-h264 library to produce the larger references list needed to represent fields separately. Hantro, MTK-VCODEC and RKVDEC drivers have been adapted to accommodate the larger lists. Though, only Hantro and RKVDEC actually have HW support for field decoding. So only these two have been updated to make use of the larger lists. All this work has been done using the H.264 specification, LibreELEC downstream kernel patches, Rockchip MPP reference software and Hantro reference software. For reviewers, the following is the map of all commit. Patches that could be merge independently of this serie are marked as independent. Note that the test results do depend on the generic fixes. 01-07 : v4l2-h264 field decoding support 08-14 : rkvdec h.264 generic fixes (independent) 15-16 : rkvdec h.264 field decoding support 17-20 : hantro h.264 field decoding support All this work have been tested using GStreamer mainline implementation but also with FFMPEG LibreELEC fork using the testing tool fluster running through the ITU-T H.264 (2016-02) AVCv2 set of bitstream. Before this patch, the scores were: Hantro: FFMPEG: 88/135 GSteamer: 90/135 RKVDEC: FFMPEG: 73/135 GSteamer: 77/135 And after these changes: Hantro: FFMPEG: 118/135 GSteamer: 129/135 RKVDEC: FFMPEG: 118/135 GSteamer: 129/135 Note that a bug in FFMPEG / LibreELEC fork was noticed and fixed with the following change: Some useful links: Detailed Hantro Results: https://gitlab.freedesktop.org/-/snippets/5189 Detailed RKVDEC Results: https://gitlab.freedesktop.org/-/snippets/5253 ITU-T H.264 (2016-02) AVCv2: https://www.itu.int/net/itu-t/sigdb/spevideo/VideoForm-s.aspx?val=102002641 Fluster: https://github.com/fluendo/fluster GStreamer: https://gitlab.freedesktop.org/gstreamer/gstreamer/ FFMPEG Fork: https://github.com/jernejsk/FFmpeg/tree/v4l2-request-hwaccel-4.4 Rockchip MPP: https://github.com/rockchip-linux/mpp Changes in v5: - Rebased on media_stage tree - Fixed merge conflict due to some Meditatek vcodec driver refactoring - Fixes made by Hans in his pull request v2 (build fix) - Fixed xmas coding style in Tegra driver (from a late review comment) Changes in v4: - Fixed mtk-vcodec port - Ported tegra-vde driver (compiled tested only) - Applied Hans' commit message, comment and documentation suggestions Changes in v3: - Improved debug message on timestamp miss-match - Moved H264 SPS validation into rkvdec-h264 - Added more comments around H264 SPS validation - Also validate at streamon (rkvdec start()) - Applied more Review-by and Fixes tag - Fixed Signed-off-by chain in Jonas patch Changes in v2: - Applied most of Sebastian's suggestion in comments and commit messages. - Use a bool for dpb_valid and dpb_bottom in rkvdec - Dropped one wrong typo fix (media: v4l2-mem2mem: Fix typo in trace message) - Dropped Alex fix (media: rkvdec-h264: Don't hardcode SPS/PPS parameters + I will carry this one later, it seems cosmetic Jonas Karlman (5): media: rkvdec: h264: Fix bit depth wrap in pps packet media: rkvdec: h264: Validate and use pic width and height in mbs media: rkvdec: h264: Fix reference frame_num wrap for second field media: rkvdec: Ensure decoded resolution fit coded resolution media: hantro: h264: Make dpb entry management more robust Nicolas Dufresne (15): media: h264: Use v4l2_h264_reference for reflist media: h264: Increase reference lists size to 32 media: h264: Store current picture fields media: h264: Store all fields into the unordered list media: v4l2: Trace calculated p/b0/b1 initial reflist media: h264: Sort p/b reflist using frame_num media: v4l2: Reorder field reflist media: rkvdec: Stop overclocking the decoder media: rkvdec: h264: Fix dpb_valid implementation media: rkvdec: Move H264 SPS validation in rkvdec-h264 media: rkvdec-h264: Add field decoding support media: rkvdec: Enable capture buffer holding for H264 media: hantro: Stop using H.264 parameter pic_num media: hantro: Add H.264 field decoding support media: hantro: Enable HOLD_CAPTURE_BUF for H.264 .../vcodec/vdec/vdec_h264_req_common.c | 21 +- .../vcodec/vdec/vdec_h264_req_common.h | 11 +- .../mediatek/vcodec/vdec/vdec_h264_req_if.c | 15 +- .../vcodec/vdec/vdec_h264_req_multi_if.c | 27 +- .../media/platform/nvidia/tegra-vde/h264.c | 19 +- drivers/media/v4l2-core/v4l2-h264.c | 271 +++++++++++++++--- .../staging/media/hantro/hantro_g1_h264_dec.c | 38 +-- drivers/staging/media/hantro/hantro_h264.c | 134 +++++++-- drivers/staging/media/hantro/hantro_hw.h | 8 +- drivers/staging/media/hantro/hantro_v4l2.c | 25 ++ .../media/hantro/rockchip_vpu2_hw_h264_dec.c | 98 +++---- drivers/staging/media/rkvdec/rkvdec-h264.c | 157 +++++++--- drivers/staging/media/rkvdec/rkvdec.c | 35 +-- drivers/staging/media/rkvdec/rkvdec.h | 2 + include/media/v4l2-h264.h | 31 +- 15 files changed, 646 insertions(+), 246 deletions(-) diff --git a/libavcodec/v4l2_request_h264.c b/libavcodec/v4l2_request_h264.c index 88da8f0a2d..394bae0550 100644 --- a/libavcodec/v4l2_request_h264.c +++ b/libavcodec/v4l2_request_h264.c @@ -66,7 +66,7 @@ static void fill_dpb_entry(struct v4l2_h264_dpb_entry *entry, const H264Picture { entry->reference_ts = ff_v4l2_request_get_capture_timestamp(pic->f); entry->pic_num = pic->pic_id; - entry->frame_num = pic->frame_num; + entry->frame_num = pic->long_ref ? pic->pic_id : pic->frame_num; entry->fields = pic->reference & V4L2_H264_FRAME_REF; entry->flags = V4L2_H264_DPB_ENTRY_FLAG_VALID; if (entry->fields)