diff mbox series

archive: add test testing MIME for created archive through compression filter

Message ID 20211011112712.475306-1-bagasdotme@gmail.com (mailing list archive)
State New, archived
Headers show
Series archive: add test testing MIME for created archive through compression filter | expand

Commit Message

Bagas Sanjaya Oct. 11, 2021, 11:27 a.m. UTC
The common use of `tar.<format>.command` config is to specify
compression program filter for creating compressed tar archive.

Add a test that tests MIME type of archives created through the filter.
The generated archives must not be `application/x-tar` (POSIX tar
archive) type, so that these can be decompressed with the corresponding
decompression program (such as gunzip, bunzip2, and unxz).

Signed-off-by: Bagas Sanjaya <bagasdotme@gmail.com>
---
 t/t5000-tar-tree.sh | 17 +++++++++++++++++
 1 file changed, 17 insertions(+)


base-commit: 106298f7f9cca4158a980de149ef217751e1f943

Comments

Junio C Hamano Oct. 11, 2021, 3:48 p.m. UTC | #1
Bagas Sanjaya <bagasdotme@gmail.com> writes:

> +	cat >expect <<EOF &&
> +HEAD.tar.bz2: application/x-bzip2; charset=binary
> +HEAD.tar.gz:  application/gzip; charset=binary
> +HEAD.tar.xz:  application/x-xz; charset=binary
> +EOF

Use <<-EOF and you can align here-document to the script.

> +	file -i HEAD.tar.bz2 HEAD.tar.gz HEAD.tar.xz 2>/dev/null >actual &&

This looks super un-portable.

  https://pubs.opengroup.org/onlinepubs/9699919799/utilities/file.html
Junio C Hamano Oct. 11, 2021, 3:53 p.m. UTC | #2
Bagas Sanjaya <bagasdotme@gmail.com> writes:

> +test_expect_success GZIP,BZIP2,XZ 'git archive with gzip, bzip2, and xz filters creates compressed tar archive with proper MIME type' '
> +	git config tar.tar.bz2.command "bzip2 -c" &&
> +	git config tar.tar.xz.command "xz -c" &&
> +	git archive --output HEAD.tar.gz --prefix=src/ HEAD &&

I think a lot more portable and robust way to test the feature is to
configure git config tar.tar.test.command (or come up with a name for
a suffix to be used in the test), point it at a script created in this
test script and run git archive with output filename that would trigger
the command.  Then, arrange the test to notice if the "test" script was
called with expected command line arguments and standard input.

That way, you do not need to rely on prereqs and you do not have to
resort to un-portable use of the "file" command.  After all, you are
*not* testing if "bzip2 -c" the user happens to have on their $PATH
produces output their "find" recognises as bzip2 compressed.
Jeff King Oct. 11, 2021, 4:07 p.m. UTC | #3
On Mon, Oct 11, 2021 at 08:53:11AM -0700, Junio C Hamano wrote:

> Bagas Sanjaya <bagasdotme@gmail.com> writes:
> 
> > +test_expect_success GZIP,BZIP2,XZ 'git archive with gzip, bzip2, and xz filters creates compressed tar archive with proper MIME type' '
> > +	git config tar.tar.bz2.command "bzip2 -c" &&
> > +	git config tar.tar.xz.command "xz -c" &&
> > +	git archive --output HEAD.tar.gz --prefix=src/ HEAD &&
> 
> I think a lot more portable and robust way to test the feature is to
> configure git config tar.tar.test.command (or come up with a name for
> a suffix to be used in the test), point it at a script created in this
> test script and run git archive with output filename that would trigger
> the command.  Then, arrange the test to notice if the "test" script was
> called with expected command line arguments and standard input.
> 
> That way, you do not need to rely on prereqs and you do not have to
> resort to un-portable use of the "file" command.  After all, you are
> *not* testing if "bzip2 -c" the user happens to have on their $PATH
> produces output their "find" recognises as bzip2 compressed.

That would work. Or perhaps using something portable like "tr" to
do a nonsense conversion, and verifying that we can get convert back to
the original. Like say, the tests added to t5000 by 767cf4579f (archive:
implement configurable tar filters, 2011-06-21).

I don't think the newly proposed test is adding anything beyond that
(except checking the system "bzip2" command, but as you say, that is not
useful to us, nor portable).

-Peff
Bagas Sanjaya Oct. 12, 2021, 3:14 a.m. UTC | #4
On 11/10/21 22.53, Junio C Hamano wrote:
> I think a lot more portable and robust way to test the feature is to
> configure git config tar.tar.test.command (or come up with a name for
> a suffix to be used in the test), point it at a script created in this
> test script and run git archive with output filename that would trigger
> the command.  Then, arrange the test to notice if the "test" script was
> called with expected command line arguments and standard input.
> 
> That way, you do not need to rely on prereqs and you do not have to
> resort to un-portable use of the "file" command.  After all, you are
> *not* testing if "bzip2 -c" the user happens to have on their $PATH
> produces output their "find" recognises as bzip2 compressed.
> 

The intent of this test is to ensure `git archive -o 
something.tar.<format>` produces proper compressed tar archive that can 
be decompressed with the corresponding decompression tool (such as 
gunzip for gz files), and not just with `tar xvf`.
Junio C Hamano Oct. 12, 2021, 7:15 p.m. UTC | #5
Bagas Sanjaya <bagasdotme@gmail.com> writes:

> On 11/10/21 22.53, Junio C Hamano wrote:
>> I think a lot more portable and robust way to test the feature is to
>> configure git config tar.tar.test.command (or come up with a name for
>> a suffix to be used in the test), point it at a script created in this
>> test script and run git archive with output filename that would trigger
>> the command.  Then, arrange the test to notice if the "test" script was
>> called with expected command line arguments and standard input.
>> That way, you do not need to rely on prereqs and you do not have to
>> resort to un-portable use of the "file" command.  After all, you are
>> *not* testing if "bzip2 -c" the user happens to have on their $PATH
>> produces output their "find" recognises as bzip2 compressed.
>> 
>
> The intent of this test is to ensure `git archive -o
> something.tar.<format>` produces proper compressed tar archive that
> can be decompressed with the corresponding decompression tool (such as 
> gunzip for gz files), and not just with `tar xvf`.

Yes, that is why relying on "file -i" is not the right thing to do.

We'd want to use the matching decompressor, or better yet, trust
that the compressor-decompressor pair the system offers do work
correctly (after all, it's not our test suite's job to debug the
system software the tester uses) and validate that

 (1) the specified compressor is spawned with the right parameters
     taken from the configuration file, and that

 (2) the compressor is fed the correct input stream.

Using a script prepared by the test itself as the "compressor" would
be one good way to validate (1); it can record how it was invoked,
so that the test can check its correctness.  Using a reversible
transformation in the script as a fake compressor and comparing the
reversed output with the output without any compression filter, may
be one way to validate (2), like Peff mentioned.

But asking "file -i" validates neither.

The code that reads the configuration and spawns the compressor with
specified parameters may be broken and may not pass the right
parameters, or the archive stream may not be passed to the
compressor intact.  But the compressor may still produce a header
block that is looking correct enough for the "file" command to say
"ah, that uses X compressor", and the tests in the patch under
discussion does not even see if the payload was passed correctly
without corruption to the compressor at all.
diff mbox series

Patch

diff --git a/t/t5000-tar-tree.sh b/t/t5000-tar-tree.sh
index 2c88d1c159..04cdad5bff 100755
--- a/t/t5000-tar-tree.sh
+++ b/t/t5000-tar-tree.sh
@@ -38,6 +38,8 @@  test_lazy_prereq TAR_NEEDS_PAX_FALLBACK '
 '
 
 test_lazy_prereq GZIP 'gzip --version'
+test_lazy_prereq BZIP2 'bzip2 --version'
+test_lazy_prereq XZ 'xz --version'
 
 get_pax_header() {
 	file=$1
@@ -374,6 +376,21 @@  test_expect_success GZIP 'remote tar.gz can be disabled' '
 		>remote.tar.gz
 '
 
+test_expect_success GZIP,BZIP2,XZ 'git archive with gzip, bzip2, and xz filters creates compressed tar archive with proper MIME type' '
+	git config tar.tar.bz2.command "bzip2 -c" &&
+	git config tar.tar.xz.command "xz -c" &&
+	git archive --output HEAD.tar.gz --prefix=src/ HEAD &&
+	git archive --output HEAD.tar.bz2 --prefix=src/ HEAD &&
+	git archive --output HEAD.tar.xz --prefix=src/ HEAD &&
+	cat >expect <<EOF &&
+HEAD.tar.bz2: application/x-bzip2; charset=binary
+HEAD.tar.gz:  application/gzip; charset=binary
+HEAD.tar.xz:  application/x-xz; charset=binary
+EOF
+	file -i HEAD.tar.bz2 HEAD.tar.gz HEAD.tar.xz 2>/dev/null >actual &&
+	test_cmp expect actual
+'
+
 test_expect_success 'archive and :(glob)' '
 	git archive -v HEAD -- ":(glob)**/sh" >/dev/null 2>actual &&
 	cat >expect <<EOF &&