In constant FPS mode, it's possible for there to be *no* input active at
the current timestamp, even though there would be active inputs later in
the timeline (nb_active > 0). This would previously hit the AVERROR_BUG case.
With this change, we can instead output an empty frame.
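Sketchwise (helper names like nb_active_at() and output_blank_frame()
are hypothetical, not the actual vf_libplacebo code):

    if (!nb_active_at(s, out_pts) && s->nb_active > 0) {
        /* No input covers out_pts, but more frames are coming later:
         * emit an empty frame instead of hitting AVERROR_BUG. */
        return output_blank_frame(outlink, out_pts);
    }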
Instead of having a single s->status field to track the most recently
EOF'd stream, track the number of active streams directly and only query
the most recent status at the end.
The reason this worked before is that we implicitly relied on the
assumption that `ok` would always be true until all streams are EOF, but
because it's possible to have gaps in the timeline when mixing multiple
streams, this is not always the case in principle.
In practice, this fixes a bug where the filter would early-exit (EOF)
too soon whenever any input reached EOF while there was a gap in the
timeline.
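Roughly, the new bookkeeping looks like this (a sketch; the field and
helper names such as most_recent_status() are illustrative, only
ff_outlink_set_status() is the real API):

    int nb_active = 0;
    for (int i = 0; i < s->nb_inputs; i++)
        nb_active += !s->inputs[i].eof;

    if (!nb_active) {
        /* Only once *all* inputs hit EOF, query the most recent
         * status and forward it downstream. */
        ff_outlink_set_status(outlink, most_recent_status(s), s->status_pts);
        return 0;
    }
    /* nb_active > 0: keep filtering, even across gaps in the timeline. */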
When using libplacebo to composite multiple streams with complex PTS
values, it's possible for there to be a "hole" in the output stream; in
particular in constant FPS mode. This commit refactors output_frame() to
allow it to handle the case of there being no active input.
An --enable-* option should not have a condition under which it gets
enabled by some other means, as is the case here when swscale is enabled.
Signed-off-by: James Almer <jamrial@gmail.com>
This patch adds format handling code for the new operations. This entails
fully decoding a format to standardized RGB, and the inverse.
Handling it this way means we can always guarantee that a conversion path
exists from A to B without having to explicitly cover logic for each path;
and choosing RGB instead of YUV as the intermediate (as was done in swscale
v1) is more flexible with regards to enabling further operations such as
primaries conversions, linear scaling, etc.
In the case of a YUV->YUV transform, the redundant matrix multiplication
will be canceled out anyway.
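Schematically (op names are illustrative):

    /* yuv420p -> yuv444p, routed through the common RGB hub:
     *
     *   read yuv420p -> yuv2rgb(M_in) -> rgb2yuv(M_out) -> write yuv444p
     *
     * When M_out is the inverse of M_in, the two matrix multiplies
     * fold into an identity, so the YUV->YUV round trip through RGB
     * ends up costing nothing. */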
Because of the lack of an external ABI on low-level kernels, we cannot
directly test internal functions. Instead, we construct a minimal op chain
consisting of a read, the op to be tested, and a write.
The bigger complication arises from the fact that the backend may generate
arbitrary internal state that needs to be passed back to the implementation,
which means we cannot directly call `func_ref` on the generated chain. To get
around this, always compile the op chain twice - once using the backend to be
tested, and once using the reference C backend.
The actual entry point may also just be a shared wrapper, so we need to
be very careful to run checkasm_check_func() on a pseudo-pointer that will
actually be unique for each combination of backend and active CPU flags.
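Putting it together, the check looks roughly like this (types and
helpers such as compile_ops() and make_unique_key() are illustrative;
check_func() and fail() are the real checkasm entry points):

    /* Compile the same op chain twice: reference C and backend. */
    SwsOpChain *ref = compile_ops(&backend_c, ops);
    SwsOpChain *tst = compile_ops(backend,    ops);

    /* Key on a pseudo-pointer unique per (backend, cpu flags), since
     * the chain's actual entry point may be a shared wrapper. */
    if (check_func(make_unique_key(backend, cpu_flags), "%s_%s",
                   backend->name, op_name)) {
        run_chain(ref, dst_ref, src);
        run_chain(tst, dst_tst, src);
        if (memcmp(dst_ref, dst_tst, dst_size))
            fail();
    }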
This covers most 8-bit and 16-bit ops, and some 32-bit ops. It also covers all
floating point operations. While this is not yet 100% coverage, it's good
enough for the vast majority of formats out there.
Of special note is the packed shuffle fast path, which uses pshufb at vector
sizes up to AVX512.
Provides a generic fast path for any operation list that can be decomposed
into a series of memcpy and memset operations.
25% faster than the x86 backend for yuv444p -> yuva444p
33% faster than the x86 backend for gray -> yuvj444p
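For example, yuv444p -> yuva444p becomes three per-line memcpys plus
one memset of the alpha plane. In sketch form (struct layout
illustrative):

    for (int y = 0; y < height; y++) {
        for (int i = 0; i < fast->nb_entries; i++) {
            const Entry *e = &fast->entries[i];
            if (e->fill) /* e.g. set the alpha plane to 0xFF */
                memset(dst[e->plane] + y * dst_stride[e->plane],
                       e->value, e->line_bytes);
            else         /* plain plane copy */
                memcpy(dst[e->plane] + y * dst_stride[e->plane],
                       src[e->plane] + y * src_stride[e->plane],
                       e->line_bytes);
        }
    }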
This will serve as a reference for the SIMD backends to come. That said,
with auto-vectorization enabled, the performance of this is not atrocious.
It easily beats the old C code and sometimes even the old SIMD.
In theory, we can dramatically speed it up by using GCC vectors instead of
arrays, but the performance gains from this are too dependent on exact GCC
versions and flags, so in practice it's not a substitute for a SIMD
implementation.
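For illustration, the GCC vector version would look roughly like this;
the reference backend deliberately sticks to plain arrays:

    typedef float f32x4 __attribute__((vector_size(16)));

    static void scale_lane(f32x4 *px, int n, float k)
    {
        for (int i = 0; i < n; i++)
            px[i] *= k; /* the scalar is broadcast across the vector */
    }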
This can turn any compatible sequence of operations into a single packed
shuffle, including packed swizzling, grayscale->RGB conversion, endianness
swapping, RGB bit depth conversions, rgb24->rgb0 alpha clearing and more.
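For example, rgb24 -> rgb0 with the alpha byte cleared reduces to one
byte shuffle per vector (SSSE3 pshufb semantics: a mask byte with the
high bit set zeroes the corresponding output byte):

    /* consumes 12 input bytes per 16 output bytes */
    static const uint8_t rgb24_to_rgb0[16] = {
        0, 1, 2, 0x80,  3, 4, 5, 0x80,
        6, 7, 8, 0x80,  9, 10, 11, 0x80,
    };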
This handles the low-level execution of an op list, and integration into
the SwsGraph infrastructure. To handle frames with insufficient padding in
the stride (or a width smaller than one block size), we use a fallback loop
that pads the last column of pixels using `memcpy` into an appropriately
sized buffer.
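The tail handling, in sketch form (helper names illustrative):

    /* When fewer than block_w pixels remain (or the stride lacks
     * padding), bounce through a sufficiently padded buffer so the
     * kernel can always load and store whole blocks. */
    if (w_left < block_w) {
        memcpy(tmp_in, src_row, w_left * in_bpp);    /* pad input   */
        process_block(tmp_out, tmp_in);              /* full block  */
        memcpy(dst_row, tmp_out, w_left * out_bpp);  /* valid part  */
    }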
This is responsible for taking a "naive" ops list and optimizing it
as much as possible. Also includes a small analyzer that generates component
metadata for use by the optimizer.
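At its core this is a peephole pass over the op list (a sketch; the
helper names are made up):

    for (int i = 0; i < n; i++) {
        if (op_is_noop(&ops[i]))               /* e.g. multiply by 1 */
            n = remove_op(ops, n, i--);
        else if (i + 1 < n && can_fuse(&ops[i], &ops[i + 1]))
            n = fuse_ops(ops, n, i--);         /* merge adjacent ops */
    }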
See docs/swscale-v2.txt for an in-depth introduction to the new approach.
This commit merely introduces the ops definitions and boilerplate functions.
The subsequent commits will flesh out the underlying implementation.
Give users and developers a way to opt in to the new format conversion code,
and more code from the swscale rewrite in general, even while development is
still ongoing.
We split the standard macro into its body (implementation) and declaration,
and use a macro argument in place of the raw `memcmp` call, with the major
difference that we now take the number of pixels to compare instead of the
number of bytes (to match the signature of float_near_ulp_array).
Sometimes, when measuring very small functions, rdtsc is not accurate enough
to get a reliable measurement. This increases the number of runs inside the
inner loop from 4 to 32, which should help a lot. Less important when using
the more precise linux-perf API, but still useful.
There should be no user-visible change since the number of runs is adjusted
to keep the total time spent measuring the same.
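Simplified shape of the change (checkasm's real loop is unrolled and
uses its own timer macros):

    uint64_t t = readtime();
    for (int i = 0; i < runs; i++)
        for (int j = 0; j < 32; j++) /* was 4 calls per iteration */
            func();
    t = readtime() - t;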
This behavior had no real justification and was just incredibly confusing,
since the in/out pointers passed to setup() did not match those passed to
run(), all for what is arguably an exception anyway (the palette setup).
This wrapping logic still considered any nonzero return from the ASM function
to be the overall result, but this has not been true since the
addition of FF_ALPHA_TRANSPARENT.
Fix it by only early returning if FF_ALPHA_STRAIGHT is detected.
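In sketch form (wrapped_asm() stands in for the real wrapper):

    /* Only FF_ALPHA_STRAIGHT may short-circuit; other nonzero
     * values (e.g. FF_ALPHA_TRANSPARENT) are not the final result. */
    int ret = wrapped_asm(td);
    if (ret == FF_ALPHA_STRAIGHT)
        return ret;
    /* otherwise fall through and keep accumulating the result */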
Fixes: 9b8b78a815
See-Also: https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20301#issuecomment-4802
Extract Orientation and export it as a display matrix if present, and set the
frame's metadata with the remaining Exif entries.
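A hedged sketch of the display-matrix half (shown for Exif orientation
6, i.e. a 90° rotation; the real mapping handles all eight orientation
values):

    AVFrameSideData *sd = av_frame_new_side_data(frame,
            AV_FRAME_DATA_DISPLAYMATRIX, sizeof(int32_t) * 9);
    if (sd) /* sign convention per libavutil/display.h */
        av_display_rotation_set((int32_t *) sd->data, -90.0);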
Signed-off-by: James Almer <jamrial@gmail.com>
In hybrid_fragmented mode, the first moov written at the start should
contain an mvex tag, since it's fmp4. When writing the moov for the
second time, it's not fmp4 any more, so mvex should be skipped.
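Sketch of the condition (the actual movenc.c logic differs in detail):

    if (first_moov) /* fmp4: fragments will follow, mvex is required */
        mov_write_mvex_tag(pb, mov);
    /* final moov rewrite: no longer fmp4, so mvex is skipped */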
Set CODECAPI_AVLowLatencyMode when AV_CODEC_FLAG_LOW_DELAY is enabled on
the AVCodecContext. AVLowLatencyMode can achieve lower latency encoding
that may not be accessible using eAVScenarioInfo options alone.
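In sketch form (COM error handling elided; variant type per the
CODECAPI_AVLowLatencyMode documentation):

    if (avctx->flags & AV_CODEC_FLAG_LOW_DELAY) {
        VARIANT v;
        VariantInit(&v);
        v.vt      = VT_BOOL;
        v.boolVal = VARIANT_TRUE;
        ICodecAPI_SetValue(codec_api, &CODECAPI_AVLowLatencyMode, &v);
    }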
Signed-off-by: Cameron Gutman <aicommander@gmail.com>
Currently, the aacencdsp checkasm test fails for many seeds
if the C code has been built with x87 math. This happens because
the excess precision of x87 math can make it end up rounding
to a different integer, and the checkasm test checks that the
output integers match exactly between C and assembly.
One such failing case is "tests/checkasm/checkasm --test=aacencdsp
41" when compiled with GCC. When compiled with Clang, the test
seed 21 produces a failure.
To avoid the issue, we need to limit the precision of intermediates
to their nominal float range, matching the assembly implementations.
This can be achieved when compiling with GCC, by just adding a single
cast.
To observe the effect of this cast, compile the following
snippet,
    int cast(float a, float b) {
        return (int)
    #ifdef CAST
               (float)
    #endif
               (a + b);
    }
with "gcc -m32 -std=c17 -O2", with/without -DCAST. For x86_64
cases (without the "-m32"), the cast doesn't make any difference
on the generated code.
This cast would seem to have no effect, as a binary expression
with float operands already has the type float.
However, if compiling with GCC with -fexcess-precision=standard,
the cast forces limiting the precision according to the language
standard here - according to the GCC docs [1]:
> When compiling C or C++, if -fexcess-precision=standard is
> specified then excess precision follows the rules specified in
> ISO C99 or C++; in particular, both casts and assignments cause
> values to be rounded to their semantic types (whereas -ffloat-store
> only affects assignments). This option is enabled by default for
> C or C++ if a strict conformance option such as -std=c99 or
> -std=c++17 is used.
FFmpeg's configure script enables -std=c17 by default.
This only helps with GCC though - the cast doesn't make any
difference for Clang. (Although upstream Clang seems to default
to SSE math, while Ubuntu-provided Clang defaults to x87 math.)
Limiting the precision with Clang would require casting to volatile
float for both intermediates here - and that does have a code
generation effect on all architectures.
[1] https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html
Using the built-in token seems to only partially work.
And in the latest Forgejo release, the issue that forced the use of
the built-in token was fixed with:
https://codeberg.org/forgejo/forgejo/pulls/9025
Change the PCM table init to use compile-time #if instead of a runtime
`if (CONFIG_...)`. This makes sure code for disabled encoders never gets
built, without depending on compiler dead-code elimination.
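The shape of the change (encoder and table names are illustrative):

    /* before: always compiled, then (hopefully) eliminated as dead */
    if (CONFIG_PCM_ALAW_ENCODER)
        pcm_alaw_tableinit();

    /* after: the call does not exist when the encoder is disabled */
    #if CONFIG_PCM_ALAW_ENCODER
        pcm_alaw_tableinit();
    #endif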
Signed-off-by: YiboFang <fangyibo@xiaomi.com>
Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>
Add the max_frame_size option to support setting the maximum frame
size in bytes. This value caps the size of each encoded frame
produced by the bitrate control algorithm.
Signed-off-by: Tong Wu <wutong1208@outlook.com>
strftime returns 0 in case of an empty output (e.g. the %p format
string with some locales); there is no way to distinguish this from
a buffer-too-small error condition. So we must use some heuristics
to handle this case, and avoid consuming INT_MAX bytes of RAM or
falsely reporting a truncated output.
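One possible shape of such a heuristic (a sketch, not the actual
patch; MAX_EXPANSION is a made-up bound):

    #include <string.h>
    #include <time.h>
    #include "libavutil/mem.h"

    #define MAX_EXPANSION 32 /* hypothetical growth bound per fmt char */

    static char *expand_strftime(const char *fmt, const struct tm *tm)
    {
        size_t size = strlen(fmt) + 1;
        for (;;) {
            char *buf = av_malloc(size);
            if (!buf)
                return NULL;
            if (strftime(buf, size, fmt, tm) > 0)
                return buf;                /* non-empty output */
            if (size >= strlen(fmt) * MAX_EXPANSION + 1) {
                buf[0] = '\0';             /* assume genuinely empty */
                return buf;
            }
            av_free(buf);                  /* too small: grow and retry */
            size *= 2;
        }
    }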
Signed-off-by: Marton Balint <cus@passwd.hu>