Add image processors refactor to v5 migration guide by yonigozlan · Pull Request #45556 · huggingface/transformers

yonigozlan · 2026-04-21T19:00:48Z

What does this PR do?

As discussed internally @vasqu
Cc @stevhliu

HuggingFaceDocBuilderDev · 2026-04-21T19:11:45Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

stevhliu

thanks for adding!

stevhliu · 2026-04-21T19:55:05Z


+### Image Processors
+
+The old slow/fast dual-file design — a PIL-based `image_processing_<model>.py` paired with a torchvision-based `image_processing_<model>_fast.py` — has been replaced with a named-backend architecture:


Suggested change

The old slow/fast dual-file design — a PIL-based `image_processing_<model>.py` paired with a torchvision-based `image_processing_<model>_fast.py` — has been replaced with a named-backend architecture:

The old slow/fast dual-file design has been replaced with a named-backend architecture. Each model previously had a PIL-based `image_processing_<model>.py` and a torchvision-based `image_processing_<model>_fast.py`. The new layout is:

stevhliu · 2026-04-21T20:02:09Z

+- `image_processing_<model>.py` → **torchvision** backend (default; was previously `FooImageProcessorFast`)
+- `image_processing_pil_<model>.py` → **PIL** backend (was previously `FooImageProcessor`)
+
+Processor classes now inherit from `TorchvisionBackend` or `PilBackend` (defined in `image_processing_backends.py`), which provide ready-made implementations of all standard operations (`resize`, `rescale`, `normalize`, `center_crop`, `pad`) and a default `_preprocess` pipeline. `BaseImageProcessor` (in `image_processing_utils`) handles the shared preprocessing boilerplate — kwargs validation, default-filling from class attributes, and input preparation — so model-specific processors contain only what is genuinely unique to the model. Most processors now simply inherit from a backend and declare class-attribute defaults; only processors with custom logic (e.g. patch tiling) need to override `_preprocess`.


Suggested change

Processor classes now inherit from `TorchvisionBackend` or `PilBackend` (defined in `image_processing_backends.py`), which provide ready-made implementations of all standard operations (`resize`, `rescale`, `normalize`, `center_crop`, `pad`) and a default `_preprocess` pipeline. `BaseImageProcessor` (in `image_processing_utils`) handles the shared preprocessing boilerplate — kwargs validation, default-filling from class attributes, and input preparation — so model-specific processors contain only what is genuinely unique to the model. Most processors now simply inherit from a backend and declare class-attribute defaults; only processors with custom logic (e.g. patch tiling) need to override `_preprocess`.

Processor classes now inherit from `TorchvisionBackend` or `PilBackend` (defined in `image_processing_backends.py`), which provide ready-made implementations of all standard operations (`resize`, `rescale`, `normalize`, `center_crop`, `pad`) and a default `_preprocess` pipeline. `BaseImageProcessor` (in `image_processing_utils`) handles shared preprocessing boilerplate: kwargs validation, default-filling from class attributes, and input preparation. Model-specific processors contain only what is unique to the model. Most processors inherit from a backend and declare class-attribute defaults. Only those with custom logic (e.g. patch tiling) need to override `_preprocess`.

stevhliu · 2026-04-21T20:03:54Z

 - Minor change: `XXXFastImageProcessorKwargs` is removed in favor of `XXXImageProcessorKwargs` which will be shared between fast and slow processors (https://github.com/huggingface/transformers/pull/40931)


+### Image Processors


Suggested change

### Image Processors

### Image processors

Add image processors refactor to v5 migration guide

eb9c837

stevhliu approved these changes Apr 21, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add image processors refactor to v5 migration guide#45556

Add image processors refactor to v5 migration guide#45556
yonigozlan wants to merge 1 commit intohuggingface:mainfrom
yonigozlan:add-im-proc-refactor-mig-guide

yonigozlan commented Apr 21, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 21, 2026

Uh oh!

stevhliu left a comment

Uh oh!

stevhliu Apr 21, 2026

Uh oh!

stevhliu Apr 21, 2026

Uh oh!

stevhliu Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		### Image Processors

		The old slow/fast dual-file design — a PIL-based `image_processing_<model>.py` paired with a torchvision-based `image_processing_<model>_fast.py` — has been replaced with a named-backend architecture:

	The old slow/fast dual-file design — a PIL-based `image_processing_<model>.py` paired with a torchvision-based `image_processing_<model>_fast.py` — has been replaced with a named-backend architecture:
	The old slow/fast dual-file design has been replaced with a named-backend architecture. Each model previously had a PIL-based `image_processing_<model>.py` and a torchvision-based `image_processing_<model>_fast.py`. The new layout is:

		- Minor change: `XXXFastImageProcessorKwargs` is removed in favor of `XXXImageProcessorKwargs` which will be shared between fast and slow processors (https://github.com/huggingface/transformers/pull/40931)


		### Image Processors

Conversation

yonigozlan commented Apr 21, 2026

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Apr 21, 2026

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 21, 2026

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 21, 2026

Choose a reason for hiding this comment

Uh oh!

stevhliu Apr 21, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants