Remove special mapping of `auto` to {} in `open_zarr` #11010

jsignell · 2025-12-12T17:35:43Z

- Closes #11002 (Inconsistent chunking between `xr.open_zarr` and `xr.open_dataset(..., engine='zarr')` with `chunks="auto"`)
- Tests added
- User visible changes (including notable bug fixes) are documented in `whats-new.rst`

This PR makes the handling of `chunks="auto"` consistent between `open_zarr` and `open_dataset(..., engine="zarr")`.

The two still differ in their defaults: `open_zarr` defaults to `chunks="auto"` and uses a chunk manager (i.e. dask) when one is available in your environment, whereas `open_dataset` defaults to `chunks=None` (i.e. no chunks).

dcherian · 2025-12-12T22:29:08Z

doc/whats-new.rst

 Breaking Changes
 ~~~~~~~~~~~~~~~~

+- Remove special mapping of ``"auto"`` to ``{}`` in ``open_zarr``. This matches the behavior of ``open_dataset(..., engine="zarr")``


Someone is going to complain, but I think this is an improvement:

1. It removes one source of confusion.
2. In general, you do want a multiple of the on-disk chunks.
3. Users can still pass `{}` to get the previous behaviour.

^ Perhaps we can call out that third point in the release note.

Also, since this could be pretty breaking I'd like us to get a few more thumbs up on it.

> since this could be pretty breaking I'd like us to get a few more thumbs up on it.

Yeah absolutely.

I am wondering if it's worth trying to improve the `"auto"` logic to make it less willing to split chunks before we merge this. If `"auto"` never split chunks, I think this would be a much less breaking change. As it currently stands, `"auto"` prefers to keep existing chunk boundaries, but it isn't strict about it. I would kind of like something that is logically `max({}, "auto")`, but maybe that should have a different name?
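One way to read the `max({}, "auto")` idea is sketched below in plain Python. This is not how xarray or dask currently pick `"auto"` chunks; the helper name `grow_without_splitting` and its three arguments (per-dimension on-disk chunk sizes, the `"auto"` suggestion, and the array shape) are assumptions made for illustration only.

```python
def grow_without_splitting(disk_chunks, auto_chunks, shape):
    """Hypothetical helper: pick chunk sizes that never split an on-disk chunk.

    Per dimension, take the larger of the on-disk size and the "auto"
    suggestion, round it down to a whole multiple of the on-disk size,
    and cap it at the dimension length.
    """
    chosen = []
    for disk, auto, size in zip(disk_chunks, auto_chunks, shape):
        target = max(disk, auto)           # never smaller than the on-disk chunk
        aligned = (target // disk) * disk  # align to on-disk chunk boundaries
        chosen.append(min(aligned, size))  # a chunk cannot exceed the axis
    return tuple(chosen)


# "auto" wants 25 x 5, the store has 10 x 20 chunks: grow x, keep y intact.
print(grow_without_splitting((10, 20), (25, 5), (100, 100)))  # (20, 20)
```

Under this reading, `"auto"` could only merge whole on-disk chunks, which is the stricter behaviour the comment asks for.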

> Someone is going to complain

😏

Out of laziness I have a lot of code that assumes `open_zarr(path)` is shorthand for `open_dataset(path, engine="zarr", chunks={})`.

Fine, that can be changed, although a deprecation cycle would be appreciated.

But in my opinion, auto-magic chunking as defined by some hidden logic in the chunk manager is not very predictable and therefore not great default behavior. We should trust that the user had intent in storing the data with the chunk structure they did, and explicitly opt in to modifying that rather than the other way around. Even more so if `"auto"` doesn't even respect chunk boundaries.

I never use "auto" so I don't know the extent to which it does or does not preserve the on-disk chunking. If it usually leaves them alone, then maybe not a big issue.

This should be easy to resolve: set `chunks=None` as the default instead of `"auto"`, interpret it as `{}` for the duration of the deprecation cycle, and once the cycle is done switch it to mean the same as `chunks=None` in `open_dataset` (which is no dask/cubed chunks).
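A rough sketch of that deprecation path, in plain Python, might look like the following. The function name `resolve_default_chunks`, the `cycle_finished` flag, and the warning text are all hypothetical; nothing here is taken from the xarray code base.

```python
import warnings


def resolve_default_chunks(chunks=None, cycle_finished=False):
    """Hypothetical sketch of the proposed default handling for open_zarr."""
    if chunks is not None:
        # An explicit request ("auto", {}, a dict of sizes, ...) passes through.
        return chunks
    if cycle_finished:
        # After the cycle: same meaning as in open_dataset, i.e. no chunking.
        return None
    # During the cycle: keep the old on-disk chunking, but warn about the change.
    warnings.warn(
        "The default chunks for open_zarr will change; pass chunks={} to keep "
        "on-disk chunking.",
        FutureWarning,
        stacklevel=2,
    )
    return {}
```

With this shape, callers who already pass `"auto"` or `{}` never see the warning; only code relying on the implicit default does.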

Anyone that specifies "auto" today is asking for auto-magical behaviour, and should be mostly unaffected I hope. Or at least, I am OK with that breakage.

Yeah that seems good, and could pave the way to remove open_zarr entirely. Although that's been rejected before (#7496), and there's the asymmetry of specialized to_ methods for each file type and a single generic open_ method.

Remove special mapping of auto to {} in open_zarr

e7015da

github-actions bot added the topic-backends, topic-zarr, and io labels Dec 12, 2025

dcherian approved these changes Dec 12, 2025

jsignell self-assigned this Dec 12, 2025

jsignell added 2 commits December 12, 2025 13:57

Add test to prove that auto can differ from on-disk

77c1189

Update what's new

797f1b4

jsignell marked this pull request as ready for review December 12, 2025 19:10

dcherian reviewed Dec 12, 2025

dcherian Dec 12, 2025

jsignell Dec 15, 2025

slevang Dec 15, 2025

dcherian Dec 15, 2025 (edited)

slevang Dec 15, 2025

3 participants
