Extract DeletionVector logic from PuffinFile #3491

Open
ebyhr wants to merge 1 commit into apache:main from ebyhr:ebi/puffin-refactoring

ebyhr (Member) commented Jun 13, 2026

Rationale for this change

PuffinFile currently has two responsibilities: parsing the Puffin format (magic bytes, footer, blobs) and deletion vector domain logic (bitmap deserialization and PyArrow conversion).
Keeping both in one class will become problematic when we add support for the NDV blob type apache-datasketches-theta-v1, because that blob needs its own decoding logic.
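
To illustrate the intended separation, here is a minimal sketch, not the code in this PR. All class and method names (`Blob`, `BlobContainer`, `ToyDeletionVector`, `blobs_of_type`, `from_blob`) are hypothetical, and the payload encoding is a toy stand-in (little-endian 64-bit integers) rather than the real bitmap serialization. The point is only that the container looks up blobs by type while each blob type owns its own decoding.

```python
from __future__ import annotations

import struct
from dataclasses import dataclass


@dataclass(frozen=True)
class Blob:
    """Hypothetical raw blob: a type tag plus undecoded bytes."""

    blob_type: str
    payload: bytes


class BlobContainer:
    """Format layer (hypothetical): holds blobs and knows nothing about their meaning."""

    def __init__(self, blobs: list[Blob]) -> None:
        self._blobs = list(blobs)

    def blobs_of_type(self, blob_type: str) -> list[Blob]:
        # Filter by type tag only; no decoding happens here.
        return [b for b in self._blobs if b.blob_type == blob_type]


class ToyDeletionVector:
    """Domain layer (hypothetical): decodes exactly one blob type."""

    BLOB_TYPE = "deletion-vector-v1"

    def __init__(self, positions: list[int]) -> None:
        self.positions = positions

    @classmethod
    def from_blob(cls, blob: Blob) -> ToyDeletionVector:
        if blob.blob_type != cls.BLOB_TYPE:
            raise ValueError(f"unexpected blob type: {blob.blob_type}")
        # Toy encoding (assumption): consecutive little-endian unsigned 64-bit ints.
        count = len(blob.payload) // 8
        return cls(list(struct.unpack(f"<{count}Q", blob.payload)))


container = BlobContainer(
    [
        Blob("deletion-vector-v1", struct.pack("<3Q", 1, 5, 9)),
        Blob("apache-datasketches-theta-v1", b"\x00"),  # ignored by the decoder below
    ]
)
vectors = [
    ToyDeletionVector.from_blob(b)
    for b in container.blobs_of_type(ToyDeletionVector.BLOB_TYPE)
]
```

With this shape, supporting another blob type means adding another decoder class next to `ToyDeletionVector`, without touching the container.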

Are these changes tested?

Yes

Are there any user-facing changes?

Yes. Users of the PuffinFile class now need to call DeletionVector.
