Skip to content

refactor: optimize ExprBloomFilter to use references instead of clones #18157

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

dantengsky
Copy link
Member

@dantengsky dantengsky commented Jun 13, 2025

I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/

Summary

Modifies the ExprBloomFilter implementation to accept a reference to BinaryFuse16 instead of taking ownership of the filter. This optimization eliminates unnecessary cloning when applying runtime bloom filters during data deserialization in data sources.

Key changes:

  • Add lifetime parameter to ExprBloomFilter struct
  • Change filter field from owned BinaryFuse16 to a reference
  • Update constructor to accept and store references
  • Remove redundant clone() calls in native and parquet deserializers

Tests

  • Unit Test
  • Logic Test
  • Benchmark Test
  • No Test - Explain why

Type of change

  • Bug Fix (non-breaking change which fixes an issue)
  • New Feature (non-breaking change which adds functionality)
  • Breaking Change (fix or feature that could cause existing functionality not to work as expected)
  • Documentation Update
  • Refactoring
  • Performance Improvement
  • Other (please describe):

This change is Reviewable

… clones

This change modifies the ExprBloomFilter implementation to accept a reference to BinaryFuse16
instead of taking ownership of the filter. This optimization eliminates unnecessary cloning
when applying runtime bloom filters during data deserialization in data sources.

Key changes:
- Add lifetime parameter to ExprBloomFilter struct
- Change filter field from owned BinaryFuse16 to a reference
- Update constructor to accept and store references
- Remove redundant clone() calls in native and parquet deserializers
@dantengsky dantengsky added the ci-benchmark-cloud Benchmark: run only cloud tests for tpch/hits label Jun 13, 2025
@github-actions github-actions bot added the pr-refactor this PR changes the code base without new features or bugfix label Jun 13, 2025
Copy link
Contributor

Docker Image for PR

  • tag: pr-18157-ee3b432-1749801796

note: this image tag is only available for internal use.

Copy link
Contributor

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci-benchmark-cloud Benchmark: run only cloud tests for tpch/hits pr-refactor this PR changes the code base without new features or bugfix
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant