[common] Introduce MAP type for ARROW, COMPACTED and INDEXED formats #2190

XuQianJin-Stars · 2025-12-17T06:34:30Z

Purpose

Linked issue: close #1973

Introduce MAP type support for INDEXED format, achieving feature parity with ARROW and COMPACTED formats for MAP data type handling.

Brief change log

Added readMap() method to IndexedRowReader for MAP type deserialization, following the same pattern as readArray()
Removed UnsupportedOperationException for MAP type in IndexedRowReader.createFieldReader()
Added missing InternalMap import to IndexedRowReader
Updated BinaryWriterTest.testCreateValueSetterForMapThrowsException() to testValueSetterWithMapType() - changed from exception test to positive validation test
Extended IndexedRowTest.assertAllTypeEquals() to include MAP type assertions (validates size, key array, value array, and null handling)
Extended IndexedRowReaderTest.assertAllTypeEqualsForReader() to verify MAP reading

Tests

Unit Tests:

IndexedRowTest (7 tests) - Validates MAP read/write operations with IndexedRow
IndexedRowReaderTest (2 tests) - Validates MAP deserialization with IndexedRowReader
BinaryWriterTest (18 tests) - Validates MAP type ValueWriter creation and usage
ArrowReaderWriterTest (2 tests) - Confirms ARROW format MAP support continues to work
CompactedRowTest (20 tests) - Confirms COMPACTED format MAP support continues to work

Test Results: All 49 tests passed successfully (0 failures, 0 errors, 0 skipped)

Test Command:
mvn test -pl fluss-common -Dtest=IndexedRowTest,IndexedRowReaderTest,BinaryWriterTest,ArrowReaderWriterTest,CompactedRowTest -Dcheckstyle.skip=true -Dspotless.check.skip=true### API and Format

API: No breaking changes. This is purely additive functionality that removes an UnsupportedOperationException and enables existing MAP type infrastructure for INDEXED format.

Storage Format: No format changes. The MAP storage format in INDEXED rows was already defined and uses the same BinaryMap serialization mechanism as other formats. This change only enables reading MAP data that was already writeable.

Documentation

No documentation changes required. This change brings INDEXED format to feature parity with ARROW and COMPACTED formats for MAP type support. MAP type usage is already documented in existing Fluss documentation. The change completes internal implementation of existing functionality rather than introducing new user-facing features.

Copilot

Pull request overview

This PR introduces MAP type support for the INDEXED row format, achieving feature parity with ARROW and COMPACTED formats. The implementation adds MAP serialization/deserialization capabilities across the Fluss common layer, Flink integration, and lake integrations (Paimon, Iceberg placeholder).

Key Changes:

Added MAP type support to IndexedRow format with IndexedRowReader.readMap() and IndexedRowWriter.writeMap() methods
Introduced GenericMap, BinaryMap, and format-specific map implementations (IndexedMap, CompactedMap, AlignedMap) with corresponding serializers
Extended Flink, Paimon, and Arrow integrations to handle MAP type conversions and vector operations

Reviewed changes

Copilot reviewed 66 out of 66 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`IndexedRowReader.java`	Added `readMap()` method and MAP case in field reader factory
`IndexedRowWriter.java`	Added `writeMap()` method for MAP serialization
`BinaryMap.java`	New binary representation of MAP type with key/value arrays
`GenericMap.java`	Generic MAP implementation wrapping Java maps
`MapSerializer.java`	Serializer for converting InternalMap to BinaryMap
`FlinkAsFlussMap.java`	Adapter for Flink MapData to Fluss InternalMap
`PaimonMapAsFlussMap.java`	Adapter for Paimon map to Fluss InternalMap
`ArrowMapWriter.java` / `ArrowMapColumnVector.java`	Arrow format MAP support
Various test files	Comprehensive test coverage for MAP operations

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-26T13:53:35Z

fluss-common/src/main/java/org/apache/fluss/row/BinaryArray.java

            Object arr, int offset, int length, int elementSize) {
        final long headerInBytes = calculateHeaderInBytes(length);
-        final long valueRegionInBytes = elementSize * length;
+        final long valueRegionInBytes = (long) elementSize * length;


The cast to (long) is applied only to elementSize, not to the entire multiplication result. This could still cause integer overflow when elementSize * length exceeds Integer.MAX_VALUE before the cast is applied. The cast should wrap the entire multiplication expression: (long) elementSize * length.

Copilot · 2025-12-26T13:53:35Z

fluss-common/src/main/java/org/apache/fluss/row/GenericMap.java

+
+        java.util.Map<Object, Object> javaMap = new java.util.LinkedHashMap<>();
+        for (int i = 0; i < values.length; i += 2) {
+            javaMap.put(values[i], (i + 1 < values.length) ? values[i + 1] : null);


Test is always true, because of this condition.

Suggested change

javaMap.put(values[i], (i + 1 < values.length) ? values[i + 1] : null);

javaMap.put(values[i], values[i + 1]);

Copilot · 2025-12-26T13:53:36Z

fluss-common/src/main/java/org/apache/fluss/row/BinaryMap.java

+ * @since 0.9
+ */
+@PublicEvolving
+public class BinaryMap extends BinarySection implements InternalMap {


Class BinaryMap overrides hashCode but not equals.

XuQianJin-Stars force-pushed the feature/issue-1973-support-map-format branch 3 times, most recently from 26bf237 to ed1f8fc Compare December 24, 2025 15:57

[common] Introduce MAP type for ARROW, COMPACTED and INDEXED formats

a7642db

XuQianJin-Stars force-pushed the feature/issue-1973-support-map-format branch from ed1f8fc to a7642db Compare December 25, 2025 02:28

This was referenced Dec 25, 2025

Support Map type in KV table #1976

Open

[Feature] Support for Arrow map data type #834

Open

wuchong added the priority=critical label Dec 26, 2025

wuchong requested a review from Copilot December 26, 2025 13:44

Copilot started reviewing on behalf of wuchong December 26, 2025 13:45 View session

Copilot AI reviewed Dec 26, 2025

View reviewed changes

fix some comments

d32c901

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[common] Introduce MAP type for ARROW, COMPACTED and INDEXED formats #2190

[common] Introduce MAP type for ARROW, COMPACTED and INDEXED formats #2190

Uh oh!

XuQianJin-Stars commented Dec 17, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 26, 2025

Uh oh!

Copilot AI Dec 26, 2025

Uh oh!

Copilot AI Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	javaMap.put(values[i], (i + 1 < values.length) ? values[i + 1] : null);
	javaMap.put(values[i], values[i + 1]);

[common] Introduce MAP type for ARROW, COMPACTED and INDEXED formats #2190

Are you sure you want to change the base?

[common] Introduce MAP type for ARROW, COMPACTED and INDEXED formats #2190

Uh oh!

Conversation

XuQianJin-Stars commented Dec 17, 2025

Purpose

Brief change log

Tests

Documentation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants