fix(#25764): Implement UTF-8 encoding standardization for CSV import/export #27409
Darshan3690 wants to merge 19 commits into open-metadata:main
Conversation
## Overview

Resolve Chinese character garbling in CSV import/export workflows by implementing end-to-end UTF-8 encoding standardization across backend REST endpoints and frontend file handling.

## Root Causes Fixed

1. Missing `charset=UTF-8` declarations on the CSV transport layer (HTTP headers)
2. No UTF-8 BOM handling for Windows Excel compatibility
3. Inconsistent encoding across 9+ independent resource classes
4. Browser `FileReader` lacking an explicit encoding specification
5. No UTF-8 BOM prepended to CSV downloads

## Changes Implemented

### Backend (11 files)

**CSV Utility (`CsvUtil.java`)**
- Added `UTF8_BOM` constant (`\uFEFF`)
- Added `stripUtf8Bom(String value)` utility method for safe BOM removal
- Handles null, empty string, and multi-byte character scenarios

**Shared Import Flow (`EntityResource.java`)**
- Imported the `CsvUtil` dependency
- Normalize CSV input by stripping the BOM before repository parsing
- Applied to all entity types (Table, Glossary, Team, User, TestCase, etc.)

**REST Endpoints (9 resource files)**
- `ColumnResource.java`: updated 3 `@Produces`/`@Consumes` annotations
- `TableResource.java`: updated 4 annotations (export, async export, import, async import)
- `UserResource.java`: updated 3 annotations
- `TeamResource.java`: updated 4 annotations
- `TestCaseResource.java`: updated 3 annotations
- `GlossaryResource.java`: updated 4 annotations
- `GlossaryTermResource.java`: updated 4 annotations
- `LineageResource.java`: updated 1 annotation (export)
- All changed from `TEXT_PLAIN` to `TEXT_PLAIN + "; charset=UTF-8"`
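The BOM-stripping semantics described for `CsvUtil.java` can be sketched as follows. This is an illustrative TypeScript mirror of the described behavior (null/empty passthrough, single leading BOM removed), not the PR's actual Java code.

```typescript
// Illustrative sketch of the described stripUtf8Bom helper; names mirror
// the PR's Java utility but this TypeScript version is not from the PR.
const UTF8_BOM = '\uFEFF';

function stripUtf8Bom(value: string | null | undefined): string | null | undefined {
  // Null and empty inputs pass through unchanged, as the description requires.
  if (!value) {
    return value;
  }
  // Only a single leading BOM is removed; a BOM anywhere else is data.
  return value.startsWith(UTF8_BOM) ? value.slice(1) : value;
}
```

Stripping only the first code unit is safe here because the BOM is exactly one UTF-16 code unit, so multi-byte characters after it are untouched.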
Hi there 👋 Thanks for your contribution! The OpenMetadata team will review the PR shortly! Once it has been labeled as Let us know if you need any help!
Hi @harshach, please add the safe to test label.
Pull request overview
Implements end-to-end UTF-8 handling for CSV import/export to prevent non-ASCII (e.g., Chinese) character corruption by standardizing charset usage across UI requests, backend endpoints, and CSV parsing/downloading.
Changes:
- Standardize CSV import request encoding (UI sends `text/plain; charset=UTF-8`; backend consumes/produces UTF-8 explicitly).
- Add UTF-8 BOM handling (backend strips the BOM on import; UI prepends a BOM to CSV downloads for Excel compatibility).
- Extend automated coverage (Java unit tests, Jest tests, and a Playwright E2E scenario with Chinese content).
Reviewed changes
Copilot reviewed 21 out of 22 changed files in this pull request and generated 7 comments.
Show a summary per file
| File | Description |
|---|---|
| openmetadata-ui/src/main/resources/ui/src/utils/Export/ExportUtils.ts | Prepends BOM and enforces CSV MIME type for downloads to improve Excel UTF-8 handling. |
| openmetadata-ui/src/main/resources/ui/src/utils/Export/ExportUtils.test.tsx | Updates/adds tests for BOM behavior on CSV vs non-CSV downloads. |
| openmetadata-ui/src/main/resources/ui/src/rest/teamsAPI.ts | Adds UTF-8 charset to CSV import request headers for team/user imports. |
| openmetadata-ui/src/main/resources/ui/src/rest/tableAPI.ts | Adds UTF-8 charset to CSV import request headers for table import. |
| openmetadata-ui/src/main/resources/ui/src/rest/importExportAPI.ts | Adds UTF-8 charset to CSV import request headers for multiple entity import APIs. |
| openmetadata-ui/src/main/resources/ui/src/rest/importExportAPI.test.ts | Updates assertions to validate UTF-8 charset headers in import requests. |
| openmetadata-ui/src/main/resources/ui/src/rest/databaseAPI.ts | Adds UTF-8 charset to CSV import request headers for database/schema imports. |
| openmetadata-ui/src/main/resources/ui/src/rest/columnAPI.ts | Adds UTF-8 charset to CSV import request headers for column CSV import APIs. |
| openmetadata-ui/src/main/resources/ui/src/components/UploadFile/UploadFile.tsx | Forces FileReader.readAsText(..., 'utf-8') for CSV uploads. |
| openmetadata-ui/src/main/resources/ui/playwright/e2e/Pages/GlossaryImportExport.spec.ts | Adds Chinese glossary term data to validate E2E import/export behavior. |
| openmetadata-service/src/test/java/org/openmetadata/csv/CsvUtilTest.java | Adds unit tests for BOM stripping and Chinese character preservation in CSV formatting. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/teams/UserResource.java | Adds UTF-8 charset to CSV import/export endpoint annotations for users. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/teams/TeamResource.java | Adds UTF-8 charset to CSV import/export endpoint annotations for teams. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/lineage/LineageResource.java | Adds UTF-8 charset to lineage CSV export endpoint annotation. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/glossary/GlossaryTermResource.java | Adds UTF-8 charset to glossary term CSV import/export endpoint annotations. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/glossary/GlossaryResource.java | Adds UTF-8 charset to glossary CSV import/export endpoint annotations. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/dqtests/TestCaseResource.java | Adds UTF-8 charset to test case CSV import/export endpoint annotations. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/databases/TableResource.java | Adds UTF-8 charset to table CSV import/export endpoint annotations. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/columns/ColumnResource.java | Adds UTF-8 charset to column CSV import endpoint annotations. |
| openmetadata-service/src/main/java/org/openmetadata/service/resources/EntityResource.java | Centralizes BOM stripping for entity CSV imports via CsvUtil.stripUtf8Bom(...). |
| openmetadata-service/src/main/java/org/openmetadata/csv/CsvUtil.java | Introduces UTF-8 BOM constant and helper to strip BOM from imported CSV strings. |
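The download-side behavior summarized for `ExportUtils.ts` above (BOM prepend plus a CSV MIME type) can be sketched roughly as below. The function name, return shape, and MIME string are illustrative assumptions, not the file's real exports.

```typescript
// Hedged sketch of BOM-prepending for CSV downloads, per the table above.
// toCsvBlobParts is an illustrative name; the real code builds a Blob directly.
const UTF8_BOM = '\uFEFF';
const CSV_MIME = 'text/csv; charset=utf-8';

function toCsvBlobParts(content: string, isCsv: boolean): { parts: string[]; type: string } {
  if (!isCsv) {
    // Non-CSV downloads are left untouched.
    return { parts: [content], type: 'text/plain' };
  }
  // Guard against double-prepending when the content already starts with a BOM
  // (a bug flagged later in this review thread).
  const body = content.startsWith(UTF8_BOM) ? content : UTF8_BOM + content;
  return { parts: [body], type: CSV_MIME };
}
```

The BOM is what lets Windows Excel detect UTF-8 when opening the downloaded CSV; without it, Excel tends to assume a legacy codepage and garbles Chinese text.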
Comments suppressed due to low confidence (6)
openmetadata-ui/src/main/resources/ui/src/components/UploadFile/UploadFile.tsx:49
`setUploading(false)` runs in the `finally` block immediately after `readAsText(...)` is initiated, but `FileReader` completes asynchronously. This means the loader state will be cleared before `onload`/`onerror` fires (and errors thrown inside `reader.onerror` won't be caught by this try/catch). Move the `setUploading(false)` into `reader.onloadend` (or `onload`/`onerror`) and surface errors via the callback/rejection rather than throwing a string in an async handler.
```typescript
setUploading(true);
try {
  const reader = new FileReader();
  reader.onload = onCSVUploaded;
  reader.onerror = () => {
    throw t('server.unexpected-error');
  };
  reader.readAsText(options.file as Blob, 'utf-8');
} catch (error) {
  showErrorToast(error as AxiosError);
} finally {
  setUploading(false);
}
```
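The fix the reviewer suggests can be sketched as a promise-based wrapper that clears the loading flag in `onloadend` and reports failures through rejection. `ReaderLike` is an illustrative stand-in for the browser `FileReader` so the pattern is shown without a DOM; all names here are assumptions, not the component's actual API.

```typescript
// Minimal sketch of the suggested lifecycle fix, under the assumption that a
// FileReader-shaped object is injected (ReaderLike) so it can run headless.
interface ReaderLike {
  onload: ((result: string) => void) | null;
  onerror: ((err: Error) => void) | null;
  onloadend: (() => void) | null;
  readAsText(blob: unknown, encoding: string): void;
}

function readCsvAsText(
  file: unknown,
  reader: ReaderLike,
  setUploading: (v: boolean) => void
): Promise<string> {
  setUploading(true);
  return new Promise<string>((resolve, reject) => {
    reader.onload = (result) => resolve(result);
    reader.onerror = (err) => reject(err);
    // onloadend fires after either onload or onerror, so the loader flag
    // tracks the real (asynchronous) read lifecycle, not the sync call.
    reader.onloadend = () => setUploading(false);
    // Explicit 'utf-8' matches the PR's change to force the read encoding.
    reader.readAsText(file, 'utf-8');
  });
}
```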
openmetadata-service/src/main/java/org/openmetadata/service/resources/teams/TeamResource.java:755
The sync `exportCsv(...)` endpoint produces plain-text CSV, but the `@ApiResponse` content is still declared as `application/json`. This makes the generated OpenAPI spec incorrect for clients. Update the response `@Content(mediaType = ...)` to `text/plain` (or `text/csv`) to match what is actually returned.
```java
@GET
@Path("/name/{name}/export")
@Produces({MediaType.TEXT_PLAIN + "; charset=UTF-8"})
@Valid
@Operation(
    operationId = "exportTeams",
    summary = "Export teams in CSV format",
    responses = {
      @ApiResponse(
          responseCode = "200",
          description = "Exported csv with teams information",
          content =
              @Content(
                  mediaType = "application/json",
                  schema = @Schema(implementation = String.class)))
```
openmetadata-service/src/main/java/org/openmetadata/service/resources/glossary/GlossaryResource.java:575
The sync `exportCsv(...)` endpoint returns CSV (`String`) and is annotated as `@Produces(text/plain; charset=UTF-8)`, but the `@ApiResponse` still declares `application/json`. Adjust the documented response media type to `text/plain` (or `text/csv`) so generated clients don't try to parse JSON.
```java
@GET
@Path("/name/{name}/export")
@Produces({MediaType.TEXT_PLAIN + "; charset=UTF-8"})
@Valid
@Operation(
    operationId = "exportGlossary",
    summary = "Export glossary in CSV format",
    responses = {
      @ApiResponse(
          responseCode = "200",
          description = "Exported csv with glossary terms",
          content =
              @Content(
                  mediaType = "application/json",
                  schema = @Schema(implementation = String.class)))
```
openmetadata-service/src/main/java/org/openmetadata/service/resources/databases/TableResource.java:622
The sync `exportCsv(...)` endpoint returns plain-text CSV but its `@ApiResponse` still advertises `application/json`. This makes the OpenAPI spec inaccurate for CSV consumers. Update the documented response `@Content(mediaType = ...)` to `text/plain` (or `text/csv`).
```java
@GET
@Path("/name/{name}/export")
@Produces({MediaType.TEXT_PLAIN + "; charset=UTF-8"})
@Valid
@Operation(
    operationId = "exportTable",
    summary = "Export table in CSV format",
    responses = {
      @ApiResponse(
          responseCode = "200",
          description = "Exported csv with columns from the table",
          content =
              @Content(
                  mediaType = "application/json",
                  schema = @Schema(implementation = String.class)))
    })
```
openmetadata-service/src/main/java/org/openmetadata/service/resources/lineage/LineageResource.java:416
`exportLineage(...)` is annotated to produce plain text, but the OpenAPI `@ApiResponse` is documented as returning a `SearchResponse` JSON payload. Since the method returns a CSV `String`, update the documented response content/media type to `text/plain` (or `text/csv`) to avoid generating incorrect clients.
```java
@GET
@Path("/export")
@Produces({MediaType.TEXT_PLAIN + "; charset=UTF-8"})
@Operation(
    operationId = "exportLineage",
    summary = "Export lineage",
    responses = {
      @ApiResponse(
          responseCode = "200",
          description = "search response",
          content =
              @Content(
                  mediaType = "application/json",
                  schema = @Schema(implementation = SearchResponse.class)))
    })
```
openmetadata-service/src/main/java/org/openmetadata/service/resources/teams/UserResource.java:1701
`exportUsersCsv(...)` is annotated as producing plain text, but the OpenAPI `@ApiResponse` content is still declared as `application/json`. This makes the generated spec misleading for CSV consumers. Update the documented response `@Content(mediaType = ...)` to `text/plain` (or `text/csv`) to match the actual response body.
```java
@GET
@Path("/export")
@Produces({MediaType.TEXT_PLAIN + "; charset=UTF-8"})
@Valid
@Operation(
    operationId = "exportUsers",
    summary = "Export users in a team in CSV format",
    responses = {
      @ApiResponse(
          responseCode = "200",
          description = "Exported csv with user information",
          content =
              @Content(
                  mediaType = "application/json",
                  schema = @Schema(implementation = String.class)))
    })
```
Hi @harshach @PubChimps @pmbrull, please add the safe to test label.
Pull request overview
Copilot reviewed 21 out of 22 changed files in this pull request and generated 2 comments.
Comments suppressed due to low confidence (1)
openmetadata-ui/src/main/resources/ui/src/components/UploadFile/UploadFile.tsx:48
`setUploading(false)` runs in the `finally` block immediately after calling `FileReader.readAsText(...)`, but `FileReader` is asynchronous. This makes the loader state inaccurate (it will flip back to false before `onload`/`onerror` fires). Move `setUploading(false)` into the `onload` and `onerror` handlers (and call `options.onSuccess`/`onError` if needed) so the UI reflects the actual read lifecycle.
```typescript
  reader.readAsText(options.file as Blob, 'utf-8');
} catch (error) {
  showErrorToast(error as AxiosError);
} finally {
  setUploading(false);
```
```typescript
displayName: '中文术语展示名',
description: '这是用于验证导入导出编码的中文描述。',
synonyms: '中文同义词;测试',
references: '参考;https://example.com/中文',
```
`references` includes a URL with raw non-ASCII characters (`https://example.com/中文`). In the GlossaryTerm schema, `termReference.endpoint` is `format: uri`, so validators may reject IRIs that are not RFC3986-encoded. To avoid a flaky/invalid test while still exercising Chinese text, keep Chinese in the reference name and percent-encode the URL path (or use an ASCII-only URL).
```diff
- references: '参考;https://example.com/中文',
+ references: '参考;https://example.com/%E4%B8%AD%E6%96%87',
```
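The reviewer's suggested encoding can be produced with the standard `encodeURIComponent`, which RFC3986-encodes the non-ASCII path segment while the surrounding reference name stays readable Chinese. The helper name here is illustrative.

```typescript
// Percent-encode a non-ASCII path segment so `format: uri` validators accept it.
// encodeReferenceUrl is an illustrative helper, not part of the PR.
function encodeReferenceUrl(base: string, segment: string): string {
  return base + encodeURIComponent(segment);
}

// encodeReferenceUrl('https://example.com/', '中文')
// yields the reviewer's suggested 'https://example.com/%E4%B8%AD%E6%96%87'
// because 中 is UTF-8 E4 B8 AD and 文 is E6 96 87.
```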
```diff
- expect(MockBlob).toHaveBeenCalledWith(['content'], {
+ expect(MockBlob).toHaveBeenCalledWith(['\uFEFFcontent'], {
    type: 'text/csv;charset=utf-8;',
  });
```
These tests overwrite `global.Blob` but never restore it. `jest.restoreAllMocks()` won't revert direct assignments, so the mocked `Blob` can leak into later tests/files and cause hard-to-debug failures. Capture the original `global.Blob` and restore it in `afterEach` (or use a spy/mocking approach that is automatically restored).
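The capture-and-restore pattern the comment asks for can be sketched generically as below. This is an assumption-level illustration (in a real Jest suite the capture would live in `beforeEach`/`afterEach`); the helper name and shape are not from the PR.

```typescript
// Restore a directly assigned global after a test body runs, since
// jest.restoreAllMocks() does not undo plain assignments to globals.
const g = globalThis as Record<string, unknown>;

function withMockedGlobal<T>(key: string, mock: unknown, run: () => T): T {
  const original = g[key];
  const hadKey = key in g;
  g[key] = mock;
  try {
    return run();
  } finally {
    // Put back exactly what was there before, including "not defined at all".
    if (hadKey) {
      g[key] = original;
    } else {
      delete g[key];
    }
  }
}
```

Tracking `hadKey` matters: restoring `undefined` is not the same as deleting the key, and a leftover `Blob` key could still shadow environment-provided globals in later test files.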
@Darshan3690 this requires a lot of test coverage.
Okay, I will update the PR; please add the safe to test label.
Hi @harshach @PubChimps, please add the safe to test label.
Code Review ✅ Approved — 2 resolved / 2 findings

Standardizes UTF-8 encoding for CSV imports by removing the redundant BOM double-prepend and cleaning up the accidentally committed large debug.json artifact.

- ✅ Quality: Accidentally committed 101K-line debug.json test artifact (resolved)
- ✅ Bug: BOM may be double-prepended if content already contains one (resolved)
@harshach @PubChimps @pmbrull, please review this PR.



PR Summary: Fix Chinese Character Garbling in CSV Import/Export (#25764)
Issue
Chinese and other non-ASCII characters were getting garbled during CSV import and export flows.
Root Cause
Encoding was not consistently enforced across the full pipeline.
What Changed
Backend updates
- Added `UTF8_BOM` constant and `stripUtf8Bom()` method
- Changed `@Produces` to `APPLICATION_JSON` where endpoints actually return JSON (TableResource, GlossaryResource, GlossaryTermResource)

Frontend updates
- CSV download MIME type set to `text/csv; charset=utf-8` (removed trailing semicolon)

Playwright updates
- Chinese glossary term test data: name `术语{uuid}`, display name `中文术语展示名`, description `这是用于验证导入导出编码的中文描述。`, synonyms `中文同义词;测试`
- Reference URL percent-encoded as `https://example.com/%E4%B8%AD%E6%96%87` (avoids URI validation errors)

Cleanup updates
- Removed the accidentally committed debug.json and added it to .gitignore to prevent future re-commit

Copilot Review Comment Fixes (Completed)
- `text/csv;charset=utf-8;` → `text/csv; charset=utf-8` (RFC-compliant)
- `@Produces(TEXT_PLAIN + "; charset=UTF-8")` → `@Produces(APPLICATION_JSON)` in: TableResource, GlossaryResource, GlossaryTermResource
Based on review comments, this PR includes:
Validation Status
✅ Passed Locally
✅ Test Coverage Added
✅ Code Quality
📝 Environment Note
Integration-test module compile/run requires local snapshot artifacts. All code updates are complete and validated. Full integration module execution depends on snapshot dependencies being available in CI or a fully bootstrapped local build.
Files Modified
Compatibility and Risk
Impact
CSV import/export now reliably preserves Chinese and other Unicode characters across backend and frontend workflows, including Excel-friendly CSV download behavior. The implementation is consistent across all entity types and follows RFC standards for media type declarations.
Merge Status: ✅ READY TO MERGE
b17a8ff264

Summary by Gitar
Updated `UploadFile.tsx` to import `Transi18next` from `../../utils/i18next/LocalUtil` instead of `../../utils/CommonUtils`.

This will update automatically on new commits.