Fix CSV quote stripping in custom evaluators #44526

Copilot · 2025-12-29T21:19:02Z

pd.read_csv() by default treats quotation marks as field delimiters and strips them. This caused custom evaluators to receive test when the CSV cell contained "test", making it impossible to distinguish quoted from unquoted values.

Changes

CSVDataFileLoader.load(): Add quoting=csv.QUOTE_NONE to preserve quotes as literal characters
Test coverage: Add test validating quote preservation across multiple scenarios

Example

response,ground_truth
test,"test"

Before: Custom evaluator receives both values as test (quotes stripped)
After: Custom evaluator receives test and "test" (quotes preserved)

This brings CSV behavior in line with JSONL handling, which already preserves quotes correctly.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

pypi.org
- Triggering command: /home/REDACTED/work/azure-sdk-for-python/azure-sdk-for-python/.venv/bin/python3 python3 -m pip install pandas -q (dns block)
scanning-api.github.com
- Triggering command: /home/REDACTED/work/_temp/ghcca-node/node/bin/node /home/REDACTED/work/_temp/ghcca-node/node/bin/node --enable-source-maps /home/REDACTED/work/_temp/copilot-developer-action-main/dist/index.js (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

Original prompt

This section details on the original issue you should resolve

<issue_title>azure-ai-evaluation: creating custom code evaluator drops leading and trailing quotation marks in parameter column</issue_title>
<issue_description>- Package Name: azure-ai-evaluation

Package Version: 1.6.0

Operating System: windows

Python Version: 3.11.9

Describe the bug
If the value of a column starts AND ends with the quotation character (") the quotation characters are dropped when sent to the evaluator's custom python function.

If the value only starts or ends with a quotation mark, it is not dropped. It is only when it starts and ends with the quotation mark that the problem occurs.

Interestingly, the built in code evaluators (BLEU, F1 Score, GLEU and METEOR) all seem to be able to see that the result is not the same as the ground_truth so they must be receiving the leading and trailing quotation marks.

To Reproduce
Steps to reproduce the behavior:

create a custom code evaluator to compare response and ground_truth

prepare dataset, e.g: {"response":"test","ground_truth":"\"test\""}

run the evaluator, the result always said response is equal with ground_truth

Expected behavior
The values sent to the parameters in the python function for a custom code evaluator should be exactly what they are in the dataset's cell with no characters stripped away.
Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Add any other context about the problem here.
</issue_description>

Comments on the Issue (you are @copilot in this section)
@w-javed Team is working on the fix. We'll have a fix in upcoming release. Thanks!

Fixes azure-ai-evaluation: creating custom code evaluator drops leading and trailing quotation marks in parameter column #40996

💬 We'd love your input! Share your thoughts on Copilot coding agent in our 2 minute survey.

Co-authored-by: luigiw <1483379+luigiw@users.noreply.github.com>

Initial plan

20509f1

Copilot AI assigned Copilot and luigiw Dec 29, 2025

Copilot started work on behalf of luigiw December 29, 2025 21:19 View session

Copilot AI and others added 5 commits December 29, 2025 21:26

Fix CSV quote stripping and add test

e3160e5

Co-authored-by: luigiw <1483379+luigiw@users.noreply.github.com>

Fix malformed CSV test data based on code review

eb4e32e

Co-authored-by: luigiw <1483379+luigiw@users.noreply.github.com>

Fix CSV test data and update test expectations

dc0921c

Co-authored-by: luigiw <1483379+luigiw@users.noreply.github.com>

Add explanatory comment for CSV quote handling

fc3782b

Co-authored-by: luigiw <1483379+luigiw@users.noreply.github.com>

Add detailed documentation to test explaining quote handling behavior

c2e38df

Co-authored-by: luigiw <1483379+luigiw@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix issue dropping leading and trailing quotation marks~~ Fix CSV quote stripping in custom evaluators Dec 29, 2025

Copilot AI requested a review from luigiw December 29, 2025 21:39

Copilot finished work on behalf of luigiw December 29, 2025 21:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix CSV quote stripping in custom evaluators #44526

Fix CSV quote stripping in custom evaluators #44526

Copilot AI commented Dec 29, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix CSV quote stripping in custom evaluators #44526

Are you sure you want to change the base?

Fix CSV quote stripping in custom evaluators #44526

Conversation

Copilot AI commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Example

I tried to connect to the following addresses, but was blocked by firewall rules:

Comments on the Issue (you are @copilot in this section)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Dec 29, 2025 •

edited

Loading