`write_byte_record` and `write_field` does not mix well and this is not properly documented. #335

vi · 2023-09-17T23:14:49Z

What version of the `csv` crate are you using?

1.2.2

Briefly describe the question, bug or feature request.

When I use write_byte_record with a non-empty iterator after prior series of write_fields, the library produces strange output and may fail with UnequalLengths.

Include a complete program demonstrating a problem.

fn main() -> Result<(), Box<dyn std::error::Error>>{
    let mut b = csv::WriterBuilder::new();
    b.has_headers(false);
    let mut csv = b.from_path("out.csv")?;
    let mut record = csv::ByteRecord::new();
    record.push_field(b"12");
    for _ in 0..10000 {
        csv.write_field("F")?;
        csv.write_byte_record(&record)?;
    }
    Ok(())
}

I used similar code (with a csv.write_field("")? workaround to insert the missing comma), thinking that write_record is only for string records, not byte ones.

What is the observed behavior of the code above?

Output file contains records glued together (except of the last line before UnequalLengths error).

The documentation does not suggest, but also does not explicitly prohibit this combination of csv::Writer methods.

What is the expected or desired behavior of the code above?

write_byte_record's documentation explicitly mentions that it write_field should not be used to prepend fields. Or the program behaves just like as if it were write_record instead.

Maybe usage of write_byte_record after write_field should panic, at least in debug profile.

Here are steps of how similar code can end up in a project:

Example code from write_field's documentation with wtr.write_record(None::<&[u8]>)?; (why there is no dedicated method to avoid those turbofishes?);
None::<&[u8]> gets replaced with a record from other CSV file (e.g. to round trip with prepended fields).
Let's preserve non-UTF-8 content, so ByteRecord instead of StringRecord. As we have switched to byte records, assume (erroneously) that we need to switch to write_byte_record from write_record. It also mentions "more quickly" in the docs, which makes the switch even more attractive.

The text was updated successfully, but these errors were encountered:

BurntSushi · 2023-09-17T23:44:46Z

Thanks for the well written issue!

Or the program behaves just like as if it were write_record instead.

This is probably what should happen. I don't have the context paged into cache to know how tricky (if at all) this is. Hopefully not tricky at all.

Example code from write_field's documentation with wtr.write_record(None::<&[u8]>)?; (why there is no dedicated method to avoid those turbofishes?);

Because I perceive this to be a niche/weird case and I'm not sure it warrants another method in the public API.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`write_byte_record` and `write_field` does not mix well and this is not properly documented. #335

`write_byte_record` and `write_field` does not mix well and this is not properly documented. #335

vi commented Sep 17, 2023 •

edited

BurntSushi commented Sep 17, 2023

write_byte_record and write_field does not mix well and this is not properly documented. #335

write_byte_record and write_field does not mix well and this is not properly documented. #335

Comments

vi commented Sep 17, 2023 • edited

What version of the csv crate are you using?

Briefly describe the question, bug or feature request.

Include a complete program demonstrating a problem.

What is the observed behavior of the code above?

What is the expected or desired behavior of the code above?

BurntSushi commented Sep 17, 2023

`write_byte_record` and `write_field` does not mix well and this is not properly documented. #335

`write_byte_record` and `write_field` does not mix well and this is not properly documented. #335

vi commented Sep 17, 2023 •

edited

What version of the `csv` crate are you using?