1
Create a Python file
csv_row_chunking.py
2
Set up your virtual environment
3
Install dependencies
4
Run PgVector
5
Run the script
CSV Row Chunking Params
| Parameter | Type | Default | Description |
|---|---|---|---|
rows_per_chunk | int | 100 | The number of rows to include in each chunk. |
skip_header | bool | False | Whether to skip the header row when chunking. |
clean_rows | bool | True | Whether to clean and normalize row data. |
include_header_in_chunks | bool | False | Whether to include the header row in each chunk. |
max_chunk_size | int | 5000 | Maximum character size for each chunk (fallback limit). |