1. Create a Python file
2. Add the following code to your Python file (csv_row_chunking.py)
3. Create a virtual environment
   Open the Terminal and create a Python virtual environment.
4. Install libraries
5. Run PgVector
6. Run the script
CSV Row Chunking Params
| Parameter | Type | Default | Description |
|---|---|---|---|
| rows_per_chunk | int | 100 | The number of rows to include in each chunk. |
| skip_header | bool | False | Whether to skip the header row when chunking. |
| clean_rows | bool | True | Whether to clean and normalize row data. |
| include_header_in_chunks | bool | False | Whether to include the header row in each chunk. |
| max_chunk_size | int | 5000 | Maximum character size for each chunk (fallback limit). |
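
To make the parameters above more concrete, here is a minimal standard-library sketch of how row-based chunking could behave. It is not the library's implementation. The function name `chunk_csv_rows` is hypothetical, and several behaviors are assumptions made for illustration: that "cleaning" means collapsing whitespace in each cell, that rows are rendered as comma-separated lines, and that `max_chunk_size` closes a chunk early without splitting an individual row.

```python
import csv
import io
import re


def chunk_csv_rows(
    text: str,
    rows_per_chunk: int = 100,
    skip_header: bool = False,
    clean_rows: bool = True,
    include_header_in_chunks: bool = False,
    max_chunk_size: int = 5000,
) -> list[str]:
    """Hypothetical helper: split CSV text into chunks made of whole rows."""
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return []

    def render(row: list[str]) -> str:
        if clean_rows:
            # Assumed meaning of "clean": collapse whitespace runs and trim each cell.
            row = [re.sub(r"\s+", " ", cell).strip() for cell in row]
        return ", ".join(row)

    header = render(rows[0])
    # Assumption: the header is taken out of the row stream when it is skipped
    # or when it is repeated at the top of every chunk instead.
    body = rows[1:] if (skip_header or include_header_in_chunks) else rows
    prefix = [header] if include_header_in_chunks else []

    chunks: list[str] = []
    current: list[str] = []

    def flush() -> None:
        # Emit the rows collected so far as one chunk, then start a new one.
        if current:
            chunks.append("\n".join(prefix + current))
            current.clear()

    for line in (render(r) for r in body):
        candidate = "\n".join(prefix + current + [line])
        # Close the chunk when the row limit is reached or, as a fallback,
        # when adding this row would exceed the character limit.
        if current and (len(current) >= rows_per_chunk or len(candidate) > max_chunk_size):
            flush()
        current.append(line)
    flush()
    return chunks


sample = "id,name\n1,  Ada   Lovelace \n2,Alan Turing\n3,Grace Hopper\n"
for chunk in chunk_csv_rows(sample, rows_per_chunk=2, include_header_in_chunks=True):
    print(chunk)
    print("---")
```

In this sketch a single row longer than `max_chunk_size` still becomes its own chunk, since the limit is treated only as a fallback on top of `rows_per_chunk`.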