Bulk download of CRS data#
Based on oda-reader. Downloads directly the entire raw CRS data from using the OECD API. Optionally it can be saved using save_to_path.
download_crs_parquet(file_url, save_to_path=None)
#
Download the CRS Parquet file from the given URL and extract its contents.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
file_url
|
str
|
The URL of the CRS Parquet file. |
required |
save_to_path
|
str
|
The path to save the extracted files. If None, files are not saved. |
None
|
Returns:
| Type | Description |
|---|---|
DataFrame
|
pd.DataFrame: A DataFrame containing the concatenated data from all parquet files. |
Source code in src\download_crs.py
get_full_crs_parquet_url()
#
Fetch the latest CRS Parquet file ID from the OECD website. Necessary since the file ID changes with each update.
Returns:
| Name | Type | Description |
|---|---|---|
str |
str
|
The full URL of the CRS Parquet file. |