GIATAR: Global Invasive and Alien Traits and Records Dataset¶
Category: Invasive Species · Size: 42.7 MB · Format: ZIP License: CC-BY-NC-ND-4.0 (Non-commercial + No-derivatives: do not republish derivatives, link to the original) · Zenodo record · Data sheet on the CSDH
Consolidated dataset with dated presence records for 46,666 invasive and alien taxa across 249 countries, with standardised biological information on pests.
The data is mounted read-only at /srv/data/giatar-invasive/.
Save anything you produce in your personal folder (~/).
What's in the dataset¶
from pathlib import Path
DATA = Path('/srv/data/giatar-invasive')
for f in sorted(DATA.rglob('*')):
if f.is_file():
print(f"{f.relative_to(DATA)} ({f.stat().st_size/1e6:,.1f} MB)")
GIATAR - Data and query code.zip (42.7 MB)
Explore the ZIP¶
The dataset comes compressed. We list its contents without extracting; if it contains CSVs, pandas can read them straight from inside the ZIP. Remember: /srv/data is read-only — if you need to extract, do it into your folder (~/).
import zipfile
import pandas as pd
zips = sorted(DATA.rglob('*.zip'))
z = zipfile.ZipFile(zips[0])
print('Using:', zips[0].name)
names = z.namelist()
print(f'{len(names)} files inside; first 20:')
for n in names[:20]:
print(' ', n)
csv_inside = [n for n in names if n.lower().endswith('.csv')]
if csv_inside:
df = pd.read_csv(z.open(csv_inside[0]), nrows=100_000, low_memory=False)
display(df.head())
Using: GIATAR - Data and query code.zip 68 files inside; first 20: GIATAR - Data and query code/.env GIATAR - Data and query code/dataset/ GIATAR - Data and query code/dataset/CABI data/ GIATAR - Data and query code/dataset/CABI data/CABI_tables/ GIATAR - Data and query code/dataset/CABI data/CABI_tables/toairTemperature.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/toclimate.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/tocontributors.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/todistributionDatabaseTable.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/toenvironments.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/tohostPlants.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/toimpactSummary.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/tointroductions.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/tolatitudeAndAltitudeRanges.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/tolinksToWebsites.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/tonaturalEnemies.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/topathwayCauses.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/topathwayVectors.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/toplantTrade.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/torainfall.csv GIATAR - Data and query code/dataset/CABI data/CABI_tables/tosymptomsOrSigns.csv
| code | section | Parameter | Lower limit | Upper limit | usageKey | |
|---|---|---|---|---|---|---|
| 0 | 9630 | toairTemperature | Mean annual temperature (ºC) | 5.0 | 37.0 | 3517956 |
| 1 | 109097 | toairTemperature | Mean annual temperature (ºC) | 0.0 | 45.0 | 2096154 |
| 2 | 119196 | toairTemperature | Absolute minimum temperature (ºC) | 17.0 | NaN | 10826565 |
| 3 | 119196 | toairTemperature | Mean annual temperature (ºC) | 17.0 | 37.0 | 10826565 |
| 4 | 3651 | toairTemperature | Absolute minimum temperature (ºC) | 15.0 | NaN | 10304386 |
Your turn¶
This is just the starting point. Some ideas:
- Check the dataset challenge on its CSDH data sheet.
- Work on a copy: right-click the file → Duplicate (or Save Notebook As…). Your changes only live in your Hub space — they're never pushed to GitHub.
- Edited this notebook and want the original back? Use the Restore cell
below (or the
restore.ipynbnotebook). - Questions and results: on the platform forum.
Attribution: data from GIATAR: Global Invasive and Alien Traits and Records Dataset, license CC-BY-NC-ND-4.0. Notebook from the Citizen Science Data Hub (CSDH) — Fundación Ibercivis.
# ⚠️ RESTORE: this DISCARDS YOUR CHANGES to this notebook and resets it to the original.
# 1. Uncomment the line below (remove the #) 2. Run this cell
# 3. Then: menu File → Reload Notebook from Disk
# !git -C ~/citizen-science-data fetch -q origin && git -C ~/citizen-science-data checkout origin/main -- giatar-invasive.ipynb && echo "Restored. Now: File → Reload Notebook from Disk"