pray-calc-ml/research/baghdad-sky-brightness-2021.md
Aric Camarata ada08e7ec4 data: expand dataset from 5.9k to 91k records via 6 new SQM sources
Add 6 new data collection pipelines and their processed outputs:

Sources added:
- TESS/Stars4All photometer network: 37 months (Jun 2017-Aug 2020),
  ~40k raw events from 100+ European stations via Zenodo archives
- Globe at Night citizen science: 26k twilight observations (2006-2024),
  filtered from 308k total observations for solar depression 6-22 deg
- GaN-MN continuous monitoring: 45 months (Jan 2022-Sep 2025),
  ~12.5k twilight events from 88 stations across 20+ countries
- Galicia SQM network: 14 stations, 1-min resolution, 7.5k events
- Madrid/Majadahonda SQM: multi-year continuous monitoring, 3.1k events
- washetdonker.nl Netherlands: 7 stations, 3.3k morning events
- Academic papers: Jordan (Abed 2015), Fayum Egypt, India photometer

Pipeline changes:
- ingest.py: add all new files to APPROVED_RAW_CSVS allowlist,
  fix filter to use allowlist instead of hardcoded exclusions
- .gitignore: exclude bulk raw data directories (BSRN, TESS, GaN-MN,
  washetdonker, Globe at Night downloads)

Final dataset: 56,668 Fajr + 34,763 Isha = 91,431 total records
Previous: 5,871 Fajr + 46 Isha = 5,917 total records
2026-03-22 16:39:29 -04:00

1.1 KiB

Sky Brightness Contrast Measurement before Sunrise and after Sunset over a Baghdad Region

Authors: Unknown (Iraqi researchers) Year: ~2021 Journal: Unknown (possibly Iraqi Journal of Science or Journal of Physics: Conference Series) URL: Could not locate on ResearchGate, Google Scholar, or direct search Sites studied: Baghdad, Iraq (~33 20'N, 44 23'E) Observation method: Sky brightness measurement (likely SQM) Date range: ~2 nights Records: Minimal (2 nights)

Status

COULD NOT LOCATE PAPER. Searched:

  • ResearchGate (403 blocked)
  • Google Scholar (JavaScript required, no results in static HTML)
  • Direct DOI search (no DOI known)
  • PubMed (not indexed)

The paper title and approximate details were provided but no accessible URL could be found. The paper reportedly covers only 2 nights of data from Baghdad, which would provide minimal additional training value.

For ML Training

Even if found, 2 nights of data from a single urban location provides very limited training value. Low priority for further search. Baghdad coordinates (33.33N, 44.38E) and UTC+3 are noted for reference.