pray-calc-ml/src/collect
Aric Camarata 3b8c665aca chore: add remaining processors and analysis scripts, gitignore experimental
Tracked: BSRN/SURFRAD processors (reference, excluded from pipeline),
GaN-MN downloader, academic paper fetcher, Madrid SQM processor,
ML analysis scripts (src/analyze/), umsu_medan_2024 raw sightings.

Gitignored: global_extrapolator, instant_1m_injector/vectorized,
massive_harvest_engine, massive_sqm_downloader, global_sqm_harvester,
run_infinite_pipeline.sh, run_massive_collection.sh, search_papers.py
(agent-generated experimental scripts, not part of core pipeline).
2026-03-23 06:44:01 -04:00
..
__init__.py Rebuild as Python data science project 2026-02-25 19:32:47 -05:00
academic_paper_fetcher.py chore: add remaining processors and analysis scripts, gitignore experimental 2026-03-23 06:44:01 -04:00
brin_multistation_processor.py data: update pipeline + dataset to latest collected records 2026-02-28 11:55:24 -05:00
brin_multistation_sqm.py data: update pipeline + dataset to latest collected records 2026-02-28 11:55:24 -05:00
brin_timau_sqm.py data: update pipeline + dataset to latest collected records 2026-02-28 11:55:24 -05:00
download_gan_mn.py chore: add remaining processors and analysis scripts, gitignore experimental 2026-03-23 06:44:01 -04:00
download_gan_mn_gdrive.py data: expand dataset from 5.9k to 91k records via 6 new SQM sources 2026-03-22 16:39:29 -04:00
gan_mn_processor.py data: expand dataset from 5.9k to 91k records via 6 new SQM sources 2026-03-22 16:39:29 -04:00
madrid_sqm_processor.py chore: add remaining processors and analysis scripts, gitignore experimental 2026-03-23 06:44:01 -04:00
openfajr.py data: update pipeline + dataset to latest collected records 2026-02-28 11:55:24 -05:00
paper_extractor.py data: update pipeline + dataset to latest collected records 2026-02-28 11:55:24 -05:00
pdf_extractor.py Expand dataset to 5,871 Fajr / 46 Isha across 114 locations 2026-02-28 10:51:01 -05:00
precomputed_angles.py data: update pipeline + dataset to latest collected records 2026-02-28 11:55:24 -05:00
source_db.py data: update pipeline + dataset to latest collected records 2026-02-28 11:55:24 -05:00
tess_processor.py data: expand dataset from 5.9k to 91k records via 6 new SQM sources 2026-03-22 16:39:29 -04:00
verified_sightings.py Expand dataset to 5,871 Fajr / 46 Isha across 114 locations 2026-02-28 10:51:01 -05:00