Source Ingest Queue
This is the master queue for source collection and ingest.
Acquisition lanes
- Initial Corpus Batch 01
- Ilya Top 30
- Autonomous Driving Seminal Papers
- Vla And Driving
- Llm Seminal Papers
Intake workflow
For each source:
- Save the raw artifact to
raw/inbox/orraw/papers/. - Normalize filename to
year-short-title.pdfwhen possible. - Create or update the matching
wiki/sources/page. - Route updates into concept pages and comparisons.
- Record unresolved questions that the source creates.
Notes
- Citation-count thresholds should be verified at ingest time.
- For source discovery, use paper graphs and citations to expand outward from canonical papers rather than scraping arbitrary lists.
- If AlphaXiv-specific tooling is unavailable, use arXiv, OpenReview, Semantic Scholar, OpenAlex, and project pages.
- The initial real paper set now lives in Initial Corpus Batch 01.