Setup 2025 files and started parsing the archive site but was rate limited. Will need to finish it in the future.
This commit is contained in:
49
memory/MEMORY.md
Normal file
49
memory/MEMORY.md
Normal file
@@ -0,0 +1,49 @@
|
||||
# Inlander Restaurant Week Picker - Project Memory
|
||||
|
||||
## Quick Reference
|
||||
- See `scraping-guide.md` for full year-scraping instructions and script templates
|
||||
- See `html-structures.md` for HTML parsing patterns per restaurant type
|
||||
- Project dir: `\\WinServ-20-3.chns.local\Profiles\derekc\Documents\Coding Projects\Gitea-CooperandGoodman-Inlander-Restaurant-Week-Picker\Inlander-Restaurant-Week-Picker`
|
||||
|
||||
## Key Constraints (CRITICAL)
|
||||
- **WebFetch cannot access web.archive.org** — use `curl` via Bash tool instead
|
||||
- **PowerShell cannot run scripts from UNC paths** (\\server\...) — always `cp` scripts to local temp first
|
||||
- **bash `/tmp`** = `C:\Users\DEREKC~1.CHN\AppData\Local\Temp` (8.3 short name)
|
||||
- **PowerShell temp** = `C:\Users\derekc.CHNSLocal\AppData\Local\Temp` (long name) — same dir, different string
|
||||
- **Wayback Machine rate limits** to ~20 requests before throttling with 429; use 3-5 sec delays, wait 30+ min after getting blocked
|
||||
|
||||
## JSON Schema
|
||||
Each entry in `YEAR-restaurants.json`:
|
||||
```json
|
||||
{
|
||||
"name": "Restaurant Name",
|
||||
"slug": "restaurantslug",
|
||||
"price": 45,
|
||||
"areas": ["Downtown"],
|
||||
"cuisine": "American",
|
||||
"url": "https://inlanderrestaurantweek.com/project/SLUG/",
|
||||
"menu": {
|
||||
"hours": "Menu served 5pm-close",
|
||||
"phone": "(509) 555-1234",
|
||||
"courses": {
|
||||
"First Course": [{"name": "Dish Name", "desc": "Description"}],
|
||||
"Second Course": [...],
|
||||
"Third Course": [...]
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
Price is always 25, 35, or 45. gardenparty genuinely has 4 Third Course options.
|
||||
|
||||
## 2025 Data Status
|
||||
- **File**: `2025-restaurants.json` (121 restaurants)
|
||||
- **Wayback snapshot used**: `20250306132630` (primary), `20250401000000` (backup for some)
|
||||
- **Complete (3/3/3+)**: 111 restaurants
|
||||
- **gardenparty**: 3/3/4 — correct, it genuinely offers 4 dessert choices
|
||||
- **tavolata**: 3/3/0 — needs fix-tavolata.ps1 run when rate limit resets
|
||||
- **0/0/0 (JS-only, unrecoverable)**: heritage, kismet, littlenoodle, macdaddys, purgatory, redtail, republickitchen, republicpi, vicinopizza
|
||||
|
||||
## Scripts in Project Directory
|
||||
- `fix-tavolata.ps1` — run after rate limit resets to recover tavolata Third Course
|
||||
- Copy to local temp and run: `cp ...\fix-tavolata.ps1 C:\Users\derekc.CHNSLocal\AppData\Local\Temp\`
|
||||
- Then: `powershell.exe -ExecutionPolicy Bypass -File C:\Users\derekc.CHNSLocal\AppData\Local\Temp\fix-tavolata.ps1`
|
||||
Reference in New Issue
Block a user