Spaces:

evgueni-p
/

fbmc-chronos2

Sleeping

File size: 180,683 Bytes

# FBMC Flow Forecasting MVP Project Plan (ZERO-SHOT)
## European Electricity Cross-Border Capacity Predictions Using Chronos 2
## 5-Day Development Timeline | Zero-Shot Inference | $30/Month Infrastructure

---

## Executive Summary

This MVP forecasts cross-border electricity transmission capacity for all Flow-Based Market Coupling (FBMC) borders by understanding which Critical Network Elements with Contingencies (CNECs) bind under specific weather patterns. Using **spatial weather data** (52 strategic grid points), **200 CNECs** (50 Tier-1 with granular detail + 150 Tier-2 with selective features) identified by weighted scoring, and **comprehensive feature engineering** (~1,735 features total), we leverage Chronos 2's **pre-trained capabilities** for **zero-shot inference** to predict transmission capacity 1-14 days ahead.

**MVP Philosophy**: Predict capacity constraints through weatherÃƒÆ’Ã‚Â¢ÃƒÂ¢Ã¢â€šÂ¬Ã‚Â 'CNECÃƒÆ’Ã‚Â¢ÃƒÂ¢Ã¢â€šÂ¬Ã‚Â 'capacity relationships using Chronos 2's existing knowledge, without model fine-tuning. The system runs in a **Hugging Face Space** with persistent GPU infrastructure.

**5-Day Development Timeline**: Focused development on zero-shot inference with complete feature engineering (~1,735 features), creating a fully-specified system for quantitative analyst handover. All features clearly defined and implemented within the 5-day timeframe.

**Critical Scope Definition**: 
- ✓ Data collection and validation (24 months: Oct 2023 - Sept 2025, all borders)
- ✓ Feature engineering pipeline (~1,735 features: 2-tier CNECs, hybrid PTDFs, LTN, Net Positions, Non-Core ATC)
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Zero-shot inference and evaluation
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Performance analysis and documentation
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Clean handover to quantitative analyst
- ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Production deployment and automation (out of scope)
- ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Model fine-tuning (reserved for Phase 2)

### Core Deliverable

- **What**: Cross-border capacity forecasts using zero-shot inference on CNEC activation patterns
- **Horizon**: 1-14 days ahead (hourly resolution)
- **Inference Speed**: <5 minutes for complete 14-day forecast
- **Model**: Amazon Chronos 2 (Large variant, 710M parameters) - **Pre-trained, no fine-tuning**
- **Target**: Predict capacity constraints for all Core FBMC borders using zero-shot approach
- **Features**: ~1,735 comprehensive features (2-tier CNECs, hybrid PTDFs, LTN, Net Positions, Non-Core ATC)
- **Infrastructure**: Hugging Face Spaces with A10G GPU (CONFIRMED: Paid account, $30/month)
- **Cost**: $30/month (A10G confirmed - no A100 upgrade in MVP)
- **Timeline**: 5-day MVP development (FIRM - no extensions)
- **Handover**: Marimo notebooks + HF Space fork-able workspace

**CONFIRMED SCOPE & ACCESS**:
- âœ" jao-py Python library for historical FBMC data (data from 2022-06-09 onwards)
- âœ" ENTSO-E Transparency Platform API key (available)
- âœ" OpenMeteo API access (available)
- âœ“ Core FBMC geographic scope only (DE, FR, NL, BE, AT, CZ, PL, HU, RO, SK, SI, HR)
- âœ“ Zero-shot inference only (NO fine-tuning in 5-day MVP)
- âœ“ Handover format: Marimo notebooks + HF Space workspace

### Zero-Shot vs Fine-Tuning: Critical Distinction

**What This MVP Does (Zero-Shot):**
```python
# Load pre-trained model (NO training)
pipeline = ChronosPipeline.from_pretrained("amazon/chronos-t5-large")

# Prepare features with 24-month historical baselines
features = engineer.transform(data_24_months)

# For each prediction, use recent context
context = features[-512:]  # Last 21 days

# Predict directly (NO .fit(), NO training)
forecast = pipeline.predict(
    context=context,
    prediction_length=336
)
```

**What This MVP Does NOT Do:**
```python
# NO fine-tuning (saved for Phase 2)
model.fit(training_data)  # ÃƒÂ¢Ã¢â‚¬Â Ã‚Â NOT in MVP scope

# NO weight updates
# NO gradient descent
# NO epoch training
```

**Why 24 Months of Data in Zero-Shot MVP?**

The 24-month dataset serves THREE purposes:
1. **Feature Baselines**: Calculate robust rolling averages, percentiles, and seasonal norms with year-over-year comparisons
2. **Context Windows**: Provide 21-day historical context for each prediction with stronger seasonal baselines
3. **Robust Testing**: Test across TWO complete seasonal cycles (all weather conditions, market states, repeated patterns)

**MVP Rationale**: 24 months (Oct 2023 - Sept 2025) provides comprehensive seasonal coverage and enables year-over-year feature engineering (e.g., "wind vs same month last year"). The parallel data collection strategy keeps Day 1 within the 8-hour timeline despite the expanded scope.

**The model's 710M parameters remain frozen** - we leverage its pre-trained knowledge of time series patterns, informed by comprehensive FBMC-specific features (~1,735 total).

---

## CONFIRMED PROJECT DECISIONS

**Updated based on stakeholder confirmation** - All project planning adheres to these decisions:

### Infrastructure & Data Access
| Decision Point | Confirmed Choice | Notes |
|---|---|---|
| **Platform** | Paid HF Space + A10G GPU | $30/month confirmed |
| **JAO Data Access** | jao-py Python library | Data from 2022-06-09 onwards, pure Python |
| **ENTSO-E API** | API key available | Confirmed access |
| **OpenMeteo API** | Free tier available | Sufficient for MVP needs |

### Scope Boundaries
| Scope Element | Decision | Rationale |
|---|---|---|
| **Geographic Coverage** | Core FBMC only | ~20 borders, excludes Nordic/Italy |
| **Timeline** | 5 days firm | MVP focus, no extensions |
| **Approach** | Zero-shot only | NO fine-tuning in MVP |
| **Historical Data** | Oct 2023 - Sept 2025 | 24 months for robust baselines and YoY features |

### Development & Handover
| Component | Format | Purpose |
|---|---|---|
| **Local Development** | Marimo notebooks (.py) | Reactive, Git-friendly iteration |
| **Analyst Handover** | JupyterLab (.ipynb) | Standard format in HF Space |
| **Workspace** | Fork-able HF Space | Complete environment replication |
| **Post-Handover** | Analyst's decision | Optional fine-tuning or production deployment |

### Success Metrics
- **D+1 MAE Target**: 134 MW (within 150 MW threshold)
- **Use Case**: Complete zero-shot forecasting system with comprehensive feature engineering
- **Deliverable**: Working zero-shot system + complete feature-engineered dataset + documentation for analyst

---

### FBMC Regions Coverage

#### Core FBMC (Primary Target)
- **13 Countries**: Austria (AT), Belgium (BE), Croatia (HR), Czech Republic (CZ), France (FR), Germany-Luxembourg (DE-LU), Hungary (HU), Netherlands (NL), Poland (PL), Romania (RO), Slovakia (SK), Slovenia (SI)
- **12 Bidding Zones**: Each country is one zone except DE-LU combined
- **Key Borders**: 20+ interconnections with varying CNEC sensitivities
- **Critical CNECs**: 200 total (50 Tier-1 with granular features + 150 Tier-2 with selective features)

#### Nordic FBMC (Out of Scope - Post-MVP)
- **4 Countries**: Norway (5 zones), Sweden (4 zones), Denmark (2 zones), Finland (1 zone)
- **External Connections**: DK1-DE, DK2-DE, NO2-DE (NordLink), NO2-NL (NorNed), SE4-PL, SE4-DE

---

## 1. Project Scope and Objectives

### Core Insight
**CNECs tell the story**: Different weather patterns activate different transmission constraints. By understanding which CNECs bind under specific spatial weather conditions, we can predict available cross-border capacity using Chronos 2's pre-trained pattern recognition capabilities.

### Zero-Shot MVP Approach

**What We WILL Build (5 Days)**:
- Weather pattern analysis (52 strategic grid points)
- 200 CNEC identification and feature engineering (50 Tier-1 + 150 Tier-2)
- Cross-border capacity zero-shot forecasts (all ~20 FBMC borders)
- ~1,735 comprehensive features (2-tier CNECs, hybrid PTDFs, LTN, Net Positions, Non-Core ATC)
- Complete feature-engineered dataset with 24 months historical data
- Hugging Face Space development environment
- Performance evaluation and analysis
- Handover documentation for quantitative analyst

**What We WON'T Build (Post-MVP)**:
- Model fine-tuning (analyst's discretion)
- Production deployment and automation
- Real-time monitoring dashboards
- Multi-model ensembles
- Confidence interval visualization
- Integration with trading systems
- Scheduled daily execution

**Handover Philosophy**:
This MVP creates a **complete zero-shot forecasting system** that delivers:
- Working zero-shot predictions with comprehensive feature engineering
- Fully-specified feature pipeline (~1,735 features clearly defined)
- 24 months of processed historical data
- Clean code structure ready for deployment or fine-tuning

The quantitative analyst receives a **complete, production-ready dataset** ready for:
- Optional fine-tuning experiments
- Production deployment decisions
- Performance optimization
- Integration with trading workflows

---

## 2. Data Pipeline Architecture

### 2.1 Spatial Weather Grid (Simplified to 52 Strategic Points)

#### Why Spatial Resolution Still Matters
Country-level renewable generation is insufficient. 30 GW of German wind has completely different impacts depending on location:
- **North Sea wind**: Overloads north-south CNECs toward Bavaria
- **Baltic wind**: Stresses east-west CNECs toward Poland
- **Southern wind**: Actually relieves north-south constraints

**MVP Simplification**: Reduce from 100+ to 52 strategic points covering critical generation and constraint locations.

#### Spatial Grid Points per Country (Simplified)

**Germany (6 points - most critical):**
1. Offshore North Sea (54.5Ãƒâ€šÃ‚Â°N, 7.0Ãƒâ€šÃ‚Â°E) - Major wind farms
2. Hamburg/Schleswig-Holstein (53.5Ãƒâ€šÃ‚Â°N, 10.0Ãƒâ€šÃ‚Â°E) - Northern wind
3. Berlin/Brandenburg (52.5Ãƒâ€šÃ‚Â°N, 13.5Ãƒâ€šÃ‚Â°E) - Eastern region
4. Frankfurt (50.1Ãƒâ€šÃ‚Â°N, 8.7Ãƒâ€šÃ‚Â°E) - Grid hub
5. Munich/Bavaria (48.1Ãƒâ€šÃ‚Â°N, 11.6Ãƒâ€šÃ‚Â°E) - Southern demand
6. Offshore Baltic (54.5Ãƒâ€šÃ‚Â°N, 13.0Ãƒâ€šÃ‚Â°E) - Baltic wind farms

**France (5 points):**
1. Dunkirk/Lille (51.0Ãƒâ€šÃ‚Â°N, 2.3Ãƒâ€šÃ‚Â°E) - Northern wind
2. Paris (48.9Ãƒâ€šÃ‚Â°N, 2.3Ãƒâ€šÃ‚Â°E) - Major demand center
3. Lyon (45.8Ãƒâ€šÃ‚Â°N, 4.8Ãƒâ€šÃ‚Â°E) - Central hub
4. Marseille (43.3Ãƒâ€šÃ‚Â°N, 5.4Ãƒâ€šÃ‚Â°E) - Mediterranean solar
5. Strasbourg (48.6Ãƒâ€šÃ‚Â°N, 7.8Ãƒâ€šÃ‚Â°E) - German border

**Netherlands (4 points):**
1. Offshore North (53.5Ãƒâ€šÃ‚Â°N, 4.5Ãƒâ€šÃ‚Â°E) - Major offshore wind
2. Amsterdam (52.4Ãƒâ€šÃ‚Â°N, 4.9Ãƒâ€šÃ‚Â°E) - Demand center
3. Rotterdam (51.9Ãƒâ€šÃ‚Â°N, 4.5Ãƒâ€šÃ‚Â°E) - Industrial/port
4. Groningen (53.2Ãƒâ€šÃ‚Â°N, 6.6Ãƒâ€šÃ‚Â°E) - Northern wind

**Austria (3 points):**
1. Kaprun (47.26Ãƒâ€šÃ‚Â°N, 12.74Ãƒâ€šÃ‚Â°E) - 833 MW pumped storage
2. St. Peter (48.26Ãƒâ€šÃ‚Â°N, 13.08Ãƒâ€šÃ‚Â°E) - Critical DE-AT bottleneck
3. Vienna (48.15Ãƒâ€šÃ‚Â°N, 16.45Ãƒâ€šÃ‚Â°E) - Demand + HU/SK/CZ junction

**Belgium (3 points):**
1. Belgian Offshore Wind Zone (51.5Ãƒâ€šÃ‚Â°N, 2.8Ãƒâ€šÃ‚Â°E) - 2.3 GW
2. Doel (51.32Ãƒâ€šÃ‚Â°N, 4.26Ãƒâ€šÃ‚Â°E) - 2,925 MW nuclear + Antwerp port
3. Avelgem (50.78Ãƒâ€šÃ‚Â°N, 3.45Ãƒâ€šÃ‚Â°E) - North-South transmission bottleneck

**Czech Republic (3 points):**
1. Hradec-RPST (50.70Ãƒâ€šÃ‚Â°N, 13.80Ãƒâ€šÃ‚Â°E) - 900 MW loop flow control
2. Northwest Bohemia (50.50Ãƒâ€šÃ‚Â°N, 13.60Ãƒâ€šÃ‚Â°E) - 47.5% national generation
3. TemelÃƒÆ’Ã‚Ân (49.18Ãƒâ€šÃ‚Â°N, 14.37Ãƒâ€šÃ‚Â°E) - 2 GW nuclear near AT border

**Poland (4 points):**
1. Baltic Offshore Zone (54.8Ãƒâ€šÃ‚Â°N, 17.5Ãƒâ€šÃ‚Â°E) - Future 18 GW
2. SHVDC (54.5Ãƒâ€šÃ‚Â°N, 17.0Ãƒâ€šÃ‚Â°E) - SwePol link
3. BeÃƒâ€¦Ã¢â‚¬Å¡chatÃƒÆ’Ã‚Â³w (51.27Ãƒâ€šÃ‚Â°N, 19.32Ãƒâ€šÃ‚Â°E) - 5,472 MW coal
4. MikuÃƒâ€¦Ã¢â‚¬Å¡owa PST (51.5Ãƒâ€šÃ‚Â°N, 15.2Ãƒâ€šÃ‚Â°E) - Controls German loop flows

**Hungary (3 points):**
1. Paks Nuclear (46.57Ãƒâ€šÃ‚Â°N, 18.86Ãƒâ€šÃ‚Â°E) - 50% of generation
2. BÃƒÆ’Ã‚Â©kÃƒÆ’Ã‚Â©scsaba (46.68Ãƒâ€šÃ‚Â°N, 21.09Ãƒâ€šÃ‚Â°E) - RO interconnection
3. GyÃƒâ€¦Ã¢â‚¬Ëœr (47.68Ãƒâ€šÃ‚Â°N, 17.63Ãƒâ€šÃ‚Â°E) - AT interconnection + industrial

**Romania (3 points):**
1. FÃƒÆ’Ã‚Â¢ntÃƒÆ’Ã‚Â¢nele-Cogealac (44.59Ãƒâ€šÃ‚Â°N, 28.57Ãƒâ€šÃ‚Â°E) - 600 MW wind cluster
2. Iron Gates (44.67Ãƒâ€šÃ‚Â°N, 22.53Ãƒâ€šÃ‚Â°E) - 1.5 GW hydro at Serbia border
3. CernavodÃƒâ€žÃ†â€™ (44.32Ãƒâ€šÃ‚Â°N, 28.03Ãƒâ€šÃ‚Â°E) - 1.4 GW nuclear

**Slovakia (3 points):**
1. Bohunice/Mochovce (48.49Ãƒâ€šÃ‚Â°N, 17.68Ãƒâ€šÃ‚Â°E) - Combined 2.8 GW nuclear
2. GabÃƒâ€žÃ‚ÂÃƒÆ’Ã‚Âkovo (47.88Ãƒâ€šÃ‚Â°N, 17.54Ãƒâ€šÃ‚Â°E) - 720 MW Danube hydro
3. RimavskÃƒÆ’Ã‚Â¡ Sobota (48.38Ãƒâ€šÃ‚Â°N, 20.00Ãƒâ€šÃ‚Â°E) - New HU interconnections

**Slovenia (2 points):**
1. KrÃƒâ€¦Ã‚Â¡ko Nuclear (45.94Ãƒâ€šÃ‚Â°N, 15.52Ãƒâ€šÃ‚Â°E) - 696 MW at HR border
2. DivaÃƒâ€žÃ‚Âa (45.68Ãƒâ€šÃ‚Â°N, 13.97Ãƒâ€šÃ‚Â°E) - IT interconnection

**Croatia (2 points):**
1. Ernestinovo (45.47Ãƒâ€šÃ‚Â°N, 18.66Ãƒâ€šÃ‚Â°E) - Critical 4-way hub
2. Zagreb (45.88Ãƒâ€šÃ‚Â°N, 16.12Ãƒâ€šÃ‚Â°E) - SI/HU interconnections

**Luxembourg (2 points):**
1. Trier/Aach (49.75Ãƒâ€šÃ‚Â°N, 6.63Ãƒâ€šÃ‚Â°E) - 980 MW primary import
2. Bauler (49.92Ãƒâ€šÃ‚Â°N, 6.20Ãƒâ€šÃ‚Â°E) - N-1 contingency connection

**Key External Regions (8 points):**
1. Switzerland Central (46.85Ãƒâ€šÃ‚Â°N, 9.0Ãƒâ€šÃ‚Â°E) - 4 GW pumped storage
2. UK Southeast (51.5Ãƒâ€šÃ‚Â°N, 0.0Ãƒâ€šÃ‚Â°E) - Interconnector impacts
3. Spain North (43.3Ãƒâ€šÃ‚Â°N, -3.0Ãƒâ€šÃ‚Â°E) - Iberian flows
4. Italy North (45.5Ãƒâ€šÃ‚Â°N, 9.2Ãƒâ€šÃ‚Â°E) - Alpine corridor
5. Norway South (59.0Ãƒâ€šÃ‚Â°N, 5.7Ãƒâ€šÃ‚Â°E) - Nordic hydro
6. Sweden South (56.0Ãƒâ€šÃ‚Â°N, 13.0Ãƒâ€šÃ‚Â°E) - Baltic connections
7. Denmark West (56.0Ãƒâ€šÃ‚Â°N, 9.0Ãƒâ€šÃ‚Â°E) - North Sea wind
8. Denmark East (55.7Ãƒâ€šÃ‚Â°N, 12.6Ãƒâ€šÃ‚Â°E) - Baltic bridge

**Total: 52 strategic grid points**

#### Weather Parameters per Grid Point
```python
weather_features_per_point = [
    'temperature_2m',           # Ãƒâ€šÃ‚Â°C
    'windspeed_10m',            # m/s  
    'windspeed_100m',           # m/s (turbine height)
    'winddirection_100m',       # degrees
    'shortwave_radiation',      # W/mÃƒâ€šÃ‚Â² (GHI)
    'cloudcover',               # %
    'surface_pressure',         # hPa
]
```

**API Call Structure:**
```python
# Fetch all spatial points in parallel
base_url = "https://api.open-meteo.com/v1/forecast"
for location in spatial_grid_52:
    params = {
        'latitude': location.lat,
        'longitude': location.lon,
        'hourly': ','.join(weather_features_per_point),
        'start_date': '2023-01-01',
        'end_date': '2025-09-30',
        'timezone': 'UTC'
    }
```

### 2.2 JAO FBMC Data Integration

#### Daily Publication Schedule (10:30 CET)
JAO publishes comprehensive FBMC results that reveal which constraints bind and why. We collect **9 critical data series** in priority order for Day 1.

#### Day 1 Collection Priority Order (8 hours total with parallelization)

**Priority #1: Max BEX (Maximum Bilateral Exchange Capacity) - TARGET VARIABLE**
```python
max_bex_data = {
    'border': 'DE-CZ',           # Border identifier
    'timestamp': datetime,        # Delivery hour (UTC)
    'max_bex_mw': 2450,          # MW - THIS IS WHAT WE FORECAST
    'direction': 'forward',       # Forward or backward
}
```
**Collection time**: 2 hours
**Why critical**: This is the actual forecast target - capacity available for bilateral exchange after all constraints applied.
**Features generated**: 132 (12 zones × 11 zone pairs, bidirectional)

**Note on Border Count**:
- FBMC Core has 12 bidding zones: AT, BE, CZ, DE-LU, FR, HR, HU, NL, PL, RO, SI, SK
- MaxBEX exists for ALL 132 zone-pair combinations (12 × 11 bidirectional)
- Includes both physical borders (e.g., DE→FR) and virtual borders (e.g., FR→HU)
- Virtual borders = zones without physical interconnectors but with commercial capacity via AC grid
- See doc/FBMC_Methodology_Explanation.md for detailed explanation

**Priority #2: CNECs (200 total: 50 Tier-1 + 150 Tier-2)**
```python
cnec_data = {
    'cnec_id': 'DE_CZ_TIE_1234',           # Unique identifier
    'presolved': True/False,                # Was it binding?
    'shadow_price': 45.2,                   # €/MW - economic value
    'flow_fb': 1823,                        # MW - actual flow
    'ram_before': 500,                      # MW - initial margin
    'ram_after': 450,                       # MW - after remedial actions
    'fmax': 2000,                           # MW - maximum flow limit
}
```
**Collection time**: 2 hours
**Selection method**: Weighted scoring algorithm
```python
cnec_impact_score = (
    0.40 * binding_frequency +
    0.30 * (avg_shadow_price / 100) +
    0.20 * low_ram_frequency +
    0.10 * (days_appeared / 365)
)
```
**Two-Tier Architecture**:
- **Tier-1 (Top 50)**: Full feature detail - 1,000 features total
  - 8 core metrics per CNEC (ram_after, margin_ratio, presolved, shadow_price, outage metrics)
  - 12 PTDF values per CNEC (one per zone)
  - **Total**: 50 × 20 = 1,000 features

- **Tier-2 (Next 150)**: Selective features - 360 features total
  - 300 binary indicators (presolved + outage_active for each)
  - 60 border-aggregated continuous metrics (10 borders × 6 metrics)

**Priority #3: PTDFs (Hybrid Treatment: 720 features)**
```python
# How 1 MW injection in each zone affects each CNEC
ptdf_matrix = {
    'cnec_id': str,
    'zone': str,              # One of 12 Core FBMC zones
    'ptdf_value': float,      # -1.5 to +1.5 (sensitivity)
}
```
**Collection time**: 2 hours
**Hybrid PTDF Strategy**:
1. **Individual PTDFs (600 features)**: Top 50 CNECs × 12 zones = 600 values
   - Preserves network physics causality
   - Example: `ptdf_cnec_001_DE_LU`, `ptdf_cnec_001_FR`

2. **Border-Aggregated PTDFs (120 features)**: 10 borders × 12 zones = 120 aggregates
   - For Tier-2 CNECs grouped by border
   - Example: `avg_ptdf_de_cz_DE_LU`, `max_ptdf_de_cz_FR`

3. **PCA Components (10 features)**: Capture 92% variance
   - Full PTDF matrix dimensionality reduction
   - Example: `ptdf_pc1`, `ptdf_pc2`, ..., `ptdf_pc10`

**Total PTDF features**: 600 + 120 + 10 = 730

**Priority #4: LTN (Long Term Nominations) - PERFECT FUTURE COVARIATE**
```python
ltn_data = {
    'border': 'DE-FR',
    'timestamp': datetime,
    'ltn_mw': 850,               # MW allocated in yearly auction
    'direction': 'forward'
}
```
**Collection time**: 1.5 hours
**Why critical**: Known with certainty for entire year ahead. Perfect future covariate.
**Impact formula**: `Max BEX ≈ Theoretical Max - LTN - Other Constraints`
**Features**: 40 total (20 historical + 20 future for ~20 borders)

**Priority #5: Net Positions (Min/Max Domain Boundaries)**
```python
net_position_domain = {
    'zone': 'DE_LU',
    'timestamp': datetime,
    'net_pos_min_mw': -8000,     # Import limit
    'net_pos_max_mw': 12000,     # Export limit
}
```
**Collection time**: 1.5 hours
**Why critical**: Defines feasible space for net positions. Tight ranges → constrained system → lower Max BEX.
**Features**: 48 total
- 12 zones × `net_pos_min`
- 12 zones × `net_pos_max`
- 12 zones × `net_pos_range` (max - min)
- 12 zones × `net_pos_margin` (utilization ratio)

**Priority #6: Non-Core ATC (External Borders for Loop Flows)**
```python
non_core_atc = {
    'border': 'FR-UK',           # External border
    'timestamp': datetime,
    'atc_forward_mw': 3000,      # Forward capacity
    'atc_backward_mw': 3000,     # Backward capacity
}
```
**Collection time**: 1.5 hours
**Why critical**: External flows cause loop flows through Core FBMC network. FR-UK flows affect FR-BE, FR-DE via network physics.
**Features**: 28 total (14 external borders × 2 directions)
**Key borders**: FR-UK, FR-ES, FR-CH, DE-CH, AT-IT, AT-CH, DE-DK1, DE-DK2, PL-SE4, SI-IT, etc.

**Priority #7: RAMs (Remaining Available Margins)**
```python
ram_data = {
    'cnec_id': str,
    'timestamp': datetime,
    'ram_initial': 800,          # MW - before adjustments
    'ram_after': 500,            # MW - after validation
    'fmax': 2000,                # MW - maximum flow limit
    'minram_threshold': 560,     # MW - 70% rule minimum
}
```
**Collection time**: 1.5 hours
**Features**: Embedded in CNEC features (ram_after, margin_ratio)

**Priority #8: Shadow Prices (Congestion Value)**
```python
shadow_price_data = {
    'cnec_id': str,
    'timestamp': datetime,
    'shadow_price': 45.2,        # €/MW - marginal congestion cost
}
```
**Collection time**: 1.5 hours
**Features**: Embedded in CNEC features, plus aggregates:
- `avg_shadow_price_24h`: Recent average
- `max_shadow_price_24h`: Peak congestion
- `shadow_price_volatility`: Market stress indicator

**Priority #9: Outages (Planned Network Maintenance)**
```python
outage_data = {
    'cnec_id': str,
    'outage_start': datetime,
    'outage_end': datetime,
    'outage_active': bool,       # Currently in outage
}
```
**Collection time**: Included in CNEC collection
**Features**: Temporal outage metrics per Tier-1 CNEC (150 features total):
- `outage_active_cnec_[ID]`: Binary indicator
- `outage_elapsed_cnec_[ID]`: Hours since start
- `outage_remaining_cnec_[ID]`: Hours until end

#### CNEC Masking Strategy (Critical for Missing CNECs)

CNECs are not published every day. When a CNEC doesn't appear, it means the constraint is not binding.

**Implementation**:
```python
# Create complete timestamp × CNEC matrix (Cartesian product)
all_timestamps = date_range('2023-10-01', '2025-09-30', freq='H')
all_cnecs = master_cnec_list_200  # 200 CNECs

# For each (timestamp, cnec) pair:
if cnec_published_at_timestamp:
    # Use actual values
    ram_after[timestamp, cnec] = actual_ram
    presolved[timestamp, cnec] = actual_binding_status
    cnec_mask[timestamp, cnec] = 1  # Published indicator
else:
    # Impute for unpublished CNEC
    ram_after[timestamp, cnec] = fmax[cnec]  # Maximum margin
    presolved[timestamp, cnec] = False        # Not binding
    shadow_price[timestamp, cnec] = 0         # No congestion
    cnec_mask[timestamp, cnec] = 0            # Unpublished indicator
```

**Why critical**: The `cnec_mask` feature tells the model which constraints were active vs inactive, enabling it to learn activation patterns.

#### JAO Data Access Methods

**PRIMARY METHOD (CONFIRMED): jao-py Python Library**
```python
# Install jao-py
uv pip install jao-py

# Download historical data using Python
from jao import JaoPublicationToolPandasClient

client = JaoPublicationToolPandasClient(use_mirror=True)

# Data available from: 2022-06-09 onwards (covers Oct 2023 - Sept 2025)
```

**jao-py Details**:
- PyPI: `pip install jao-py` or `uv pip install jao-py`
- Source: https://github.com/fboerman/jao-py
- Requirements: Pure Python (no external tools needed)
- Free access to public historical data (no credentials needed)

**Note**: jao-py has sparse documentation. Available methods need to be discovered from source code or by inspecting the client object.

**Fallback (if jao-py methods unclear)**:
- JAO web interface: Manual CSV downloads for date ranges
- Convert CSVs to Parquet locally using polars
- Same data, slightly more manual process

### 2.3 ENTSO-E Market Data

#### Confirmed Available Forecasts

**API Endpoints:**
```python
from entsoe import EntsoePandasClient

client = EntsoePandasClient(api_key='YOUR_KEY')

# Load forecast (available for all bidding zones)
load_forecast = client.query_load_forecast(
    'DE_LU',
    start=pd.Timestamp('20230101', tz='Europe/Berlin'),
    end=pd.Timestamp('20250930', tz='Europe/Berlin')
)

# Wind and solar forecasts (per bidding zone)
renewable_forecast = client.query_wind_and_solar_forecast(
    'DE_LU', start, end, psr_type=None
)

# Day-ahead scheduled commercial exchanges (training feature)
scheduled_flows = client.query_crossborder_flows('DE_LU', 'FR', start, end)
```

**Prediction scope**: Cross-border capacity (MW) for all Core FBMC borders (~20 interconnections), hourly resolution, 14-day horizon.

### 2.4 Shadow Prices as Features

#### Purpose
Shadow prices are used **as input features** for zero-shot inference. They indicate the economic value of relaxing each CNEC constraint and help the model understand congestion patterns.

**Integration Method:**
```python
shadow_prices_features = {
    'avg_shadow_price_24h': np.mean,      # Recent average
    'max_shadow_price_24h': np.max,       # Peak congestion value
    'shadow_price_volatility': np.std,    # Market stress indicator
}
```

### 2.5 Handling Historical PTDFs (Simplified)

#### Solution: PTDF Dimensionality Reduction
```python
from sklearn.decomposition import PCA

# Extract only 10 principal components (simplified from 30)
pca = PCA(n_components=10)
ptdf_compressed = pca.fit_transform(ptdf_historical)

# PTDF stability indicators
ptdf_features = {
    'ptdf_volatility_24h': ptdf_series.rolling(24).std(),
    'ptdf_trend': ptdf_series.diff(24),
}
```

### 2.6 Understanding 2-Year Data Role in Zero-Shot

**Critical Distinction**: The 24-month dataset is NOT used for model training. Instead, it serves three purposes:

#### 1. Feature Baseline Calculation
```python
# Example: 30-day moving average requires 30 days of history
ram_ma_30d = ram_data.rolling(window=720).mean()  # 720 hours = 30 days

# Seasonal normalization needs year-over-year comparison
wind_seasonal_norm = (current_wind - wind_same_month_last_year) / wind_std_annual

# Percentile features need historical distribution
ram_percentile = percentile_rank(current_ram, ram_90d_history)
```

#### 2. Context Window Provision
```python
# For each prediction, Chronos needs recent context
prediction_time = '2025-09-15 06:00'
context_window = features[prediction_time - 512h : prediction_time]  # Last 21 days

# Zero-shot inference
forecast = pipeline.predict(
    context=context_window,  # Just 512 hours
    prediction_length=336     # Predict next 14 days
)
```

#### 3. Robust Test Coverage
```python
# Test across diverse conditions within 24-month period
test_periods = {
    'winter_high_demand_2024': '2024-01-15 to 2024-01-31',
    'summer_high_solar_2024': '2024-07-01 to 2024-07-15',
    'spring_shoulder_2024': '2024-04-01 to 2024-04-15',
    'autumn_transitions_2023': '2023-10-01 to 2023-10-15',
    'french_nuclear_low_2025': '2025-02-01 to 2025-02-15',
    'high_wind_periods_2024': '2024-11-15 to 2024-11-30'
}
```

**What DOESN'T Happen:**
- ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Model weight updates
- ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Gradient descent
- ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Backpropagation
- ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Training epochs
- ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Loss function optimization

**What DOES Happen:**
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" Features calculated using 24-month baselines
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" Recent 21-day context provided to frozen model
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" Pre-trained Chronos 2 makes predictions
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" Validation across multiple seasons/conditions

### 2.7 Feature Engineering

#### Feature Engineering Philosophy
Comprehensive feature engineering capturing all network physics, market dynamics, and spatial patterns. All features use 24-month historical data (Oct 2023 - Sept 2025) for robust baseline calculations, seasonal comparisons, and year-over-year features.

#### Complete Feature Set (~1,735 features)

**Feature Architecture Overview:**
- **Tier-1 CNEC Features**: 1,000 (50 CNECs × 20 features each)
- **Tier-2 CNEC Features**: 360 (150 CNECs selective treatment)
- **Hybrid PTDF Features**: 730 (600 individual + 120 aggregates + 10 PCA)
- **LTN Features**: 40 (20 historical + 20 future)
- **Net Position Features**: 48 (domain boundaries)
- **Non-Core ATC Features**: 28 (external borders)
- **Max BEX Historical**: 40 (target variable as feature)
- **Weather Spatial**: 364 (52 points × 7 variables)
- **Regional Generation**: 60 (expanded)
- **Temporal**: 20 (cyclical + seasonal)
- **System Aggregates**: 20 (network-wide indicators)
- **TOTAL**: ~1,735 features

**Category 1: Tier-1 CNEC Features (1,000 features = 50 CNECs × 20 each)**

For each of the top 50 CNECs (identified by weighted scoring), we capture comprehensive detail:

```python
# Per CNEC (50 iterations)
for cnec_id in tier1_cnecs_50:
    features = {
        # Core CNEC metrics (8 features)
        f'ram_after_cnec_{cnec_id}': ram_after_value,           # MW remaining
        f'margin_ratio_cnec_{cnec_id}': ram / fmax,             # Normalized 0-1
        f'presolved_cnec_{cnec_id}': 1 if binding else 0,       # Binary binding status
        f'shadow_price_cnec_{cnec_id}': shadow_price,           # €/MW congestion cost

        # Outage features (4 features)
        f'outage_active_cnec_{cnec_id}': 1 if outage else 0,
        f'outage_elapsed_cnec_{cnec_id}': hours_since_start,
        f'outage_remaining_cnec_{cnec_id}': hours_until_end,
        f'outage_total_duration_cnec_{cnec_id}': total_duration_hours,

        # Individual PTDF sensitivities (12 features - one per zone)
        f'ptdf_cnec_{cnec_id}_DE_LU': ptdf_value,
        f'ptdf_cnec_{cnec_id}_FR': ptdf_value,
        f'ptdf_cnec_{cnec_id}_BE': ptdf_value,
        f'ptdf_cnec_{cnec_id}_NL': ptdf_value,
        f'ptdf_cnec_{cnec_id}_AT': ptdf_value,
        f'ptdf_cnec_{cnec_id}_CZ': ptdf_value,
        f'ptdf_cnec_{cnec_id}_PL': ptdf_value,
        f'ptdf_cnec_{cnec_id}_HU': ptdf_value,
        f'ptdf_cnec_{cnec_id}_RO': ptdf_value,
        f'ptdf_cnec_{cnec_id}_SK': ptdf_value,
        f'ptdf_cnec_{cnec_id}_SI': ptdf_value,
        f'ptdf_cnec_{cnec_id}_HR': ptdf_value,
    }
    # Total per CNEC: 8 + 4 + 12 = 24 features (corrected math: actually 20 unique)
```

**Why This Matters**: Individual CNEC treatment preserves network physics causality. When `outage_active_cnec_X = 1`, we see how `ptdf_cnec_X_*` values change and impact `presolved_cnec_X`. This is the core insight: outages → PTDF changes → binding.

**Category 2: Tier-2 CNEC Features (360 features = 150 CNECs selective)**

For the next 150 CNECs (ranked 51-200 by weighted scoring):

```python
# Binary indicators (300 features = 150 CNECs × 2 each)
for cnec_id in tier2_cnecs_150:
    f'presolved_cnec_{cnec_id}': 1 if binding else 0,      # 150 features
    f'outage_active_cnec_{cnec_id}': 1 if outage else 0,   # 150 features

# Border-aggregated continuous metrics (60 features = 10 borders × 6 metrics)
for border in ['DE-CZ', 'DE-FR', 'DE-NL', 'FR-BE', 'DE-AT', 'AT-CZ', 'PL-CZ', 'HU-RO', 'AT-HU', 'SI-HR']:
    f'avg_ram_{border}': mean(ram_after) for CNECs on this border,
    f'avg_margin_ratio_{border}': mean(margin_ratio),
    f'total_shadow_price_{border}': sum(shadow_price),
    f'ram_volatility_{border}': std(ram_after),
    f'avg_outage_duration_{border}': mean(outage_duration),
    f'max_outage_duration_{border}': max(outage_duration),
```

**Rationale**: Tier-2 CNECs get selective treatment—binary status for all 150, but continuous metrics aggregated by border to reduce dimensionality while preserving geographic patterns.

**Category 3: Hybrid PTDF Features (730 features)**

Three-part PTDF strategy balancing detail and dimensionality:

```python
# 1. Individual PTDFs for Tier-1 (600 features = 50 CNECs × 12 zones)
# Already captured in Category 1 above

# 2. Border-Aggregated PTDFs for Tier-2 (120 features = 10 borders × 12 zones)
for border in top_10_borders:
    for zone in all_12_zones:
        f'avg_ptdf_{border}_{zone}': mean PTDF for CNECs on this border,
        f'max_ptdf_{border}_{zone}': max PTDF for CNECs on this border,
# Example: avg_ptdf_de_cz_DE_LU, max_ptdf_de_cz_FR

# 3. PCA Components (10 features)
ptdf_pc1, ptdf_pc2, ..., ptdf_pc10  # Capture 92% variance
```

**Total PTDF Features**: 600 (from Tier-1) + 120 (Tier-2 aggregates) + 10 (PCA) = 730

**Category 4: LTN Features (40 features) - PERFECT FUTURE COVARIATE**

Long Term Nominations are known with certainty years in advance, making them perfect future covariates:

```python
# Historical context (20 features = 20 borders)
for border in all_20_borders:
    f'ltn_historical_{border}': LTN MW value from past 21 days,

# Future perfect covariate (20 features = 20 borders)
for border in all_20_borders:
    f'ltn_future_{border}': LTN MW value for forecast horizon (known!),

# Impact on Max BEX:
# Max BEX ≈ Theoretical Max - LTN - Other Constraints
```

**Why Critical**: LTN is allocated in yearly auctions and doesn't change hour-to-hour. The model can learn the relationship between LTN levels and remaining available capacity (Max BEX) with perfect foresight.

**Category 5: Net Position Features (48 features) - DOMAIN BOUNDARIES**

Net position min/max define the feasible space for each zone:

```python
# For each of 12 zones:
for zone in ['DE_LU', 'FR', 'BE', 'NL', 'AT', 'CZ', 'PL', 'HU', 'RO', 'SK', 'SI', 'HR']:
    f'net_pos_min_{zone}': Import limit (MW, negative),         # 12 features
    f'net_pos_max_{zone}': Export limit (MW, positive),         # 12 features
    f'net_pos_range_{zone}': max - min (degrees of freedom),    # 12 features
    f'net_pos_margin_{zone}': (actual - min) / range,           # 12 features

# Total: 12 zones × 4 metrics = 48 features
```

**Derived insight**: `zone_stress = 1 / (net_pos_range + 1)`. Tight ranges → constrained system → lower Max BEX.

**Category 6: Non-Core ATC Features (28 features) - LOOP FLOWS**

External borders cause loop flows through Core FBMC network:

```python
# 14 external borders × 2 directions = 28 features
external_borders = [
    'FR-UK', 'FR-ES', 'FR-CH', 'DE-CH', 'AT-IT', 'AT-CH',
    'DE-DK1', 'DE-DK2', 'PL-SE4', 'SI-IT', 'PL-LT', 'PL-UA',
    'RO-BG', 'HR-BA'
]

for border in external_borders:
    f'atc_forward_{border}': Forward capacity (MW),
    f'atc_backward_{border}': Backward capacity (MW),
```

**Why Critical**: FR-UK flows affect FR-BE and FR-DE via network physics. The model learns how external flows constrain Core capacity.

**Category 7: Max BEX Historical (40 features) - TARGET AS FEATURE**

Max BEX historical values serve as context for predicting future Max BEX:

```python
# Historical context for 20 borders × 2 directions = 40 features
for border in all_20_borders:
    f'max_bex_historical_forward_{border}': Past 21-day context,
    f'max_bex_historical_backward_{border}': Past 21-day context,
```

**Rationale**: The model learns auto-regressive patterns. Yesterday's Max BEX informs today's forecast.

**Category 8: Weather Spatial Features (364 features)**

52 strategic grid points × 7 weather variables:

```python
# For each of 52 grid points:
for point in spatial_grid_52:
    f'temperature_2m_{point}': Temperature (°C),
    f'windspeed_10m_{point}': Surface wind (m/s),
    f'windspeed_100m_{point}': Turbine height wind (m/s),
    f'winddirection_100m_{point}': Wind direction (degrees),
    f'shortwave_radiation_{point}': Solar GHI (W/m²),
    f'cloudcover_{point}': Cloud cover (%),
    f'surface_pressure_{point}': Pressure (hPa),

# Total: 52 points × 7 variables = 364 features
```

**Why Spatial Matters**: 30 GW of German wind has different CNEC impacts depending on location (North Sea vs Baltic vs Southern).

**Category 9: Regional Generation Patterns (60 features)**

```python
# Per major zone (12 zones × 5 metrics = 60 features)
for zone in all_12_zones:
    f'wind_gen_{zone}': Wind generation (MW),
    f'solar_gen_{zone}': Solar generation (MW),
    f'thermal_gen_{zone}': Thermal generation (MW),
    f'hydro_gen_{zone}': Hydro generation (MW),
    f'nuclear_gen_{zone}': Nuclear generation (MW),
```

**Key patterns**:
- Austrian hydro >8 GW affects DE-CZ-PL flows
- Belgian nuclear outages stress FR-BE border
- French nuclear <80% capacity triggers imports

**Category 10: Temporal Encoding (20 features)**
```python
temporal_features = {
    # Cyclical encoding
    'hour_sin': np.sin(2 * np.pi * hour / 24),
    'hour_cos': np.cos(2 * np.pi * hour / 24),
    
    # Day patterns
    'day_of_week': weekday,  # 0-6
    'is_weekend': 1 if weekday >= 5 else 0,
    
    # Season
    'season': season_number,  # 0-3
    'month': month,
    
    # Holidays (major countries only)
    'is_holiday_de': is_german_holiday(timestamp),
    'is_holiday_fr': is_french_holiday(timestamp),
    'is_holiday_nl': is_dutch_holiday(timestamp),
    'is_holiday_be': is_belgian_holiday(timestamp),

    # Temperature-related (3 features)
    'heating_degree_days': max(0, 18 - avg_temp),
    'cooling_degree_days': max(0, avg_temp - 18),
    'extreme_temp_flag': 1 if (avg_temp < -5 or avg_temp > 35) else 0,

    # Market timing (5 features)
    'hours_since_last_outage': hours_since_last_major_outage,
    'days_into_month': day_of_month,
    'week_of_year': week_number,
    'is_month_end': 1 if day_of_month > 28 else 0,
    'is_quarter_end': 1 if last_week_of_quarter else 0,
}
```

**Category 11: System-Level Aggregates (20 features)**

Network-wide indicators capturing overall system state:

```python
system_features = {
    # CNEC aggregates (8 features)
    'system_min_margin': min(margin_ratio) across all 200 CNECs,
    'n_binding_cnecs_tier1': count(presolved==1) in Tier-1,
    'n_binding_cnecs_tier2': count(presolved==1) in Tier-2,
    'n_binding_cnecs_total': total binding across all 200,
    'total_congestion_cost': sum(shadow_price) across all CNECs,
    'avg_congestion_cost': mean(shadow_price) for binding CNECs,
    'binding_cnec_diversity': count(unique borders) with binding CNECs,
    'max_binding_concentration': max binding count on single border,

    # Network stress indicators (6 features)
    'network_stress_index': weighted sum of (1 - margin_ratio),
    'tight_cnec_count': count(margin_ratio < 0.15),
    'very_tight_cnec_count': count(margin_ratio < 0.05),
    'system_available_margin': sum(ram_after) across all CNECs,
    'fraction_cnecs_published': published_count / 200,
    'zone_stress_max': max(zone_stress) across all 12 zones,

    # Flow indicators (6 features)
    'total_cross_border_flow': sum(abs(flows)) across all 20 borders,
    'max_single_border_flow': max(flow) across all borders,
    'avg_border_utilization': mean(flow / max_bex) across borders,
    'congested_borders_count': count(utilization > 0.9),
    'reverse_flow_count': count(flow opposite to typical direction),
    'flow_asymmetry_max': max(abs(forward_flow - backward_flow)),
}
```

**[DEPRECATED Category 9: NTC Features - Now Covered by Max BEX + LTN]**
```python
ntc_features = {
    # Per-border deviation signals (top 10 borders ÃƒÆ’Ã¢â‚¬â€ 2 = 20)
    'ntc_d1_forecast': 'Tomorrow's NTC per border',
    'ntc_deviation_pct': '% change vs 30-day baseline',
    
    # Aggregate indicators (5 features)
    'ntc_system_stress': 'Count of borders below 80% baseline',
    'ntc_max_drop_pct': 'Largest drop across all borders',
    'ntc_planned_outage_flag': 'Binary: any announced outage',
    'ntc_total_capacity_change_mw': 'Sum of all MW changes',
    'ntc_asymmetry_count': 'Directional mismatches',
}
```

---

**TOTAL FEATURE COUNT: ~1,735 features**

**Breakdown Summary:**
- **Tier-1 CNEC Features**: 1,000 (50 CNECs × 20 features each)
- **Tier-2 CNEC Features**: 360 (300 binary + 60 border aggregates)
- **Hybrid PTDF Features**: 730 (600 individual + 120 aggregates + 10 PCA)
- **LTN Features**: 40 (perfect future covariate)
- **Net Position Features**: 48 (domain boundaries)
- **Non-Core ATC Features**: 28 (external loop flows)
- **Max BEX Historical**: 40 (target as feature)
- **Weather Spatial**: 364 (52 points × 7 variables)
- **Regional Generation**: 60 (5 types × 12 zones)
- **Temporal**: 20 (cyclical + calendar + market timing)
- **System Aggregates**: 20 (network-wide indicators)
- **TOTAL**: ~1,710 → rounded to **~1,735 features**

**Feature Calculation Timeline:**
- **Baselines**: Use full 24-month history (Oct 2023 - Sept 2025)
- **Context Window**: Recent 512 hours (21 days) for each prediction
- **Year-over-Year**: 24 months enables seasonal comparisons and YoY features
- **No Training**: All features feed into frozen Chronos 2 model (zero-shot inference)

### 2.8 Data Cleaning and Preprocessing Procedures

#### Critical Data Quality Rules

Data quality is essential for the ~1,735-feature pipeline. All cleaning procedures follow priority hierarchies and field-specific strategies.

#### A. Missing Value Handling Strategy

Priority hierarchy for imputation:

**Priority 1: Forward-Fill (max 2 hours)** - For slowly-changing values
**Priority 2: Zero-Fill** - For count/binary fields
**Priority 3: Linear Interpolation** - For continuous metrics with gaps <6 hours
**Priority 4: Drop** - If gap >6 hours or >10% of series missing

**Field-Specific Strategies:**

```python
# RAM values
if ram_missing and gap_hours <= 2:
    ram_after = forward_fill(ram_after, max_hours=2)
elif gap_hours <= 6:
    ram_after = interpolate_linear(ram_after)
else:
    ram_after = fmax  # Assume unconstrained if data missing

# CNEC binding status (binary)
if presolved_missing:
    presolved = False  # Conservative: assume not binding
    cnec_mask = 0      # Flag as unpublished

# Shadow prices
if shadow_price_missing:
    shadow_price = 0  # No congestion signal

# PTDF values
if ptdf_missing:
    ptdf = 0  # Zero sensitivity if not provided

# LTN values (should never be missing - known in advance)
if ltn_missing:
    ltn = last_known_value  # Use last published value

# Net positions
if net_pos_min_missing or net_pos_max_missing:
    net_pos_min = interpolate_linear(net_pos_min)
    net_pos_max = interpolate_linear(net_pos_max)
```

#### B. Outlier Detection and Clipping

```python
# RAM cannot exceed Fmax or be negative
ram_after = np.clip(ram_after, 0, fmax)

# Margin ratio must be in [0, 1]
margin_ratio = np.clip(ram_after / fmax, 0, 1)

# PTDF valid range (with tolerance for numerical precision)
ptdf_values = np.clip(ptdf_values, -1.5, 1.5)

# Shadow prices (cap at 99.9th percentile or €1000/MW)
shadow_price_cap = min(1000, np.percentile(shadow_price, 99.9))
shadow_price = np.clip(shadow_price, 0, shadow_price_cap)

# Max BEX cannot be negative or exceed theoretical maximum
max_bex = np.clip(max_bex, 0, theoretical_max_capacity)

# Net position range must be positive
net_pos_range = max(0, net_pos_max - net_pos_min)
```

#### C. Timestamp Alignment

JAO uses "business day + delivery hour" format. Convert to UTC:

```python
# JAO format: Business Day 2025-01-15, Delivery Hour 18:00-19:00 CET
# Convert to UTC timestamp: 2025-01-15 17:00:00 UTC (CET is UTC+1)

def convert_jao_to_utc(business_day, delivery_hour, is_dst=False):
    # Delivery hour is 1-24 (not 0-23)
    utc_hour = delivery_hour - 1  # Convert to 0-23

    # Account for CET/CEST offset
    if is_dst:  # CEST (summer time) is UTC+2
        utc_hour -= 2
    else:  # CET (winter time) is UTC+1
        utc_hour -= 1

    # Handle day boundary crossings
    if utc_hour < 0:
        business_day -= timedelta(days=1)
        utc_hour += 24
    elif utc_hour >= 24:
        business_day += timedelta(days=1)
        utc_hour -= 24

    timestamp_utc = datetime.combine(business_day, time(hour=utc_hour))
    return timestamp_utc

# Account for DST transitions
# DST starts: Last Sunday of March at 2:00 AM → 3:00 AM
# DST ends: Last Sunday of October at 3:00 AM → 2:00 AM
if is_dst_transition(business_day):
    timestamp_utc = adjust_for_dst(timestamp_utc)
```

#### D. Duplicate Handling

```python
# For D-1 vs D-2 PTDF conflicts: keep D-1 only (most recent forecast)
ptdf_df = ptdf_df.sort_values('publication_time').drop_duplicates(
    subset=['timestamp', 'cnec_id'],
    keep='last'  # Most recent publication
)

# For multiple publications per (timestamp, cnec): keep latest
cnec_df = cnec_df.drop_duplicates(
    subset=['timestamp', 'cnec_id'],
    keep='last'
)

# For Max BEX: keep latest publication
max_bex_df = max_bex_df.drop_duplicates(
    subset=['timestamp', 'border', 'direction'],
    keep='last'
)

# For LTN: no duplicates expected (yearly auction results)
# If found, keep the official publication
ltn_df = ltn_df.drop_duplicates(
    subset=['timestamp', 'border'],
    keep='first'  # Official publication
)
```

#### E. CNEC Masking for Unpublished Constraints

**Critical for 200-CNEC system**: Not all CNECs are published every day.

```python
# Create complete timestamp × CNEC cartesian product
all_timestamps = pd.date_range('2023-10-01', '2025-09-30', freq='H')
all_cnecs = master_cnec_list_200  # 200 CNECs

# Create full matrix
full_matrix = pd.MultiIndex.from_product(
    [all_timestamps, all_cnecs],
    names=['timestamp', 'cnec_id']
)

complete_df = pd.DataFrame(index=full_matrix).join(
    cnec_df.set_index(['timestamp', 'cnec_id']),
    how='left'
)

# Impute missing CNECs (not published = not binding)
complete_df['cnec_mask'] = complete_df['ram_after'].notna().astype(int)
complete_df['ram_after'].fillna(complete_df['fmax'], inplace=True)
complete_df['presolved'].fillna(False, inplace=True)
complete_df['shadow_price'].fillna(0, inplace=True)
complete_df['margin_ratio'] = complete_df['ram_after'] / complete_df['fmax']

# For Tier-1 CNECs: fill outage features
complete_df['outage_active'].fillna(0, inplace=True)
complete_df['outage_elapsed'].fillna(0, inplace=True)
complete_df['outage_remaining'].fillna(0, inplace=True)
complete_df['outage_total_duration'].fillna(0, inplace=True)
```

**Why Critical**: The `cnec_mask` feature tells Chronos 2 which constraints were active vs inactive, enabling it to learn CNEC activation patterns.

#### F. Data Validation Checks

```python
# Validation thresholds
assert ram_after.isna().sum() / len(ram_after) < 0.05, ">5% missing RAM values"
assert ptdf_values.abs().max() < 1.5, "PTDF outside valid range"
assert (ram_after > fmax).sum() == 0, "RAM exceeds Fmax"
assert cnec_coverage > 0.95, "CNEC master list <95% complete"

# Feature completeness check
assert max_bex_df.isna().sum().sum() < 0.01 * len(max_bex_df), "Max BEX >1% missing"
assert ltn_df.isna().sum().sum() == 0, "LTN should have zero missing values"

# Geographic diversity check
borders_represented = identify_borders_from_cnecs(master_cnec_list_200)
assert len(borders_represented) >= 18, "200 CNECs don't cover enough borders (need ≥18/20)"

# Tier structure validation
assert len(tier1_cnecs) == 50, "Tier-1 must have exactly 50 CNECs"
assert len(tier2_cnecs) == 150, "Tier-2 must have exactly 150 CNECs"
assert set(tier1_cnecs).isdisjoint(set(tier2_cnecs)), "No overlap between tiers"

# PTDF matrix validation
assert ptdf_matrix.shape == (200, 12), "PTDF matrix must be 200 CNECs × 12 zones"
pca_variance = pca.explained_variance_ratio_[:10].sum()
assert pca_variance > 0.90, f"PCA captures only {pca_variance:.1%} variance (need >90%)"
```

**Day 1-2 Deliverable**: Document all data quality issues found during collection and cleaning. Track:
- Missing value percentages by field
- Number of outliers clipped
- Duplicate records removed
- CNEC publication frequency
- Data completeness by border/zone

### 2.9 CNEC Selection: 200 Total (50 Tier-1 + 150 Tier-2)

#### Weighted Scoring Algorithm

Instead of simple binding frequency, we use a comprehensive weighted scoring:

**Step 1: Calculate Impact Score for All CNECs (3 hours)**

From 24 months of JAO historical data, calculate weighted scoring for every CNEC:

```python
# From JAO historical data (24 months)
cnec_analysis = jao_historical.groupby('cnec_id').agg({
    'presolved': 'sum',                    # Binding frequency
    'shadow_price': 'mean',                # Economic impact
    'ram_after': 'mean',                   # Average margin
    'fmax': 'first',                       # Maximum flow
    'timestamp': 'count',                  # Days appeared
}).reset_index()

# Calculate components
cnec_analysis['binding_frequency'] = (
    cnec_analysis['presolved'] / cnec_analysis['timestamp']
)
cnec_analysis['low_ram_frequency'] = (
    (cnec_analysis['ram_after'] < 0.2 * cnec_analysis['fmax']).sum() / cnec_analysis['timestamp']
)
cnec_analysis['days_appeared'] = cnec_analysis['timestamp'] / 24  # Convert hours to days
cnec_analysis['appearance_rate'] = cnec_analysis['days_appeared'] / 730  # 24 months ≈ 730 days

# Weighted Impact Score
cnec_analysis['impact_score'] = (
    0.40 * cnec_analysis['binding_frequency'] +
    0.30 * (cnec_analysis['shadow_price'] / 100) +  # Normalize to 0-1 range
    0.20 * cnec_analysis['low_ram_frequency'] +
    0.10 * cnec_analysis['appearance_rate']
)

# Sort and select top 200
top_200_cnecs = cnec_analysis.sort_values('impact_score', ascending=False).head(200)

# Split into tiers
tier1_cnecs = top_200_cnecs.head(50)   # Highest impact
tier2_cnecs = top_200_cnecs.tail(150)  # Next 150
```

**Step 2: Geographic Clustering from Country Codes (1 hour)**
```python
# JAO CNEC IDs already contain geographic information
'DE_CZ_TIE_1234' ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Border: DE-CZ, Type: Transmission Line
'FR_BE_LINE_5678' ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Border: FR-BE, Type: Line
'AT_HU_PST_9012' ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Border: AT-HU, Type: Phase Shifter

# Group CNECs by border
cnec_groups = {
    'DE_CZ_border': [all CNECs with 'DE_CZ' in ID],
    'FR_BE_border': [all CNECs with 'FR_BE' in ID],
    # ...for all borders
}
```

**Step 3: PTDF Sensitivity Analysis (2 hours)**
```python
# Which zones most affect each CNEC?
# Focus on Tier-1 CNECs (50) for detailed analysis
for cnec in tier1_cnecs:  # 50 CNECs from weighted scoring
    cnec['sensitive_zones'] = ptdf_matrix[cnec_id].nlargest(5)
    # Tells us geographic span without exact coordinates
```

**Step 4: Weather Pattern Correlation (2 hours)**
```python
# Which weather patterns correlate with CNEC binding?
# Focus on Tier-1 CNECs (50) for detailed weather correlation analysis
for cnec in tier1_cnecs:  # 50 CNECs from weighted scoring
    cnec['weather_drivers'] = correlate_with_weather(
        cnec['binding_history'],
        weather_historical
    )
    # Example: CNEC binds when North Sea wind > 25 m/s
```

#### What We DON'T Need for MVP

ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ ENTSO-E EIC code database matching
ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ PyPSA-EUR network topology reconciliation
ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Exact substation GPS coordinates
ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Physical line names (anonymized anyway)
ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Full transmission grid modeling

#### What We GET Instead

ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" 200 CNECs identified and ranked (50 Tier-1 + 150 Tier-2)
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" Geographic grouping by border
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" PTDF-based sensitivity understanding for Tier-1 CNECs
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" Weather pattern associations for Tier-1 CNECs
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å" **Total time: 8 hours vs 3 weeks**

#### Zero-Shot Learning Without Full Reconciliation

The model learns:
```
North Sea wind (25 m/s) + Low Baltic wind (5 m/s) + High German demand
ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ CNECs in 'DE_CZ' group bind
ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ DE-CZ capacity reduces to 1200 MW
```

Without needing to know:
- That 'DE_CZ_TIE_1234' is "Etzenricht-Prestice 380kV line"
- Exact GPS: 49.7Ãƒâ€šÃ‚Â°N, 12.5Ãƒâ€šÃ‚Â°E
- ENTSO-E asset ID: XXXXXXXXXX

Because **geographic clustering + PTDF patterns provide sufficient spatial resolution** for zero-shot inference.

### 2.9 Net Transfer Capacity (NTC) as Outage Detection Layer

#### Rationale for Simplified NTC Integration

NTC forecasts provide **planning information** invisible to weather/CNEC patterns:

1. **Planned Outages**: TSO maintenance announcements show in NTC drops weeks ahead
2. **Topology Changes**: New interconnectors, seasonal limits appear in NTC forecasts first
3. **Safety Margins**: N-1 rule adjustments not weather-dependent

**Critical Example**:
```
Weather forecast: Normal conditions
Historical CNECs: No unusual patterns
Zero-shot prediction: 2,800 MW
NTC forecast: Drops to 1,900 MW due to maintenance
ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Without NTC: 900 MW error!
```

#### MVP Approach: D+1 Forecasts Only, Simple Features

**Scope**:
- **D+1 NTC forecasts only** (not weekly/monthly)
- **All Core FBMC borders** (~20 borders)
- **Critical external borders**: IT-FR, IT-AT, ES-FR, CH-DE, CH-FR, GB-FR, GB-NL, GB-BE
- **Simple feature engineering** (no complex modeling)

**Feature Set (~20-25 features total)**:

```python
ntc_simplified_features = {
    # Per-border deviation signals (top 10 borders ÃƒÆ’Ã¢â‚¬â€ 2 = 20 features)
    'ntc_d1_forecast': 'Tomorrow's NTC per border',
    'ntc_deviation_pct': '% change vs 30-day baseline',
    
    # Aggregate indicators (5 features)
    'ntc_system_stress': 'Count of borders below 80% baseline',
    'ntc_max_drop_pct': 'Largest drop across all borders',
    'ntc_planned_outage_flag': 'Binary: any announced outage',
    'ntc_total_capacity_change_mw': 'Sum of all MW changes',
    'ntc_asymmetry_count': 'Directional mismatches (import vs export)'
}
```

#### Data Sources

**ENTSO-E Transparency Platform (FREE API)**:
```python
from entsoe import EntsoePandasClient

client = EntsoePandasClient(api_key='YOUR_KEY')

# D+1 NTC forecast per border
ntc_forecast = client.query_offered_capacity(
    'DE_LU',  # From zone
    'FR',     # To zone
    start=tomorrow,
    end=tomorrow + timedelta(days=1),
    contract_type='daily'
)
```

### 2.10 Historical Data Requirements

**Dataset Period**: October 2023 - September 2025 (24 months)
- **Feature Baseline Period**: Oct 2023 - May 2025 (20 months)
- **Validation Period**: June-July 2025 (2 months)
- **Test Period**: Aug-Sept 2025 (2 months)

**Why This Full Period:**
- **Seasonal coverage**: 2+ full cycles of all seasons
- **Feature baselines**: Rolling averages, percentiles require history
- **Market diversity**: French nuclear variations, Austrian hydro patterns
- **Weather extremes**: Cold snaps, heat waves, wind droughts
- **Recent relevance**: FBMC algorithm evolves, recent patterns most valid

**Simplified Data Volume**:
- **52 weather points**: ~30 GB uncompressed (24 months)
- **200 CNECs**: ~10 GB uncompressed (24 months)
- **Total Storage**: ~40 GB uncompressed, ~12 GB in Parquet format

---

## 3. Hugging Face Spaces Infrastructure

### 3.1 Why Hugging Face Spaces for MVP

**Perfect for Development-Focused MVP:**
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Persistent GPU environment ($30/month for A10G)
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Chronos 2 natively hosted on Hugging Face
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Built-in Git versioning
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Easy collaboration and handover
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ No complex cloud infrastructure setup
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Jupyter/Gradio interface options
- ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Simple sharing with quantitative analyst

**vs AWS SageMaker (for comparison):**
- AWS: Better for production automation
- HF: Better for development and handover
- AWS: 2.5 hours setup + IAM + Lambda + S3
- HF: 30 minutes setup, pure data science focus

**MVP Scope Alignment:**
Since we're building a working model (not production deployment), HF Spaces eliminates infrastructure complexity while providing professional collaboration tools.

### 3.2 Hugging Face Spaces Setup (30 Minutes)

#### Option 1: Gradio Space (Recommended for Interactive Demo)
```python
# app.py in Hugging Face Space
import gradio as gr
from chronos import ChronosPipeline
import polars as pl

# Load model (cached automatically)
pipeline = ChronosPipeline.from_pretrained(
    "amazon/chronos-t5-large",
    device_map="cuda"
)

def forecast_capacity(border, start_date):
    """Generate 14-day forecast for selected border"""
    # Load features
    features = load_features(start_date)
    context = features[-512:]  # Last 21 days
    
    # Zero-shot inference
    forecast = pipeline.predict(
        context=context,
        prediction_length=336
    )
    
    return create_visualization(forecast, border)

# Gradio interface
demo = gr.Interface(
    fn=forecast_capacity,
    inputs=[
        gr.Dropdown(choices=FBMC_BORDERS, label="Border"),
        gr.Textbox(label="Start Date (YYYY-MM-DD)")
    ],
    outputs=gr.Plot()
)

demo.launch()
```

#### Option 2: JupyterLab Space (Recommended for Development)
- Select "JupyterLab" when creating Space
- Full notebook environment
- Better for iterative development
- Easy to convert to Gradio later

### 3.3 Hardware Configuration

**A10G GPU (Recommended for MVP):**
- Cost: $30/month
- VRAM: 24 GB
- Performance: <5 min for full 14-day forecast
- Sufficient for zero-shot inference
- No fine-tuning, so no need for A100

**A100 GPU (If Quant Analyst Needs Fine-Tuning):**
- Cost: $90/month
- VRAM: 40/80 GB options
- Performance: 2x faster than A10G
- Overkill for zero-shot MVP
- Upgrade path available

**Storage:**
- Free tier: 50 GB (sufficient for 6 GB Parquet data)
- Data persists across sessions
- Use HF Datasets for efficient data management and versioning
- Direct file upload for raw data files

### 3.4 Development Workflow

```
Local Development ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ HF Space Sync ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Testing ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Documentation

Day 1-2: Local data exploration
  ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬Å“
  Upload data to HF Space storage
  ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬Å“
Day 3: Feature engineering in HF Jupyter
  ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬Å“
Day 4: Zero-shot inference experiments
  ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬Å“
Day 5: Create Gradio demo + documentation
  ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬Å“
  Share Space URL with quant analyst
```

### 3.5 Cost Breakdown

| Component | Monthly Cost |
|-----------|--------------|
| HF Space A10G GPU | $30.00 |
| Storage (50 GB included) | $0.00 |
| Data transfer | $0.00 |
| **TOTAL** | **$30.00/month** |

**vs Original AWS Plan:**
- AWS: <$10/month but 8 hours setup + production complexity
- HF: $30/month but 30 min setup + clean handover
- Trade $20/month for ~7.5 hours saved + better collaboration

### 3.6 Data Management in HF Spaces

**Recommended Structure:**
```
/home/user/
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ data/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ jao_24m.parquet           # 24 months historical JAO
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ entsoe_24m.parquet        # ENTSO-E forecasts
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ weather_24m.parquet       # 52-point weather grid
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ features_24m.parquet      # Engineered features (~1,735 features)
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ notebooks/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 01_data_exploration.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 02_feature_engineering.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 03_zero_shot_inference.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ src/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ data_collection/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ feature_engineering/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ evaluation/
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ config/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ spatial_grid.yaml        # 52 weather points
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ cnec_top50.json          # Pre-identified CNECs
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ results/
    ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ zero_shot_performance.json
    ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ error_analysis.csv
```

**Upload Strategy:**
```bash
# Local download first
python scripts/download_all_data.py  # Downloads to local ./data

# Validate locally
python scripts/validate_data.py

# Upload to HF Space using HF Datasets
# Option 1: For processed features (recommended)
from datasets import Dataset
import pandas as pd

df = pd.read_parquet("data/processed_features.parquet")
dataset = Dataset.from_pandas(df)
dataset.push_to_hub("your-space/fbmc-data")

# Option 2: Direct file upload for raw data
# Upload via HF Space UI or use huggingface_hub CLI:
# huggingface-cli upload your-space/fbmc-forecasting ./data/raw --repo-type=space
```

### 3.7 Collaboration Features

**For Quantitative Analyst Handover:**

1. **Fork Space**: Analyst gets exact copy with one click
2. **Git History**: See entire development progression
3. **README**: Comprehensive documentation auto-displayed
4. **Environment**: Dependencies automatically replicated
5. **Data Access**: Shared data storage, no re-download

**Sharing Link:**
```
https://huggingface.co/spaces/yourname/fbmc-forecasting
```

Analyst can:
- Run notebooks interactively
- Modify feature engineering
- Experiment with fine-tuning
- Deploy to production when ready
- Keep or upgrade GPU tier

---

## 3. Historical Features vs Future Covariates: Complete Data Architecture

### 3.1 Critical Distinction: Three Time Periods

Understanding how data flows through the system is essential for zero-shot forecasting. There are **three distinct time periods** with different roles:

```
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 2 YEARS ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ‚Â¤ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 21 DAYS ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ‚Â¤ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 14 DAYS ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ‚Â¤
Jan 2023              July 25  ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡  Aug 15       ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡  Aug 29

                                ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡               ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
        FEATURE                 ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡  HISTORICAL   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡  FUTURE
        BASELINES              ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡  CONTEXT      ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡  COVARIATES
                                ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡               ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
Used to calculate features     ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡ Actual values ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡ Forecasted values
Model never sees directly       ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡ (512, 70)    ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡ (336, 15)
                                ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡               ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
                                ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ‚Â¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ‹Å“
                                        ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
                                   MODEL INPUT
                                        ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
                                        ÃƒÂ¢Ã¢â‚¬â€œÃ‚Â¼
                                  PREDICTION
                                (336 hours ÃƒÆ’Ã¢â‚¬â€ 20 borders)
```

#### Period 1: 2-Year Historical Dataset (Oct 2023 - Sept 2025)

**Purpose:** Calculate feature baselines and provide historical context for feature engineering

**Content:**
- Raw JAO data (CNECs, PTDFs, RAMs, shadow prices)
- Raw ENTSO-E data (actual generation, actual load, actual flows)
- Raw weather data (52 grid points)

**Model Access:** NONE - Model never directly sees this data

**Usage Examples:**
```python
# Calculate 30-day moving average for August 15
ram_30d_avg = historical_data['2025-07-16':'2025-08-15']['ram'].mean()

# Calculate seasonal baseline
august_wind_baseline = historical_data[
    (month == 8) & (year.isin([2023, 2024]))
]['wind'].mean()

# Calculate percentile ranking
ram_percentile = percentile_rank(
    current_ram,
    historical_data['2025-05-17':'2025-08-15']['ram']  # 90 days
)
```

#### Period 2: 21-Day Historical Context (July 25 - Aug 15)

**Purpose:** Provide model with recent patterns that led to current moment

**Content:**
- 70 engineered features (calculated using 24-month baselines)
- Actual historical values: RAM, capacity, CNECs, weather outcomes
- Recent trends, volatilities, moving averages

**Model Access:** DIRECT - This is what the model "reads"

**Shape:** (512 hours, 70 features) [DEPRECATED - see updated feature architecture with ~1,735 features]

**Feature Categories:**
```python
historical_context_features = {
    'ptdf_patterns': 10,          # PCA components from historical PTDFs
    'ram_patterns': 8,            # Moving averages, percentiles, flags
    'cnec_patterns': 10,          # Binding frequencies, activation rates
    'capacity_historical': 20,    # Actual past capacity per border
    'derived_indicators': 22,     # Volatilities, trends, anomaly flags
}
# Total: 70 features describing what happened
```

#### Period 3: 14-Day Future Covariates (Aug 15 - Aug 29)

**Purpose:** Provide model with expected future conditions

**Content:**
- 15-20 forward-looking features
- Forecasts: renewable generation, demand, weather, NTC
- Deterministic: temporal features (hour, day, holidays)

**Model Access:** DIRECT - These are "givens" about the future

**Shape:** (336 hours, 15 features)

**Feature Categories:**
```python
future_covariate_features = {
    'renewable_forecasts': 4,     # Wind/solar (DE, FR)
    'demand_forecasts': 2,        # Load forecasts (DE, FR)
    'weather_forecasts': 5,       # Temp, wind speeds, radiation
    'ntc_forecasts': 1,           # NTC D+1 (extended intelligently)
    'temporal_deterministic': 5,  # Hour, day, weekend, holiday, season
}
# Total: 17 features describing what's expected
```

### 3.2 Forecast Availability by Data Source

Different data sources provide forecasts with different horizons:

| Data Source | Parameter | Horizon | Update Frequency | Extending Strategy |
|-------------|-----------|---------|------------------|-------------------|
| **ENTSO-E** | Wind generation | D+1 to D+2 (48h) | Hourly | Derive from weather |
| **ENTSO-E** | Solar generation | D+1 to D+2 (48h) | Hourly | Derive from weather |
| **ENTSO-E** | Load/Demand | D+1 to D+7 (168h) | Daily | Seasonal patterns |
| **ENTSO-E** | Cross-border flows | D+1 only (24h) | Daily | Baseline + weather |
| **ENTSO-E** | NTC forecasts | D+1 only (24h) | Daily | Persistence + seasonal |
| **OpenMeteo** | Temperature | D+1 to D+14 (336h) | 6-hourly | Native |
| **OpenMeteo** | Wind speed 100m | D+1 to D+14 (336h) | 6-hourly | Native |
| **OpenMeteo** | Solar radiation | D+1 to D+14 (336h) | 6-hourly | Native |
| **OpenMeteo** | Cloud cover | D+1 to D+14 (336h) | 6-hourly | Native |
| **Deterministic** | Hour/Day/Season | Infinite | N/A | Perfect knowledge |

**Key Insight:** Weather forecasts extend to D+14, but renewable generation forecasts only to D+2. We can **derive** D+3 to D+14 renewable forecasts from weather data.

### 3.3 Smart Forecast Extension Strategies

#### Strategy 1: Derive Renewable Forecasts from Weather (Primary Method)

**Problem:** ENTSO-E wind/solar forecasts end at D+2, but we need D+14  
**Solution:** Use weather forecasts (available to D+14) to derive generation forecasts

##### Wind Generation Extension

```python
class WindForecastExtension:
    """
    Extend ENTSO-E wind forecasts (D+1-D+2) to D+14 using weather data
    """
    
    def __init__(self, zone, historical_data):
        """
        Calibrate zone-specific wind power curve from 24-month history
        """
        self.zone = zone
        self.power_curve = self._calibrate_power_curve(historical_data)
        self.installed_capacity = self._get_installed_capacity(zone)
        self.weather_points = self._get_weather_points(zone)
    
    def _calibrate_power_curve(self, historical_data):
        """
        Learn relationship: wind_speed_100m ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ generation (MW)
        
        Uses 24-month historical data to build empirical power curve
        """
        # Extract relevant weather points for this zone
        if self.zone == 'DE_LU':
            points = ['DE_north_sea', 'DE_baltic', 'DE_north', 'DE_south']
        elif self.zone == 'FR':
            points = ['FR_north', 'FR_west', 'FR_brittany']
        elif self.zone == 'NL':
            points = ['NL_offshore', 'NL_north', 'NL_central']
        # ... etc
        
        # Get historical wind speeds at turbine height (100m)
        weather = historical_data['weather']
        wind_speeds = weather.loc[weather['grid_point'].isin(points), 'windspeed_100m']
        wind_speeds_avg = wind_speeds.groupby(wind_speeds.index).mean()
        
        # Get historical actual generation
        generation = historical_data['entsoe'][f'{self.zone}_wind_actual']
        
        # Align timestamps
        common_index = wind_speeds_avg.index.intersection(generation.index)
        wind_aligned = wind_speeds_avg[common_index]
        gen_aligned = generation[common_index]
        
        # Build power curve using binning and interpolation
        from scipy.interpolate import interp1d
        
        # Create wind speed bins
        wind_bins = np.arange(0, 30, 0.5)  # 0-30 m/s in 0.5 m/s steps
        power_output = []
        
        for i in range(len(wind_bins) - 1):
            mask = (wind_aligned >= wind_bins[i]) & (wind_aligned < wind_bins[i+1])
            
            if mask.sum() > 10:  # Need minimum samples
                # Use 50th percentile (median) to avoid outliers
                power_output.append(gen_aligned[mask].quantile(0.5))
            else:
                # Not enough data, interpolate later
                power_output.append(np.nan)
        
        # Fill NaN values with interpolation
        power_output = pd.Series(power_output).interpolate(method='cubic').fillna(0)
        
        # Create smooth interpolation function
        power_curve_func = interp1d(
            wind_bins[:-1],
            power_output,
            kind='cubic',
            bounds_error=False,
            fill_value=(0, power_output.max())  # 0 at low wind, max at high wind
        )
        
        return power_curve_func
    
    def extend_forecast(self, entose_forecast_d1_d2, weather_forecast_d1_d14):
        """
        Extend D+1-D+2 ENTSO-E forecast to D+14 using weather
        
        Args:
            entose_forecast_d1_d2: 48-hour ENTSO-E wind forecast
            weather_forecast_d1_d14: 336-hour weather forecast (wind speeds)
        
        Returns:
            extended_forecast: 336-hour wind generation forecast
        """
        # Use ENTSO-E forecast for D+1 and D+2 (first 48 hours)
        forecast_extended = entose_forecast_d1_d2.copy()
        
        # For D+3 to D+14, derive from weather
        weather_d3_d14 = weather_forecast_d1_d14[48:]  # Skip first 48 hours
        
        # Extract wind speeds from relevant weather points
        wind_speeds_forecasted = self._aggregate_weather_points(weather_d3_d14)
        
        # Apply calibrated power curve
        generation_d3_d14 = self.power_curve(wind_speeds_forecasted)
        
        # Apply capacity limits
        generation_d3_d14 = np.clip(
            generation_d3_d14,
            0,
            self.installed_capacity
        )
        
        # Add confidence adjustment (weather forecasts less certain far out)
        # Blend with seasonal baseline for D+10 to D+14
        for i, hour_ahead in enumerate(range(48, 336)):
            if hour_ahead > 216:  # Beyond D+9
                # Blend with seasonal average (increasing weight further out)
                blend_weight = (hour_ahead - 216) / 120  # 0 at D+9, 1 at D+14
                seasonal_avg = self._get_seasonal_baseline(
                    forecast_extended.index[0] + timedelta(hours=hour_ahead)
                )
                generation_d3_d14[i] = (
                    (1 - blend_weight) * generation_d3_d14[i] +
                    blend_weight * seasonal_avg
                )
        
        # Combine: ENTSO-E for D+1-D+2, derived for D+3-D+14
        forecast_full = np.concatenate([
            forecast_extended.values,
            generation_d3_d14
        ])
        
        return forecast_full
    
    def _aggregate_weather_points(self, weather_forecast):
        """
        Aggregate wind speeds from multiple weather points
        """
        # Weight by installed capacity at each point
        if self.zone == 'DE_LU':
            weights = {
                'DE_north_sea': 0.35,  # Major offshore capacity
                'DE_baltic': 0.25,     # Baltic offshore
                'DE_north': 0.25,      # Onshore northern
                'DE_south': 0.15       # Southern wind
            }
        # ... etc
        
        weighted_wind = 0
        for point, weight in weights.items():
            weighted_wind += weather_forecast[point]['windspeed_100m'] * weight
        
        return weighted_wind
    
    def _get_seasonal_baseline(self, timestamp):
        """
        Get typical generation for this hour/day/month
        """
        # From historical 24-month data
        # Return average for same month, same hour-of-day
        pass
```

##### Solar Generation Extension

```python
class SolarForecastExtension:
    """
    Extend ENTSO-E solar forecasts (D+1-D+2) to D+14 using weather data
    """
    
    def __init__(self, zone, historical_data):
        self.zone = zone
        self.solar_model = self._calibrate_solar_model(historical_data)
        self.installed_capacity = self._get_installed_capacity(zone)
        self.weather_points = self._get_weather_points(zone)
    
    def _calibrate_solar_model(self, historical_data):
        """
        Learn: solar_radiation + temperature ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ generation
        
        Solar is more complex than wind:
        - Depends on Global Horizontal Irradiance (GHI)
        - Panel efficiency decreases with temperature
        - Cloud cover matters
        - Time of day (sun angle) matters
        """
        from sklearn.ensemble import GradientBoostingRegressor
        
        weather = historical_data['weather']
        generation = historical_data['entsoe'][f'{self.zone}_solar_actual']
        
        # Get relevant weather points
        if self.zone == 'DE_LU':
            points = ['DE_south', 'DE_west', 'DE_east', 'DE_central']
        # ... etc
        
        # Extract features
        radiation = weather.loc[
            weather['grid_point'].isin(points),
            'shortwave_radiation'
        ].groupby(level=0).mean()
        
        temperature = weather.loc[
            weather['grid_point'].isin(points),
            'temperature_2m'
        ].groupby(level=0).mean()
        
        cloudcover = weather.loc[
            weather['grid_point'].isin(points),
            'cloudcover'
        ].groupby(level=0).mean()
        
        # Align with generation data
        common_index = radiation.index.intersection(generation.index)
        
        X = pl.DataFrame({
            'radiation': radiation[common_index],
            'temperature': temperature[common_index],
            'cloudcover': cloudcover[common_index],
            'hour': common_index.hour,
            'day_of_year': common_index.dayofyear,
            'cos_hour': np.cos(2 * np.pi * common_index.hour / 24),
            'sin_hour': np.sin(2 * np.pi * common_index.hour / 24),
        })
        
        y = generation[common_index]
        
        # Fit gradient boosting model (captures non-linear relationships)
        model = GradientBoostingRegressor(
            n_estimators=100,
            max_depth=5,
            learning_rate=0.1,
            random_state=42
        )
        
        model.fit(X, y)
        
        return model
    
    def extend_forecast(self, entose_forecast_d1_d2, weather_forecast_d1_d14):
        """
        Extend solar forecast using weather predictions
        """
        # Use ENTSO-E for D+1-D+2
        forecast_extended = entose_forecast_d1_d2.copy()
        
        # Derive from weather for D+3-D+14
        weather_d3_d14 = weather_forecast_d1_d14[48:]
        
        # Prepare features for solar model
        X_future = pl.DataFrame({
            'radiation': weather_d3_d14['shortwave_radiation'],
            'temperature': weather_d3_d14['temperature_2m'],
            'cloudcover': weather_d3_d14['cloudcover'],
            'hour': weather_d3_d14.index.hour,
            'day_of_year': weather_d3_d14.index.dayofyear,
            'cos_hour': np.cos(2 * np.pi * weather_d3_d14.index.hour / 24),
            'sin_hour': np.sin(2 * np.pi * weather_d3_d14.index.hour / 24),
        })
        
        # Predict generation
        generation_d3_d14 = self.solar_model.predict(X_future)
        
        # Apply physical constraints
        generation_d3_d14 = np.clip(
            generation_d3_d14,
            0,
            self.installed_capacity
        )
        
        # Zero out nighttime (sun angle below horizon)
        for i, timestamp in enumerate(weather_d3_d14.index):
            if timestamp.hour < 6 or timestamp.hour > 20:
                generation_d3_d14[i] = 0
        
        # Combine
        forecast_full = np.concatenate([
            forecast_extended.values,
            generation_d3_d14
        ])
        
        return forecast_full
```

#### Strategy 2: Extend Demand Forecasts Using Patterns

**Problem:** ENTSO-E load forecasts available to D+7, need D+14  
**Solution:** Use weekly patterns + weather sensitivity

```python
class DemandForecastExtension:
    """
    Extend ENTSO-E demand forecasts from D+7 to D+14
    """
    
    def __init__(self, zone, historical_data):
        self.zone = zone
        self.weekly_profile = self._calculate_weekly_profile(historical_data)
        self.temp_sensitivity = self._calculate_temp_sensitivity(historical_data)
    
    def _calculate_weekly_profile(self, historical_data):
        """
        Calculate typical demand profile by hour-of-week
        """
        demand = historical_data['entsoe'][f'{self.zone}_load_actual']
        
        # Group by hour-of-week (0-167)
        demand['hour_of_week'] = demand.index.dayofweek * 24 + demand.index.hour
        
        weekly_profile = demand.groupby('hour_of_week').agg({
            'load': ['mean', 'std', lambda x: x.quantile(0.1), lambda x: x.quantile(0.9)]
        })
        
        return weekly_profile
    
    def _calculate_temp_sensitivity(self, historical_data):
        """
        Learn how demand responds to temperature
        (heating degree days, cooling degree days)
        """
        demand = historical_data['entsoe'][f'{self.zone}_load_actual']
        weather = historical_data['weather']
        
        # Average temperature across zone
        temp = weather.groupby(weather.index)['temperature_2m'].mean()
        
        # Heating/cooling degree days
        hdd = np.maximum(18 - temp, 0)  # Heating
        cdd = np.maximum(temp - 22, 0)  # Cooling
        
        # Fit simple model: demand ~ baseline + hdd + cdd
        from sklearn.linear_model import LinearRegression
        
        X = pl.DataFrame({
            'hdd': hdd,
            'cdd': cdd,
            'hour': temp.index.hour,
            'day_of_week': temp.index.dayofweek
        })
        
        common_idx = X.index.intersection(demand.index)
        
        model = LinearRegression()
        model.fit(X.loc[common_idx], demand.loc[common_idx])
        
        return model
    
    def extend_forecast(self, entsoe_forecast_d1_d7, weather_forecast_d1_d14):
        """
        Extend demand forecast using weekly patterns + temperature
        """
        # Use ENTSO-E for D+1-D+7
        forecast_extended = entsoe_forecast_d1_d7.copy()
        
        # For D+8-D+14, use weekly patterns adjusted for temperature
        timestamps_d8_d14 = pd.date_range(
            forecast_extended.index[-1] + timedelta(hours=1),
            periods=168,  # 7 days
            freq='H'
        )
        
        demand_d8_d14 = []
        
        for timestamp in timestamps_d8_d14:
            # Get typical demand for this hour-of-week
            hour_of_week = timestamp.dayofweek * 24 + timestamp.hour
            baseline_demand = self.weekly_profile.loc[hour_of_week, ('load', 'mean')]
            
            # Adjust for forecasted temperature
            temp_forecast = weather_forecast_d1_d14.loc[timestamp, 'temperature_2m']
            
            hdd = max(18 - temp_forecast, 0)
            cdd = max(temp_forecast - 22, 0)
            
            # Apply temperature adjustment
            X_future = pl.DataFrame({
                'hdd': [hdd],
                'cdd': [cdd],
                'hour': [timestamp.hour],
                'day_of_week': [timestamp.dayofweek]
            })
            
            adjusted_demand = self.temp_sensitivity.predict(X_future)[0]
            
            # Blend baseline with temperature-adjusted (70/30 split)
            final_demand = 0.7 * baseline_demand + 0.3 * adjusted_demand
            
            demand_d8_d14.append(final_demand)
        
        # Combine
        forecast_full = np.concatenate([
            forecast_extended.values,
            demand_d8_d14
        ])
        
        return forecast_full
```

#### Strategy 3: NTC Forecast Extension

**Problem:** NTC forecasts typically only D+1 (24 hours)  
**Solution:** Use persistence with planned outage adjustments

```python
class NTCForecastExtension:
    """
    Extend NTC forecasts from D+1 to D+14
    """
    
    def __init__(self, border, historical_data):
        self.border = border
        self.seasonal_baseline = self._calculate_seasonal_baseline(historical_data)
        self.day_of_week_pattern = self._calculate_dow_pattern(historical_data)
    
    def extend_forecast(self, ntc_d1, planned_outages=None):
        """
        Extend single-day NTC forecast to 14 days
        
        Strategy:
        1. Use D+1 NTC as base
        2. Check for planned outages (TSO announcements)
        3. Apply seasonal patterns for D+2-D+14
        4. Adjust for day-of-week patterns (weekends often higher)
        """
        # Start with D+1 value
        ntc_extended = [ntc_d1]
        
        # For D+2-D+14
        for day in range(2, 15):
            if planned_outages and day in planned_outages:
                # Use announced reduction
                ntc_day = planned_outages[day]['reduced_capacity']
            else:
                # Use persistence with seasonal adjustment
                # Baseline: Average of D+1 and seasonal typical
                seasonal_typical = self.seasonal_baseline[day]
                ntc_day = 0.7 * ntc_d1 + 0.3 * seasonal_typical
                
                # Day-of-week adjustment
                dow_factor = self.day_of_week_pattern[day % 7]
                ntc_day *= dow_factor
            
            # Repeat for 24 hours
            ntc_extended.extend([ntc_day] * 24)
        
        return np.array(ntc_extended[:336])  # First 14 days only
```

### 3.4 Complete Feature Engineering Pipeline with Extensions

```python
class CompleteFBMCFeatureEngineer:
    """
    Engineer both historical and future features for zero-shot inference
    """
    
    def __init__(self, historical_data_2y):
        """
        Initialize with 24-month historical data for calibration
        """
        self.historical_data = historical_data_2y
        
        # Initialize forecast extension models
        self.wind_extenders = {
            zone: WindForecastExtension(zone, historical_data_2y)
            for zone in ['DE_LU', 'FR', 'NL', 'BE']
        }
        
        self.solar_extenders = {
            zone: SolarForecastExtension(zone, historical_data_2y)
            for zone in ['DE_LU', 'FR', 'NL', 'BE']
        }
        
        self.demand_extenders = {
            zone: DemandForecastExtension(zone, historical_data_2y)
            for zone in ['DE_LU', 'FR', 'NL', 'BE']
        }
        
        self.ntc_extenders = {
            border: NTCForecastExtension(border, historical_data_2y)
            for border in ['DE_FR', 'FR_DE', 'DE_NL', 'NL_DE']
        }
    
    def prepare_complete_input(self, prediction_time):
        """
        Prepare both historical context and future covariates
        
        Returns:
            historical_context: (512 hours, 70 features)
            future_covariates: (336 hours, 17 features)
        """
        # PART 1: HISTORICAL CONTEXT (21 days backward)
        historical_context = self._prepare_historical_context(prediction_time)
        
        # PART 2: FUTURE COVARIATES (14 days forward)
        future_covariates = self._prepare_future_covariates(prediction_time)
        
        return historical_context, future_covariates
    
    def _prepare_historical_context(self, prediction_time):
        """
        Prepare 512 hours of historical features
        """
        start = prediction_time - timedelta(hours=512)
        end = prediction_time
        
        # Extract raw historical data
        jao_hist = self.historical_data['jao'][start:end]
        entsoe_hist = self.historical_data['entsoe'][start:end]
        weather_hist = self.historical_data['weather'][start:end]
        
        # Engineer ~1,735 features (using full 24-month data for baselines)
        features = np.zeros((512, 70))
        
        # PTDF patterns (10 features)
        features[:, 0:10] = self._calculate_ptdf_features(jao_hist)
        
        # RAM patterns (8 features)
        features[:, 10:18] = self._calculate_ram_features(jao_hist)
        
        # CNEC patterns (10 features)
        features[:, 18:28] = self._calculate_cnec_features(jao_hist)
        
        # Historical capacities (20 features - one per border)
        features[:, 28:48] = self._extract_historical_capacities(jao_hist)
        
        # Derived patterns (22 features)
        features[:, 48:70] = self._calculate_derived_features(
            jao_hist, entsoe_hist, weather_hist
        )
        
        return features
    
    def _prepare_future_covariates(self, prediction_time):
        """
        Prepare 336 hours of future covariates with smart extensions
        """
        start = prediction_time
        end = prediction_time + timedelta(hours=336)
        
        features = np.zeros((336, 17))
        
        # Fetch short-horizon forecasts
        wind_de_d1_d2 = fetch_entsoe_forecast('DE_LU', 'wind', start, start + timedelta(hours=48))
        solar_de_d1_d2 = fetch_entsoe_forecast('DE_LU', 'solar', start, start + timedelta(hours=48))
        demand_de_d1_d7 = fetch_entsoe_forecast('DE_LU', 'load', start, start + timedelta(hours=168))
        ntc_de_fr_d1 = fetch_ntc_forecast('DE_FR', start, start + timedelta(hours=24))
        
        # Fetch weather forecasts (available to D+14)
        weather_d1_d14 = fetch_openmeteo_forecast(start, end)
        
        # EXTEND forecasts intelligently
        # Feature 0-3: Renewable forecasts (extended using weather)
        features[:, 0] = self.wind_extenders['DE_LU'].extend_forecast(
            wind_de_d1_d2, weather_d1_d14
        )
        features[:, 1] = self.solar_extenders['DE_LU'].extend_forecast(
            solar_de_d1_d2, weather_d1_d14
        )
        features[:, 2] = self.wind_extenders['FR'].extend_forecast(
            fetch_entsoe_forecast('FR', 'wind', start, start + timedelta(hours=48)),
            weather_d1_d14
        )
        features[:, 3] = self.solar_extenders['FR'].extend_forecast(
            fetch_entsoe_forecast('FR', 'solar', start, start + timedelta(hours=48)),
            weather_d1_d14
        )
        
        # Feature 4-5: Demand forecasts (extended using patterns)
        features[:, 4] = self.demand_extenders['DE_LU'].extend_forecast(
            demand_de_d1_d7, weather_d1_d14
        )
        features[:, 5] = self.demand_extenders['FR'].extend_forecast(
            fetch_entsoe_forecast('FR', 'load', start, start + timedelta(hours=168)),
            weather_d1_d14
        )
        
        # Feature 6-10: Weather forecasts (native D+14 coverage)
        features[:, 6] = weather_d1_d14['temperature_2m'].mean(axis=1)  # Avg temp
        features[:, 7] = weather_d1_d14['DE_north_sea']['windspeed_100m']
        features[:, 8] = weather_d1_d14['DE_baltic']['windspeed_100m']
        features[:, 9] = weather_d1_d14['shortwave_radiation'].mean(axis=1)
        features[:, 10] = weather_d1_d14['cloudcover'].mean(axis=1)
        
        # Feature 11: NTC forecast (extended with persistence)
        features[:, 11] = self.ntc_extenders['DE_FR'].extend_forecast(ntc_de_fr_d1)
        
        # Feature 12-16: Temporal (deterministic, perfect knowledge)
        timestamps = pd.date_range(start, end, freq='H', inclusive='left')
        features[:, 12] = np.sin(2 * np.pi * timestamps.hour / 24)
        features[:, 13] = np.cos(2 * np.pi * timestamps.hour / 24)
        features[:, 14] = timestamps.dayofweek
        features[:, 15] = (timestamps.dayofweek >= 5).astype(int)
        features[:, 16] = timestamps.map(lambda x: is_holiday(x, 'DE')).astype(int)
        
        return features
```

### 3.5 Data Flow Summary

**Complete prediction workflow:**

```python
# Example: Predicting on August 15, 2025 at 6 AM

# Step 1: Load 24-month historical data (one-time)
historical_data = {
    'jao': load_parquet('jao_2023_2025.parquet'),
    'entsoe': load_parquet('entsoe_2023_2025.parquet'),
    'weather': load_parquet('weather_2023_2025.parquet')
}

# Step 2: Initialize feature engineer with 24-month data
engineer = CompleteFBMCFeatureEngineer(historical_data)

# Step 3: Prepare inputs for prediction
prediction_time = '2025-08-15 06:00:00'

historical_context, future_covariates = engineer.prepare_complete_input(
    prediction_time
)

# historical_context: (512, 70) - What happened July 25 - Aug 15
# future_covariates: (336, 17) - What's expected Aug 15 - Aug 29

# Step 4: Zero-shot forecast
model = ChronosPipeline.from_pretrained("amazon/chronos-t5-large")

forecast = model.predict(
    context=historical_context,
    future_covariates=future_covariates,
    prediction_length=336
)

# forecast: (100 samples, 336 hours, 20 borders)
```

### 3.6 Why This Architecture Matters

**Without smart extensions:**
```
D+1-D+2: High accuracy (using ENTSO-E forecasts)
D+3-D+14: Poor accuracy (using crude persistence)
Result: MAE degrades rapidly beyond D+2
```

**With smart extensions:**
```
D+1-D+2: High accuracy (ENTSO-E forecasts)
D+3-D+14: Good accuracy (derived from weather, maintained patterns)
Result: MAE degrades gracefully, remains useful to D+14
```

**Expected performance improvement:**
| Horizon | Without Extensions | With Smart Extensions |
|---------|-------------------|----------------------|
| D+1 | 134 MW | 134 MW (same) |
| D+3 | 178 MW | 156 MW (-22 MW) |
| D+7 | 215 MW | 187 MW (-28 MW) |
| D+14 | 285 MW | 231 MW (-54 MW) |

**The smart extension strategies keep the model "informed" about future conditions even when official forecasts end, maintaining prediction quality across the full 14-day horizon.**

---

## 4. Zero-Shot Inference Specification

### 4.1 The Core Innovation: Pattern Recognition Without Training

**Key Insight**: Chronos 2's 710M parameters were pre-trained on 100+ billion time series datapoints. It already understands:
- Temporal patterns (hourly, daily, seasonal cycles)
- Cross-series dependencies (multivariate relationships)
- Regime changes (sudden shifts in behavior)
- Weather-driven patterns
- Economic constraints

**We don't train the model. We give it FBMC-specific context through features.**

### 4.2 Zero-Shot Inference Pipeline

```python
from chronos import ChronosPipeline
import torch
import polars as pl
import numpy as np

class FBMCZeroShotForecaster:
    """
    Zero-shot forecasting for FBMC borders using Chronos 2.
    No training, only feature-informed inference.
    """
    
    def __init__(self):
        # Load pre-trained model (parameters stay frozen)
        self.pipeline = ChronosPipeline.from_pretrained(
            "amazon/chronos-t5-large",
            device_map="cuda",
            torch_dtype=torch.float16
        )
        
        self.config = {
            'context_length': 512,        # 21 days lookback
            'prediction_length': 336,     # 14 days forecast
            'num_samples': 100,           # Probabilistic samples
            'feature_dimension': 85,      # Input features
            'target_dimension': 20,       # Border capacities
        }
        
    def prepare_context(self, features, targets, prediction_time):
        """
        Prepare context window for zero-shot inference.
        
        Args:
            features: polars DataFrame with full 24-month feature matrix
            targets: polars DataFrame with historical capacity values
            prediction_time: Timestamp to predict from
            
        Returns:
            context: Recent 512 hours of multivariate data
        """
        # Find row index for prediction time
        time_col = features.select(pl.col('timestamp')).to_series()
        idx = (time_col == prediction_time).arg_max()
        
        # Extract context window (last 512 hours) as numpy arrays
        context_features = features.slice(idx-512, 512).drop('timestamp').to_numpy()
        context_targets = targets.slice(idx-512, 512).drop('timestamp').to_numpy()
        
        # Combine features and historical capacities
        # Shape: (512 hours, 85 features + 20 borders)
        context = np.concatenate([
            context_features,
            context_targets
        ], axis=1)
        
        return torch.tensor(context, dtype=torch.float32)
    
    def forecast(self, context):
        """
        Zero-shot forecast using pre-trained Chronos 2.
        
        Args:
            context: Recent 512 hours of multivariate data
            
        Returns:
            forecast: Probabilistic predictions (samples, time, borders)
        """
        with torch.no_grad():  # No gradient computation (not training)
            forecast = self.pipeline.predict(
                context=context,
                prediction_length=self.config['prediction_length'],
                num_samples=self.config['num_samples']
            )
        
        return forecast
    
    def run_inference(self, features, targets, test_period):
        """
        Run zero-shot inference for entire test period.
        
        Args:
            features: Engineered features (24 months)
            targets: Historical capacities (24 months)
            test_period: Dates to generate forecasts for
            
        Returns:
            all_forecasts: Dictionary of forecasts by date
        """
        all_forecasts = {}
        
        for prediction_time in test_period:
            # Prepare context from recent history
            context = self.prepare_context(
                features, targets, prediction_time
            )
            
            # Zero-shot forecast
            forecast = self.forecast(context)
            
            # Store median and quantiles
            all_forecasts[prediction_time] = {
                'median': torch.median(forecast, dim=0)[0],
                'q10': torch.quantile(forecast, 0.1, dim=0),
                'q90': torch.quantile(forecast, 0.9, dim=0)
            }
            
            print(f"ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Forecast generated for {prediction_time}")
        
        return all_forecasts
```

### 4.3 What Makes This Zero-Shot

**Comparison: Training vs Zero-Shot**

```python
# ÃƒÂ¢Ã‚ÂÃ…â€™ TRAINING (what we're NOT doing)
model = ChronosPipeline.from_pretrained("amazon/chronos-t5-large")

for epoch in range(10):
    for batch in train_loader:
        # Forward pass
        predictions = model(batch.features)
        
        # Compute loss
        loss = criterion(predictions, batch.targets)
        
        # Backward pass (updates 710M parameters)
        loss.backward()
        optimizer.step()
        
# Model weights have changed ÃƒÂ¢Ã¢â‚¬Â Ã‚Â TRAINING

# ÃƒÂ¢Ã…â€œÃ¢â‚¬Â¦ ZERO-SHOT (what we ARE doing)
pipeline = ChronosPipeline.from_pretrained("amazon/chronos-t5-large")

# Prepare recent context with FBMC features
context = prepare_context(features, targets, prediction_time)

# Direct prediction (no training, no weight updates)
forecast = pipeline.predict(context, prediction_length=336)

# Model weights unchanged ÃƒÂ¢Ã¢â‚¬Â Ã‚Â ZERO-SHOT
```

**Key Differences:**
- **Training**: Adjusts model parameters to minimize prediction error on your data
- **Zero-Shot**: Uses model's pre-existing knowledge, informed by your context

**Why This Works:**
Chronos 2 learned general patterns from massive pre-training:
- *"When feature A is high and feature B is low, target tends to decrease"*
- *"Strong cyclic patterns in context predict similar cycles ahead"*
- *"Sudden feature changes often precede target regime shifts"*

Your FBMC features provide the specific context:
- *"North Sea wind is high (feature 31)"*
- *"RAM has been decreasing (feature 10-17)"*
- *"CNECs are binding more frequently (feature 18-27)"*

The model applies its pre-trained pattern recognition to your FBMC-specific context.

### 4.4 Multivariate Forecasting: All Borders Simultaneously

**Critical Design**: Predict all ~20 borders in one pass, capturing cross-border dependencies.

```python
# Input shape to Chronos 2
context_shape = (512 hours, 105 features)
# Where 105 = 85 engineered features + 20 historical border capacities

# Output shape from Chronos 2
forecast_shape = (100 samples, 336 hours, 20 borders)

# Example: How model learns cross-border effects
# If context shows:
#   - North Sea wind high (feature 31)
#   - DE-NL capacity decreasing (historical)
#   - DE-FR capacity stable (historical)
# 
# Model predicts:
#   - DE-NL continues decreasing (overload from north)
#   - DE-FR also decreases (loop flow spillover)
#   - NL-BE increases (alternative route)
#
# All predicted simultaneously, preserving network physics
```

### 4.5 Performance Evaluation

```python
def evaluate_zero_shot_performance(forecasts, actuals):
    """
    Evaluate zero-shot forecasts against actual JAO allocations.
    """
    results = {
        'aggregated': {},      # All borders combined
        'per_border': {},      # Individual border metrics
        'by_condition': {}     # Performance in different scenarios
    }
    
    # 1. AGGREGATED METRICS
    for day in range(1, 15):  # D+1 through D+14
        horizon_idx = (day - 1) * 24
        
        pred_day = forecasts[:, horizon_idx:horizon_idx+24, :].median(dim=0)[0]
        actual_day = actuals[:, horizon_idx:horizon_idx+24, :]
        
        mae = torch.abs(pred_day - actual_day).mean().item()
        mape = (torch.abs(pred_day - actual_day) / actual_day).mean().item() * 100
        
        results['aggregated'][f'D+{day}'] = {
            'mae_mw': mae,
            'mape_pct': mape
        }
    
    # 2. PER-BORDER METRICS
    for border_idx, border in enumerate(FBMC_BORDERS):
        results['per_border'][border] = {}
        
        for day in range(1, 15):
            horizon_idx = (day - 1) * 24
            
            pred_border = forecasts[:, horizon_idx:horizon_idx+24, border_idx].median(dim=0)[0]
            actual_border = actuals[:, horizon_idx:horizon_idx+24, border_idx]
            
            mae = torch.abs(pred_border - actual_border).mean().item()
            
            results['per_border'][border][f'D+{day}'] = {'mae_mw': mae}
    
    # 3. CONDITIONAL PERFORMANCE
    # Where does zero-shot struggle?
    conditions = {
        'high_wind': features[:, 31] > 25,  # North Sea wind
        'low_nuclear': features[:, 43] < 40000,  # French nuclear
        'high_demand': features[:, 28] > 60000,  # German load
        'weekend': features[:, 54] == 1
    }
    
    for condition_name, mask in conditions.items():
        pred_condition = forecasts[mask]
        actual_condition = actuals[mask]
        
        mae = torch.abs(pred_condition - actual_condition).mean().item()
        
        results['by_condition'][condition_name] = {
            'mae_mw': mae,
            'sample_size': mask.sum().item()
        }
    
    return results
```

**Performance Targets (Zero-Shot Baseline):**
- **D+1**: MAE < 150 MW (relaxed from fine-tuned target of 100 MW)
- **D+2-7**: MAE < 200 MW
- **D+8-14**: MAE < 250 MW

**If targets not met**, document specific failure modes for quantitative analyst:
- Which borders struggle most?
- Which weather conditions cause largest errors?
- Which time horizons degrade fastest?
- Where would fine-tuning help?

### 4.6 Inference Speed and Efficiency

**Expected Performance:**
```python
# Single forecast generation
context = features[-512:]  # 512 hours ÃƒÆ’Ã¢â‚¬â€ 105 features
forecast = pipeline.predict(context, prediction_length=336)
# Time: ~30 seconds on A10G GPU

# Full test period (60 days)
for prediction_time in test_period_60_days:
    forecast = pipeline.predict(...)
# Total time: ~30 minutes for 60 independent forecasts

# Batch inference (if needed)
batch_contexts = [features[i-512:i] for i in range(start, end)]
batch_forecasts = pipeline.predict(batch_contexts, ...)
# Time: ~5 minutes for 60 forecasts
```

**Memory Usage:**
- Model loading: ~3 GB VRAM
- Single inference: +1 GB VRAM
- Total: ~4 GB VRAM (well within A10G's 24 GB)

---

## 5. Project Structure (Simplified 90%)

### 5.1 Hugging Face Space Structure

```
fbmc-forecasting/  (HF Space root)
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ README.md                      # Handover documentation
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ requirements.txt               # Python dependencies
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ app.py                         # Gradio demo (optional)
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ config/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ spatial_grid.yaml          # 52 weather points
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ border_definitions.yaml    # ~20 FBMC borders
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ cnec_top50.json            # Pre-identified top CNECs
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ data/                          # HF Datasets or direct upload
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ jao_24m.parquet             # 24 months JAO data
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ entsoe_24m.parquet          # ENTSO-E forecasts
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ weather_24m.parquet         # 52-point weather grid
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ features_24m.parquet        # Engineered features (~1,735 features)
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ notebooks/                     # Development notebooks
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 01_data_exploration.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 02_feature_engineering.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 03_zero_shot_inference.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 04_performance_evaluation.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ 05_error_analysis.ipynb
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ src/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ data_collection/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ fetch_openmeteo.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ fetch_entsoe.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ fetch_jao.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ fetch_ntc.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ feature_engineering/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ spatial_gradients.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ cnec_patterns.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ ptdf_compression.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ feature_matrix.py     # ~1,735 features
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ model/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ zero_shot_forecaster.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ evaluation.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ utils/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡       ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ logging_config.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡       ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ constants.py
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ scripts/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ download_all_data.py       # Local data download
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ validate_data.py           # Data quality checks
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ identify_top_cnecs.py      # CNEC analysis
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ generate_report.py         # Performance reporting
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ results/                       # Generated outputs
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ zero_shot_performance.json
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ error_analysis.csv
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ border_metrics.json
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡   ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ visualizations/
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬Å¡
ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ docs/
    ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ HANDOVER_GUIDE.md          # For quantitative analyst
    ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ FEATURE_ENGINEERING.md     # Feature documentation
    ÃƒÂ¢Ã¢â‚¬ÂÃ…â€œÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ ZERO_SHOT_APPROACH.md      # Methodology explanation
    ÃƒÂ¢Ã¢â‚¬ÂÃ¢â‚¬ÂÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ÃƒÂ¢Ã¢â‚¬ÂÃ¢â€šÂ¬ FINE_TUNING_ROADMAP.md     # Phase 2 suggestions
```

### 5.2 Minimal Dependencies

```txt
# requirements.txt
chronos-forecasting>=1.0.0
transformers>=4.35.0
torch>=2.0.0
pandas>=2.0.0
numpy>=1.24.0
scikit-learn>=1.3.0
entsoe-py>=0.5.0
requests>=2.31.0
pyarrow>=13.0.0
pyyaml>=6.0.0
plotly>=5.17.0
gradio>=4.0.0  # Optional for demo
```

---

## 6. Technology Stack

### 6.1 Core Development Stack

| Component | Tool | Why | Performance Benefit |
|-----------|------|-----|-------------------|
| **Data Processing** | **polars** | Rust-based, parallel by default, lazy evaluation | 5-50x faster than pandas on 10M+ rows |
| **Package Manager** | **uv** | Single Rust binary, lockfile by default | 10-100x faster than pip/conda |
| **Visualization** | **Altair** | Declarative, polars-native, composable | Cleaner syntax, Vega-Lite spec |
| **Notebooks (Local)** | **Marimo** | Reactive execution, pure .py files, no hidden state | Eliminates stale cell bugs |
| **Notebooks (HF Space)** | **JupyterLab** | Standard format for handover | Analyst familiarity |
| **Infrastructure** | **HF Spaces** | Persistent GPU, Git versioning, $30/month | Zero setup complexity |
| **Model** | **Chronos 2 Large** | 710M params, pre-trained on 100B+ time series | Zero-shot capability |

### 6.2 Why Polars Over Pandas

**Performance Critical Operations in FBMC Project:**

```python
# Dataset scale
weather_data:  52 points Ãƒâ€” 7 params Ãƒâ€” 17,520 hours = 6.5M rows
jao_cnecs:     200 CNECs Ãƒâ€" 17,520 hours = 3.5M rows  
entsoe_data:   12 zones Ãƒâ€” multiple params Ãƒâ€” 17,520 hours = ~2M rows
TOTAL:         ~12M+ rows across tables

# Operations we'll do thousands of times
- Rolling window aggregations (512-hour context)
- GroupBy with multiple aggregations (CNEC patterns)
- Time-based joins (aligning weather, JAO, ENTSO-E)
- Lazy evaluation queries (filtering before loading)
```

**polars Advantages:**
1. **Parallel by default**: Uses all CPU cores (no GIL limitations)
2. **Lazy evaluation**: Only computes what's needed (memory efficient)
3. **Arrow-native**: Zero-copy reading/writing Parquet files
4. **Query optimization**: Automatically reorders operations for speed
5. **10-30x faster**: For feature engineering pipelines on 24-month dataset

**Time Saved:**
- Feature engineering (Day 2): 8 hours Ã¢â€ â€™ 4-5 hours with polars
- Data validation: 30 min Ã¢â€ â€™ 5 min with polars
- Iteration cycles: 5 min Ã¢â€ â€™ 30 seconds per iteration

### 6.3 Why uv Over pip/conda

**Benefits:**
1. **Speed**: 10-100x faster dependency resolution and installation
2. **Lockfile by default**: `uv.lock` ensures exact reproducibility for analyst handover
3. **Single binary**: No Python needed to install (simplifies HF Space setup)
4. **Better resolver**: Handles complex dependency conflicts intelligently
5. **Drop-in compatible**: Works with existing `requirements.txt`

**Day 2 Impact:**
- Feature engineering iterations require dependency updates
- uv saves 5-10 minutes per cycle
- Estimate: 5-8 iterations Ãƒâ€” 7 min saved = 35-56 minutes saved on Day 2

### 6.4 Why Altair for Visualization

**Advantages:**
1. **Declarative syntax**: Grammar of graphics (more maintainable)
2. **polars-native**: `alt.Chart(polars_df)` works directly (no conversion)
3. **Composable**: Layering, faceting, concatenation feel natural
4. **Vega-Lite backend**: Standardized JSON spec (reproducible, shareable)
5. **Less boilerplate**: ~3x less code than plotly for same chart

**FBMC Visualization Example:**
```python
# Error analysis by horizon (Altair - 6 lines)
import altair as alt

alt.Chart(error_df).mark_line().encode(
    x='horizon:Q',
    y='mae:Q',
    color='border:N'
).properties(title='MAE by Horizon and Border').interactive()

# Same in plotly (18 lines)
import plotly.graph_objects as go
fig = go.Figure()
for border in borders:
    df_border = error_df[error_df['border'] == border]
    fig.add_trace(go.Scatter(
        x=df_border['horizon'], 
        y=df_border['mae'], 
        name=border,
        mode='lines+markers'
    ))
fig.update_layout(
    title='MAE by Horizon and Border',
    xaxis_title='Horizon',
    yaxis_title='MAE (MW)'
)
fig.show()
```

### 6.5 Marimo Hybrid Approach (CONFIRMED HANDOVER FORMAT)

**Development Strategy**:
- **Local Development (Days 1-4)**: Use Marimo reactive notebooks for rapid iteration
- **HF Space Handover (Day 5)**: Export to standard JupyterLab-compatible notebooks

**Local Development with Marimo (Days 1-4):**
```python
# notebooks/01_data_exploration.py (Marimo reactive notebook)
import marimo as mo
import polars as pl

@app.cell
def load_weather():
    """Load weather data - auto-updates downstream cells on change"""
    weather = pl.read_parquet('data/weather_12m.parquet')
    return weather,

@app.cell
def calculate_gradients(weather):
    """Automatically recalculates if weather changes above"""
    gradients = weather.group_by('timestamp').agg([
        (pl.col('windspeed_100m').max() - pl.col('windspeed_100m').min()).alias('wind_gradient')
    ])
    return gradients,

@app.cell  
def visualize(gradients):
    """Reactive visualization"""
    import altair as alt
    return alt.Chart(gradients).mark_line().encode(x='timestamp', y='wind_gradient')
```

**Key Benefits During Development:**
- **Reactive execution**: Cells auto-update when dependencies change (prevents stale results)
- **Pure Python files**: `.py` instead of `.ipynb` (Git-friendly, reviewable diffs)
- **No hidden state**: Execution order enforced by dataflow graph (impossible to run cells out of order)
- **Interactive widgets**: First-class `mo.ui` components for parameter tuning
- **Faster iteration**: No manual re-running of dependent cells

**HF Space Handover (Day 5):**
```bash
# Export Marimo notebooks to standard .ipynb format
marimo export notebooks/*.py --format ipynb --output notebooks_exported/

# Upload to HF Space
git add notebooks_exported/
git commit -m "Add JupyterLab-compatible notebooks for analyst handover"
git push
```

**Result for Analyst:** 
- Receives standard JupyterLab-compatible `.ipynb` notebooks in HF Space
- Can use them immediately without installing Marimo
- Can optionally adopt Marimo for Phase 2 development if desired
- Zero friction handover - standard tools only

**Why This Hybrid Approach:**
- You get 95% of Marimo's development benefits (reactive, Git-friendly)
- Analyst gets 100% standard tooling (JupyterLab, no learning curve)
- Best of both worlds with zero handover friction

### 6.6 Complete Development Environment Setup

```bash
# Local setup with uv (10 seconds vs 2 minutes with pip)
uv venv
uv pip install polars chronos-forecasting entsoe-py altair marimo pyarrow pyyaml scikit-learn

# Create lockfile for exact reproducibility
uv pip compile requirements.txt -o requirements.lock

# Analyst can recreate exact environment
uv pip sync requirements.lock
```

**Dependencies:**
```txt
# requirements.txt (uv-managed)
polars>=0.20.0
chronos-forecasting>=1.0.0
transformers>=4.35.0
torch>=2.0.0
numpy>=1.24.0
scikit-learn>=1.3.0
entsoe-py>=0.5.0
altair>=5.0.0
marimo>=0.9.0
pyarrow>=13.0.0
pyyaml>=6.0.0
requests>=2.31.0
gradio>=4.0.0  # Optional for HF Space demo
```

### 6.7 Data Pipeline Stack

| Stage | Tool | Format | Purpose |
|-------|------|--------|---------|
| **Collection** | jao-py, entsoe-py, requests | Raw API responses | Historical data download |
| **Storage** | Parquet (via pyarrow) | Columnar compressed | ~12 GB for 24 months (vs ~50 GB CSV) |
| **Processing** | polars LazyFrame | Lazy evaluation | Only compute what's needed |
| **Features** | polars expressions | Columnar operations | Vectorized transformations |
| **ML Input** | numpy arrays | Dense matrices | Chronos 2 expects numpy |

**Workflow:**
```
JAO/ENTSO-E APIs Ã¢â€ â€™ Parquet files Ã¢â€ â€™ polars LazyFrame Ã¢â€ â€™ Feature engineering Ã¢â€ â€™ numpy arrays Ã¢â€ â€™ Chronos 2
```

### 6.8 Why This Stack Saves Time

| Task | pandas + pip + plotly | polars + uv + Altair | Time Saved |
|------|---------------------|---------------------|------------|
| Environment setup | 2-3 min | 10-15 sec | 2 min |
| Data loading (6 GB) | 45 sec | 5 sec | 40 sec |
| Feature engineering (Day 2) | 8 hours | 4-5 hours | 3-4 hours |
| Visualization code | 18 lines avg | 6 lines avg | Development velocity |
| Dependency updates | 3-5 min each | 20-30 sec each | 2-4 min per update |
| Bug debugging (stale cells) | 30-60 min (Jupyter) | 0 min (Marimo reactive) | 30-60 min |

**Total Time Saved:** ~5-6 hours across 5-day project = **20-25% efficiency gain**

**Maintained Benefits:**
- Analyst receives standard JupyterLab-compatible notebooks
- All code works with standard tools (no vendor lock-in)
- Clean handover with reproducible environment (uv.lock)

---

## 7. Implementation Roadmap (5 Days)

### Critical Success Principles

**Weather ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ CNEC Activation ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Border Capacity**

This core insight drives all design decisions. We leverage Chronos 2's pre-trained pattern recognition while providing FBMC-specific context.

**MULTIVARIATE INFERENCE REQUIREMENT**

**All borders must be predicted simultaneously** to capture cross-border dependencies, CNEC activation patterns, and loop flow physics.

Examples of why multivariate inference is required:
- **North Sea wind** affects DE-NL, DE-BE, NL-BE, DE-DK simultaneously
- **Austrian hydro dispatch** impacts DE-AT, CZ-AT, HU-AT, SI-AT
- **Polish thermal generation** creates loop flows through CZ-DE-AT
- **CNEC activations** on one border affect capacity on adjacent borders

### 5-Day Timeline (All Borders, 2-Year Data)

#### **Day 0: Environment Setup (45 minutes)**

**CONFIRMED INFRASTRUCTURE: Hugging Face Space (Paid A10G GPU)**

**What changed from planning**: Added jao-py library installation and API key configuration steps

```bash
# 1. Create HF Space (10 min)
# Visit huggingface.co/new-space
# Choose:
#   - SDK: JupyterLab (for handover compatibility)
#   - Hardware: A10G GPU ($30/month)
#   - Visibility: Private (for now)

# 2. Clone locally (2 min)
git clone https://huggingface.co/spaces/yourname/fbmc-forecasting
cd fbmc-forecasting

# 3. Initialize structure (2 min)
mkdir -p data notebooks notebooks_exported src/{data_collection,feature_engineering,model} config results docs

# 4. Local environment setup with uv (5 min)
uv venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# 5. Install dependencies (10 sec with uv vs 2 min with pip)
uv pip install polars chronos-forecasting entsoe-py altair marimo pyarrow pyyaml scikit-learn torch transformers requests gradio

# 6. Create lockfile for reproducibility (5 sec)
cat > requirements.txt << EOF
polars>=0.20.0
chronos-forecasting>=1.0.0
transformers>=4.35.0
torch>=2.0.0
numpy>=1.24.0
scikit-learn>=1.3.0
entsoe-py>=0.5.0
altair>=5.0.0
marimo>=0.9.0
pyarrow>=13.0.0
pyyaml>=6.0.0
requests>=2.31.0
gradio>=4.0.0
EOF

uv pip compile requirements.txt -o requirements.lock

# 7. Install HF CLI for data uploads (3 min)
pip install huggingface_hub
huggingface-cli login  # Use your HF token

# 8. Install jao-py library (1 min)
uv pip install jao-py
# Pure Python library - no external tools needed
# Data available from 2022-06-09 onwards

# 9. Configure API keys (2 min)
cat > config/api_keys.yaml << EOF
entsoe_api_key: "YOUR_ENTSOE_KEY_HERE"  # CONFIRMED AVAILABLE
openmeteo_api_key: null  # Not required - OpenMeteo is free
EOF

# 10. Create first Marimo notebook (5 min)
marimo edit notebooks/01_data_exploration.py
# This opens interactive editor - create basic structure and save

# 11. Initial commit (2 min)
git add .
git commit -m "Initialize FBMC forecasting project: polars + uv + Marimo + jao-py"
git push

# 10. Verify HF Space accessibility (1 min)
# Visit https://huggingface.co/spaces/yourname/fbmc-forecasting
```

**Deliverable**: 
- ÃƒÂ¢Ã…â€œ" HF Space ready with JupyterLab
- ÃƒÂ¢Ã…â€œ" Local environment with uv + polars + Marimo
- ÃƒÂ¢Ã…â€œ" HF CLI configured for data uploads
- ÃƒÂ¢Ã…â€œ" JAOPuTo tool ready for data collection
- ÃƒÂ¢Ã…â€œ" First Marimo notebook created (local development)
- ÃƒÂ¢Ã…â€œ" Reproducible environment via requirements.lock

**Stack Verification:**
```bash
# Verify installations
polars --version      # Should show 0.20.x+
uv --version          # Should show uv 0.x.x
marimo --version      # Should show marimo 0.9.x+
python -c "import altair; print(altair.__version__)"  # 5.x+
```

---

#### **Day 1: Data Collection - All Borders, 2 Years (8 hours)**

**Morning (4 hours): JAO and ENTSO-E Data**

```python
# Download 24 months of JAO FBMC data (all borders)
# This runs LOCALLY first, then uploads to HF Space

# Step 1: JAO data download
import subprocess
import polars as pl
from datetime import datetime

def download_jao_data():
    """Download 24 months of JAO FBMC data"""
    from jao import JaoPublicationToolPandasClient

    client = JaoPublicationToolPandasClient(use_mirror=True)
    # Collect data for date range
    # Methods discovered from source code
    # Save to Parquet format

    # Expected files:
    # - jao_cnec_2024_2025.parquet
    # - jao_ptdf_2024_2025.parquet (if method available)
    # - ptdfs_2023_2025.parquet (~800 MB)
    # - rams_2023_2025.parquet (~400 MB)
    # - shadow_prices_2023_2025.parquet (~300 MB)
    
    print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ JAO data downloaded")

download_jao_data()

# Step 2: ENTSO-E data download
from entsoe import EntsoePandasClient
from concurrent.futures import ThreadPoolExecutor
import time

client = EntsoePandasClient(api_key='YOUR_KEY')

zones = ['DE_LU', 'FR', 'BE', 'NL', 'AT', 'CZ', 'PL', 'HU', 'RO', 'SK', 'SI', 'HR']
start = pd.Timestamp('20230101', tz='Europe/Berlin')
end = pd.Timestamp('20250930', tz='Europe/Berlin')

def fetch_zone_data(zone):
    """Fetch all data for one zone with rate limiting"""
    try:
        # Load forecast
        load = client.query_load_forecast(zone, start, end)
        load.write_parquet(f'data/entsoe/{zone}_load.parquet')
        
        # Renewable forecasts
        renewables = client.query_wind_and_solar_forecast(zone, start, end)
        renewables.write_parquet(f'data/entsoe/{zone}_renewables.parquet')
        
        print(f"ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ {zone} complete")
        time.sleep(2)  # Rate limiting
        
    except Exception as e:
        print(f"ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ {zone} failed: {e}")

# Sequential fetch (respects API limits)
for zone in zones:
    fetch_zone_data(zone)

print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ ENTSO-E data downloaded")
```

**Afternoon (4 hours): Weather Data (52 Points) + Validation**

```python
# Weather data download (parallel)
import requests
import yaml
import polars as pl
from concurrent.futures import ThreadPoolExecutor

# Load 52 grid points
with open('config/spatial_grid.yaml', 'r') as f:
    grid_points = yaml.safe_load(f)['spatial_grid']

def fetch_weather_point(point):
    """Fetch 24 months of weather for one grid point"""
    lat, lon = point['lat'], point['lon']
    name = point['name']

    url = "https://api.open-meteo.com/v1/forecast"
    params = {
        'latitude': lat,
        'longitude': lon,
        'hourly': 'temperature_2m,windspeed_10m,windspeed_100m,winddirection_100m,shortwave_radiation,cloudcover,surface_pressure',
        'start_date': '2023-10-01',
        'end_date': '2025-09-30',
        'timezone': 'UTC'
    }
    
    try:
        response = requests.get(url, params=params)
        data = response.json()
        
        # Create polars DataFrame directly
        df = pl.DataFrame(data['hourly']).with_columns([
            pl.lit(name).alias('grid_point'),
            pl.lit(lat).alias('lat'),
            pl.lit(lon).alias('lon')
        ])
        
        return df
    except Exception as e:
        print(f"ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ {name} failed: {e}")
        return None

# Parallel fetch (10 concurrent)
with ThreadPoolExecutor(max_workers=10) as executor:
    weather_data = list(executor.map(fetch_weather_point, grid_points))

# Combine and save
weather_df = pl.concat([df for df in weather_data if df is not None])
weather_df.write_parquet('data/weather/historical_52points_12m.parquet')

print(f"ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Weather data complete: {len(weather_df)} rows")

# Data validation
def validate_data_quality():
    """Comprehensive data quality checks"""
    import os
    
    jao_cnecs = pl.read_parquet('data/jao/cnecs_2023_2025.parquet')
    entsoe_load = pl.read_parquet('data/entsoe/DE_LU_load.parquet')
    weather = pl.read_parquet('data/weather/historical_52points_12m.parquet')
    
    checks = {
        'jao_cnecs_rows': len(jao_cnecs) > 300000,
        'jao_borders_count': jao_cnecs['border'].n_unique() >= 20,
        'entsoe_zones_complete': len([f for f in os.listdir('data/entsoe') if 'load' in f]) == 12,
        'weather_points_complete': weather['grid_point'].n_unique() == 52,
        'date_range_complete': (weather['time'].max() - weather['time'].min()).days >= 900,
        'no_major_gaps': weather.group_by('grid_point').agg([
            (pl.col('time').diff().max() < pl.duration(hours=2)).alias('no_gap')
        ])['no_gap'].all()
    }
    
    print("\nData Quality Checks:")
    for check, passed in checks.items():
        print(f"  {'ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“' if passed else 'ÃƒÂ¢Ã…â€œÃ¢â‚¬â€'} {check}")
    
    return all(checks.values())

if validate_data_quality():
    # Upload to HF Space
    print("\nÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Validation passed, uploading to HF Space...")
    
    # Upload using HF Datasets or CLI
    subprocess.run(['git', 'add', 'data/'])
    subprocess.run(['git', 'commit', '-m', 'Add 24-month historical data'])
    subprocess.run(['git', 'push'])
    
    print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Data uploaded to HF Space")
else:
    print("ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ Validation failed - fix issues before proceeding")
```

**Deliverable**:
- 24 months of data for ALL borders downloaded locally
- Data validated and uploaded to HF Space
- ~12 GB compressed in Parquet format

---

#### **Day 2: Feature Engineering + Forecast Extensions (8 hours)**

**Morning (4 hours): Build Historical Feature Pipeline**

```python
# src/feature_engineering/feature_matrix.py
# Run in HF Space JupyterLab

import numpy as np
import polars as pl
from sklearn.decomposition import PCA

class FBMCFeatureEngineer:
    """
    Engineer ~1,735 features for zero-shot inference.
    All features use 24-month history for baseline calculations.

    NOTE: This simplified code example shows deprecated 87-feature design.
    See Section 2.7 "Complete Feature Set" for production architecture.
    """

    def __init__(self, weather_points=52, tier1_cnecs=50, tier2_cnecs=150):
        self.weather_points = weather_points
        self.tier1_cnecs = tier1_cnecs
        self.tier2_cnecs = tier2_cnecs
        self.pca = PCA(n_components=10)
        
    def transform_historical(self, data, start_time, end_time):
        """
        Build historical context features (512 hours, 70 features)
        
        Args:
            data: dict with keys ['jao', 'entsoe', 'weather']
            start_time: 21 days before prediction
            end_time: prediction time
            
        Returns:
            features: shape (512, 70) - historical context
        """
        n_hours = 512
        features = np.zeros((n_hours, 70))
        
        print("Engineering historical features...")
        
        # Category 1: Historical PTDF Patterns (10 features)
        print("  - PTDF compression...")
        ptdf_historical = data['jao']['ptdf_matrix'][start_time:end_time]
        ptdf_compressed = self.pca.fit_transform(ptdf_historical)
        features[:, 0:10] = ptdf_compressed
        
        # Category 2: Historical RAM Patterns (8 features)
        print("  - RAM patterns...")
        ram_data = data['jao']['ram'][start_time:end_time]
        features[:, 10] = ram_data.rolling(168, min_periods=1).mean()  # 7-day MA
        features[:, 11] = ram_data.rolling(720, min_periods=1).mean()  # 30-day MA
        features[:, 12] = ram_data.rolling(168, min_periods=1).std()
        features[:, 13] = (ram_data < 0.7 * data['jao']['fmax']).rolling(168).sum()
        features[:, 14] = ram_data.rolling(2160, min_periods=1).apply(lambda x: np.percentile(x, 50))
        features[:, 15] = (ram_data.diff() < -0.2 * data['jao']['fmax']).astype(int)
        features[:, 16] = (ram_data < 0.2 * data['jao']['fmax']).rolling(168).mean()
        features[:, 17] = ram_data.rolling(168, min_periods=1).apply(lambda x: np.percentile(x, 10))
        
        # Category 3: Historical CNEC Binding (10 features)
        print("  - CNEC patterns...")
        cnec_binding = data['jao']['cnec_presolved'][start_time:end_time].astype(int)
        features[:, 18] = cnec_binding.rolling(168, min_periods=1).mean()
        features[:, 19] = cnec_binding.rolling(720, min_periods=1).mean()
        
        internal_cnecs = (data['jao']['cnec_type'] == 'internal').astype(int)
        features[:, 20] = internal_cnecs.rolling(168, min_periods=1).mean()
        features[:, 21] = internal_cnecs.rolling(720, min_periods=1).mean()
        
        top_cnec_active = data['jao']['cnec_id'].isin(self.top_cnecs).astype(int)
        features[:, 22] = top_cnec_active.rolling(168, min_periods=1).mean()
        features[:, 23] = (cnec_binding & top_cnec_active).sum() / max(cnec_binding.sum(), 1)
        
        high_wind = (data['entsoe']['DE_LU_wind'] > 20000).astype(int)
        features[:, 24] = (cnec_binding & high_wind).rolling(168, min_periods=1).mean()
        
        high_solar = (data['entsoe']['DE_LU_solar'] > 40000).astype(int)
        features[:, 25] = (cnec_binding & high_solar).rolling(168, min_periods=1).mean()
        
        low_demand = (data['entsoe']['DE_LU_load'] < data['entsoe']['DE_LU_load'].quantile(0.3)).astype(int)
        features[:, 26] = (cnec_binding & low_demand).rolling(168, min_periods=1).mean()
        
        features[:, 27] = cnec_binding.rolling(168, min_periods=1).std()
        
        # Category 4: Historical Capacity (20 features - one per border)
        print("  - Historical capacities...")
        for i, border in enumerate(data['jao']['border'].unique()[:20]):
            border_mask = data['jao']['border'] == border
            features[border_mask, 28+i] = data['jao']['capacity'][border_mask]
        
        # Category 5: Derived Indicators (22 features)
        print("  - Derived patterns...")
        # ... (implement remaining derived features)
        
        print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Historical feature engineering complete")
        print(f"  Features shape: {features.shape}")
        print(f"  Feature completeness: {(~np.isnan(features)).sum() / features.size * 100:.1f}%")
        
        return features

# Test historical feature engineering
engineer = FBMCFeatureEngineer(weather_points=52, top_cnecs=50)

data = {
    'jao': pl.read_parquet('/home/user/data/jao_12m.parquet'),
    'entsoe': pl.read_parquet('/home/user/data/entsoe_12m.parquet'),
    'weather': pl.read_parquet('/home/user/data/weather_12m.parquet')
}

# Example: Prepare features for August 15, 2025
prediction_time = '2025-08-15 06:00:00'
start_historical = prediction_time - timedelta(hours=512)

historical_features = engineer.transform_historical(
    data, 
    start_historical, 
    prediction_time
)

print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Historical features saved")
```

**Afternoon (4 hours): Build Smart Forecast Extension Models**

```python
# src/feature_engineering/forecast_extensions.py

from sklearn.ensemble import GradientBoostingRegressor
from scipy.interpolate import interp1d

class WindForecastExtension:
    """
    Extend ENTSO-E wind forecasts using weather data
    Calibrated on 24-month historical relationship
    """
    
    def __init__(self, zone, historical_data):
        self.zone = zone
        self.power_curve = self._calibrate_power_curve(historical_data)
        self.weather_points = self._get_weather_points(zone)
        self.installed_capacity = {
            'DE_LU': 67000,  # MW
            'FR': 22000,
            'NL': 10000,
            'BE': 5500
        }[zone]
    
    def _calibrate_power_curve(self, historical_data):
        """
        Learn wind_speed_100m ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ generation from 24-month history
        """
        print(f"  Calibrating wind power curve for {self.zone}...")
        
        # Get relevant weather points
        if self.zone == 'DE_LU':
            points = ['DE_north_sea', 'DE_baltic', 'DE_north', 'DE_south']
            weights = [0.35, 0.25, 0.25, 0.15]
        elif self.zone == 'FR':
            points = ['FR_north', 'FR_west', 'FR_brittany']
            weights = [0.4, 0.35, 0.25]
        # ... other zones
        
        # Aggregate wind speeds
        weather = historical_data['weather']
        wind_speeds = []
        for point, weight in zip(points, weights):
            point_data = weather[weather['grid_point'] == point]['windspeed_100m']
            wind_speeds.append(point_data * weight)
        
        wind_avg = sum(wind_speeds)
        
        # Get actual generation
        generation = historical_data['entsoe'][f'{self.zone}_wind_actual']
        
        # Align timestamps
        common_idx = wind_avg.index.intersection(generation.index)
        wind_aligned = wind_avg[common_idx]
        gen_aligned = generation[common_idx]
        
        # Build power curve using bins
        wind_bins = np.arange(0, 30, 0.5)
        power_output = []
        
        for i in range(len(wind_bins) - 1):
            mask = (wind_aligned >= wind_bins[i]) & (wind_aligned < wind_bins[i+1])
            if mask.sum() > 10:
                power_output.append(gen_aligned[mask].median())
            else:
                power_output.append(np.nan)
        
        # Interpolate missing values
        power_series = pd.Series(power_output).interpolate(method='cubic').fillna(0)
        
        # Create smooth function
        power_curve_func = interp1d(
            wind_bins[:-1],
            power_series.values,
            kind='cubic',
            bounds_error=False,
            fill_value=(0, power_series.max())
        )
        
        print(f"  ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Power curve calibrated (capacity: {self.installed_capacity} MW)")
        return power_curve_func
    
    def extend_forecast(self, entsoe_d1_d2, weather_d1_d14):
        """
        Extend 48-hour ENTSO-E forecast to 336 hours using weather
        """
        # Use ENTSO-E for D+1-D+2
        forecast_full = list(entsoe_d1_d2.values)
        
        # Derive from weather for D+3-D+14
        weather_d3_d14 = weather_d1_d14[48:336]
        
        for i, hour_data in enumerate(weather_d3_d14):
            # Aggregate wind speeds from relevant points
            wind_speed = np.average(
                [hour_data[point]['windspeed_100m'] for point in self.weather_points],
                weights=self.weights
            )
            
            # Apply power curve
            generation = self.power_curve(wind_speed)
            
            # Clip to installed capacity
            generation = np.clip(generation, 0, self.installed_capacity)
            
            # For D+10-D+14, blend with seasonal baseline
            hour_ahead = 48 + i
            if hour_ahead > 216:
                blend_weight = (hour_ahead - 216) / 120
                seasonal_avg = self._get_seasonal_baseline(hour_ahead)
                generation = (1 - blend_weight) * generation + blend_weight * seasonal_avg
            
            forecast_full.append(generation)
        
        return np.array(forecast_full[:336])

class SolarForecastExtension:
    """
    Extend ENTSO-E solar forecasts using weather data
    Uses ML model: radiation + temperature + cloud ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ generation
    """
    
    def __init__(self, zone, historical_data):
        self.zone = zone
        self.solar_model = self._calibrate_solar_model(historical_data)
        self.installed_capacity = {
            'DE_LU': 85000,
            'FR': 20000,
            'NL': 22000,
            'BE': 8000
        }[zone]
    
    def _calibrate_solar_model(self, historical_data):
        """
        Learn: radiation + temp + clouds ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ generation
        Using Gradient Boosting (captures non-linear relationships)
        """
        print(f"  Calibrating solar model for {self.zone}...")
        
        weather = historical_data['weather']
        generation = historical_data['entsoe'][f'{self.zone}_solar_actual']
        
        # Get relevant weather points for zone
        if self.zone == 'DE_LU':
            points = ['DE_south', 'DE_west', 'DE_east', 'DE_central']
        # ... other zones
        
        # Extract features
        radiation = weather[weather['grid_point'].isin(points)].groupby(level=0)['shortwave_radiation'].mean()
        temperature = weather[weather['grid_point'].isin(points)].groupby(level=0)['temperature_2m'].mean()
        cloudcover = weather[weather['grid_point'].isin(points)].groupby(level=0)['cloudcover'].mean()
        
        # Align with generation
        common_idx = radiation.index.intersection(generation.index)
        
        X = pl.DataFrame({
            'radiation': radiation[common_idx],
            'temperature': temperature[common_idx],
            'cloudcover': cloudcover[common_idx],
            'hour': common_idx.hour,
            'day_of_year': common_idx.dayofyear,
            'cos_hour': np.cos(2 * np.pi * common_idx.hour / 24),
            'sin_hour': np.sin(2 * np.pi * common_idx.hour / 24),
        })
        
        y = generation[common_idx]
        
        # Fit gradient boosting
        model = GradientBoostingRegressor(
            n_estimators=100,
            max_depth=5,
            learning_rate=0.1,
            random_state=42
        )
        
        model.fit(X, y)
        
        print(f"  ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Solar model calibrated (RÃƒâ€šÃ‚Â²: {model.score(X, y):.3f})")
        return model
    
    def extend_forecast(self, entsoe_d1_d2, weather_d1_d14):
        """
        Extend solar forecast using weather predictions
        """
        # Use ENTSO-E for D+1-D+2
        forecast_full = list(entsoe_d1_d2.values)
        
        # Derive from weather for D+3-D+14
        weather_d3_d14 = weather_d1_d14[48:336]
        
        # Prepare features
        X_future = pl.DataFrame({
            'radiation': weather_d3_d14['shortwave_radiation'],
            'temperature': weather_d3_d14['temperature_2m'],
            'cloudcover': weather_d3_d14['cloudcover'],
            'hour': weather_d3_d14.index.hour,
            'day_of_year': weather_d3_d14.index.dayofyear,
            'cos_hour': np.cos(2 * np.pi * weather_d3_d14.index.hour / 24),
            'sin_hour': np.sin(2 * np.pi * weather_d3_d14.index.hour / 24),
        })
        
        # Predict
        generation_d3_d14 = self.solar_model.predict(X_future)
        
        # Clip to capacity
        generation_d3_d14 = np.clip(generation_d3_d14, 0, self.installed_capacity)
        
        # Zero out nighttime
        for i, timestamp in enumerate(weather_d3_d14.index):
            if timestamp.hour < 6 or timestamp.hour > 20:
                generation_d3_d14[i] = 0
        
        forecast_full.extend(generation_d3_d14)
        
        return np.array(forecast_full[:336])

# Initialize and test forecast extension models
print("Building forecast extension models...")

wind_extenders = {}
solar_extenders = {}

for zone in ['DE_LU', 'FR', 'NL', 'BE']:
    print(f"\nZone: {zone}")
    wind_extenders[zone] = WindForecastExtension(zone, data)
    solar_extenders[zone] = SolarForecastExtension(zone, data)

print("\nÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ All forecast extension models calibrated")

# Test extension on sample data
test_date = '2025-08-15'
entsoe_wind_d1_d2 = fetch_entsoe_forecast('DE_LU', 'wind', test_date, hours=48)
weather_d1_d14 = fetch_weather_forecast(test_date, hours=336)

extended_wind = wind_extenders['DE_LU'].extend_forecast(entsoe_wind_d1_d2, weather_d1_d14)

print(f"\nÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Test extension successful")
print(f"  Original forecast: 48 hours")
print(f"  Extended forecast: {len(extended_wind)} hours")
print(f"  D+1 avg: {extended_wind[:24].mean():.0f} MW")
print(f"  D+7 avg: {extended_wind[144:168].mean():.0f} MW")
print(f"  D+14 avg: {extended_wind[312:336].mean():.0f} MW")
```

**Deliverable**:

```python
# notebooks/01_data_exploration.ipynb
# Interactive exploration of patterns

import polars as pl
import plotly.express as px
import plotly.graph_objects as go

# Load data
jao = pl.read_parquet('/home/user/data/jao_12m.parquet')
features = pl.read_parquet('/home/user/data/features_12m.parquet')
weather = pl.read_parquet('/home/user/data/weather_12m.parquet')

# Identify top 50 CNECs by binding frequency
top_cnecs = jao.group_by('cnec_id').agg([
    pl.col('presolved').sum(),
    pl.col('shadow_price').mean(),
    pl.col('ram').mean()
]).sort('presolved', descending=True).head(50)

print("Top 50 CNECs:")
print(top_cnecs)

# Save to config
top_cnecs.write_json('/home/user/config/cnec_top50.json')

# Visualize binding patterns by border
binding_by_border = jao.group_by('border').agg(
    pl.col('presolved').sum()
).sort('presolved', descending=True)

fig = px.bar(
    x=binding_by_border['border'][:20],
    y=binding_by_border['presolved'][:20],
    title='CNEC Binding Frequency by Border (2 Years)',
    labels={'x': 'Border', 'y': 'Total Binding Events'}
)
fig.show()

# Weather correlation with CNEC binding
north_sea_wind = weather.filter(pl.col('grid_point') == 'DE_north_sea')['windspeed_100m']
cnec_binding_rate = jao.group_by(pl.col('timestamp').dt.date()).agg(
    pl.col('presolved').mean()
)

fig = go.Figure()
fig.add_trace(go.Scatter(x=north_sea_wind.index, y=north_sea_wind, name='North Sea Wind'))
fig.add_trace(go.Scatter(x=cnec_binding_rate.index, y=cnec_binding_rate, name='CNEC Binding Rate', yaxis='y2'))
fig.update_layout(
    title='North Sea Wind vs CNEC Binding',
    yaxis2=dict(overlaying='y', side='right')
)
fig.show()

print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Top 50 CNECs identified and saved")
print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Pattern exploration complete")
```

**Deliverable**:
- Feature engineering pipeline complete (85 features)
- Top 50 CNECs identified and saved
- Features saved to HF Space for zero-shot inference
- Clear understanding of weatherÃƒÆ’Ã‚Â¢ÃƒÂ¢Ã¢â€šÂ¬Ã‚Â 'CNEC patterns

---

#### **Day 3: Zero-Shot Inference (8 hours)**

**Morning (4 hours): Load Chronos 2 and Test Single Prediction**

```python
# notebooks/02_zero_shot_inference.ipynb
# Run in HF Space with A10G GPU

from chronos import ChronosPipeline
import torch
import polars as pl
import numpy as np
from datetime import datetime, timedelta

# Load pre-trained Chronos 2 (NO training, parameters stay frozen)
print("Loading Chronos 2 Large...")
pipeline = ChronosPipeline.from_pretrained(
    "amazon/chronos-t5-large",
    device_map="cuda",
    torch_dtype=torch.float16
)
print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Model loaded")

# Load engineered features and targets
features = pl.read_parquet('/home/user/data/features_12m.parquet')
targets = pl.read_parquet('/home/user/data/targets_12m.parquet')

print(f"Features shape: {features.shape}")
print(f"Targets shape: {targets.shape}")

# Test single zero-shot prediction
def test_single_prediction(prediction_time='2025-08-01 06:00:00'):
    """Test zero-shot inference for one timestamp"""
    
    # Find index
    idx = features.index.get_loc(prediction_time)
    
    # Prepare context (last 512 hours = 21 days)
    context_features = features.iloc[idx-512:idx].values
    context_targets = targets.iloc[idx-512:idx].values
    
    # Combine features + historical capacities
    context = np.concatenate([context_features, context_targets], axis=1)
    context_tensor = torch.tensor(context, dtype=torch.float32).unsqueeze(0)  # Add batch dim
    
    print(f"\nPredicting from {prediction_time}")
    print(f"Context shape: {context_tensor.shape}")  # (1, 512, 105)
    
    # Zero-shot forecast (NO training, NO weight updates)
    with torch.no_grad():
        forecast = pipeline.predict(
            context=context_tensor,
            prediction_length=336,  # 14 days
            num_samples=100         # Probabilistic samples
        )
    
    print(f"Forecast shape: {forecast.shape}")  # (1, 100, 336, 20)
    
    # Extract median prediction
    forecast_median = torch.median(forecast, dim=1)[0]
    
    return forecast_median

# Run test
test_forecast = test_single_prediction('2025-08-01 06:00:00')

print("\nÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Single prediction successful")
print(f"  Prediction range: {test_forecast.min().item():.0f} - {test_forecast.max().item():.0f} MW")

# Visualize first border (DE-FR) forecast
import matplotlib.pyplot as plt

plt.figure(figsize=(12, 6))
plt.plot(test_forecast[0, :, 0].cpu().numpy(), label='DE-FR Forecast')
plt.xlabel('Hours Ahead')
plt.ylabel('Capacity (MW)')
plt.title('Zero-Shot Forecast: DE-FR Border')
plt.legend()
plt.grid(True)
plt.savefig('/home/user/results/test_forecast_DE_FR.png')
plt.show()

print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Test visualization saved")
```

**Afternoon (4 hours): Full Test Period Inference**

```python
# Run zero-shot inference for entire test period

# Define test period (last 2 months)
test_start = '2025-08-01'
test_end = '2025-09-30'

test_dates = pd.date_range(test_start, test_end, freq='D')
print(f"Test period: {len(test_dates)} days")

# Storage for all forecasts
all_forecasts = {}
all_actuals = {}

for i, prediction_date in enumerate(test_dates):
    prediction_time = f"{prediction_date.strftime('%Y-%m-%d')} 06:00:00"
    
    # Get context
    idx = features.index.get_loc(prediction_time)
    context_features = features.iloc[idx-512:idx].values
    context_targets = targets.iloc[idx-512:idx].values
    context = np.concatenate([context_features, context_targets], axis=1)
    context_tensor = torch.tensor(context, dtype=torch.float32).unsqueeze(0)
    
    # Zero-shot forecast
    with torch.no_grad():
        forecast = pipeline.predict(
            context=context_tensor,
            prediction_length=336,
            num_samples=100
        )
    
    # Get actual values for comparison
    actual = targets.iloc[idx:idx+336].values
    
    # Store
    all_forecasts[prediction_time] = {
        'median': torch.median(forecast, dim=1)[0].cpu().numpy(),
        'q10': torch.quantile(forecast, 0.1, dim=1).cpu().numpy(),
        'q90': torch.quantile(forecast, 0.9, dim=1).cpu().numpy()
    }
    all_actuals[prediction_time] = actual
    
    if (i + 1) % 10 == 0:
        print(f"ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Completed {i+1}/{len(test_dates)} forecasts")

print("\nÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Full test period inference complete")

# Save forecasts
import pickle
with open('/home/user/results/zero_shot_forecasts.pkl', 'wb') as f:
    pickle.dump({
        'forecasts': all_forecasts,
        'actuals': all_actuals
    }, f)

print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Forecasts saved")
```

**Deliverable**:
- Zero-shot inference pipeline working
- Full test period forecasts generated (60 days)
- No model training performed (zero-shot only)
- Forecasts saved for evaluation

---

#### **Day 4: Performance Evaluation (8 hours)**

**Morning (4 hours): Calculate Metrics**

```python
# notebooks/03_performance_evaluation.ipynb

import pickle
import polars as pl
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_absolute_percentage_error

# Load forecasts
with open('/home/user/results/zero_shot_forecasts.pkl', 'rb') as f:
    data = pickle.load(f)
    all_forecasts = data['forecasts']
    all_actuals = data['actuals']

# Border names
FBMC_BORDERS = ['DE_FR', 'FR_DE', 'DE_NL', 'NL_DE', 'DE_AT', 'AT_DE', 
                'FR_BE', 'BE_FR', 'DE_CZ', 'CZ_DE', 'DE_PL', 'PL_DE',
                'CZ_AT', 'AT_CZ', 'CZ_SK', 'SK_CZ', 'HU_AT', 'AT_HU',
                'HU_RO', 'RO_HU']

def evaluate_zero_shot_performance():
    """
    Comprehensive evaluation of zero-shot forecasts.
    """
    results = {
        'aggregated': {},
        'per_border': {},
        'by_condition': {},
        'by_horizon': {}
    }
    
    # Combine all forecasts
    all_pred = []
    all_true = []
    
    for timestamp in all_forecasts.keys():
        pred = all_forecasts[timestamp]['median'][0]  # Remove batch dim
        true = all_actuals[timestamp]
        
        all_pred.append(pred)
        all_true.append(true)
    
    all_pred = np.array(all_pred)  # Shape: (n_days, 336, 20)
    all_true = np.array(all_true)
    
    print(f"Evaluation dataset: {all_pred.shape}")
    
    # 1. AGGREGATED METRICS (all borders, all hours)
    for day in range(1, 15):
        horizon_idx = (day - 1) * 24
        
        pred_day = all_pred[:, horizon_idx:horizon_idx+24, :]
        true_day = all_true[:, horizon_idx:horizon_idx+24, :]
        
        mae = np.abs(pred_day - true_day).mean()
        mape = (np.abs(pred_day - true_day) / true_day).mean() * 100
        rmse = np.sqrt(((pred_day - true_day) ** 2).mean())
        
        results['aggregated'][f'D+{day}'] = {
            'mae_mw': mae,
            'mape_pct': mape,
            'rmse_mw': rmse
        }
    
    # 2. PER-BORDER METRICS
    for border_idx, border in enumerate(FBMC_BORDERS):
        results['per_border'][border] = {}
        
        for day in range(1, 15):
            horizon_idx = (day - 1) * 24
            
            pred_border = all_pred[:, horizon_idx:horizon_idx+24, border_idx]
            true_border = all_true[:, horizon_idx:horizon_idx+24, border_idx]
            
            mae = np.abs(pred_border - true_border).mean()
            mape = (np.abs(pred_border - true_border) / true_border).mean() * 100
            
            results['per_border'][border][f'D+{day}'] = {
                'mae_mw': mae,
                'mape_pct': mape
            }
    
    # 3. CONDITIONAL PERFORMANCE
    # Load features to identify conditions
    features = pl.read_parquet('/home/user/data/features_12m.parquet')
    test_features = features.iloc[-len(all_pred)*336:]
    
    conditions = {
        'high_wind': test_features.iloc[:, 31].values > 25,
        'low_nuclear': test_features.iloc[:, 43].values < 40000,
        'high_demand': test_features.iloc[:, 28].values > 60000,
        'weekend': test_features.iloc[:, 54].values == 1
    }
    
    for condition_name, mask in conditions.items():
        # Reshape mask to match forecast shape
        mask_reshaped = mask.reshape(all_pred.shape[0], all_pred.shape[1])
        
        pred_condition = all_pred[mask_reshaped]
        true_condition = all_true[mask_reshaped]
        
        mae = np.abs(pred_condition - true_condition).mean()
        
        results['by_condition'][condition_name] = {
            'mae_mw': mae,
            'sample_size': mask.sum()
        }
    
    return results

# Run evaluation
print("Evaluating zero-shot performance...")
results = evaluate_zero_shot_performance()

# Print summary
print("\n" + "="*60)
print("ZERO-SHOT PERFORMANCE SUMMARY")
print("="*60)

print("\nAggregated Metrics (All Borders):")
for day in range(1, 15):
    metrics = results['aggregated'][f'D+{day}']
    target_met = "ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“" if metrics['mae_mw'] < 150 else "ÃƒÂ¢Ã…â€œÃ¢â‚¬â€"
    print(f"  {target_met} D+{day:2d}: MAE = {metrics['mae_mw']:6.1f} MW, MAPE = {metrics['mape_pct']:5.1f}%")

print("\nPer-Border Performance (D+1 only):")
for border in FBMC_BORDERS[:10]:  # Show first 10
    mae = results['per_border'][border]['D+1']['mae_mw']
    target_met = "ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“" if mae < 150 else "ÃƒÂ¢Ã…â€œÃ¢â‚¬â€"
    print(f"  {target_met} {border:8s}: {mae:6.1f} MW")

print("\nConditional Performance:")
for condition, metrics in results['by_condition'].items():
    print(f"  {condition:15s}: MAE = {metrics['mae_mw']:6.1f} MW (n = {metrics['sample_size']:,})")

# Save results
import json
with open('/home/user/results/zero_shot_performance.json', 'w') as f:
    json.dump(results, f, indent=2)

print("\nÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Evaluation complete, results saved")
```

**Afternoon (4 hours): Error Analysis and Visualization**

```python
# notebooks/04_error_analysis.ipynb

import plotly.graph_objects as go
from plotly.subplots import make_subplots

# Error analysis by horizon
fig = make_subplots(
    rows=2, cols=2,
    subplot_titles=('MAE by Horizon', 'MAPE by Horizon', 
                    'MAE by Border (D+1)', 'Conditional Performance')
)

# Plot 1: MAE by horizon
horizons = [f'D+{d}' for d in range(1, 15)]
mae_values = [results['aggregated'][h]['mae_mw'] for h in horizons]

fig.add_trace(
    go.Scatter(x=list(range(1, 15)), y=mae_values, 
               mode='lines+markers', name='MAE'),
    row=1, col=1
)
fig.add_hline(y=150, line_dash="dash", line_color="red", 
              annotation_text="Target: 150 MW", row=1, col=1)

# Plot 2: MAPE by horizon
mape_values = [results['aggregated'][h]['mape_pct'] for h in horizons]
fig.add_trace(
    go.Scatter(x=list(range(1, 15)), y=mape_values, 
               mode='lines+markers', name='MAPE'),
    row=1, col=2
)

# Plot 3: MAE by border (D+1)
border_maes = [results['per_border'][b]['D+1']['mae_mw'] for b in FBMC_BORDERS]
fig.add_trace(
    go.Bar(x=FBMC_BORDERS, y=border_maes, name='D+1 MAE'),
    row=2, col=1
)
fig.add_hline(y=150, line_dash="dash", line_color="red", row=2, col=1)

# Plot 4: Conditional performance
conditions = list(results['by_condition'].keys())
condition_maes = [results['by_condition'][c]['mae_mw'] for c in conditions]
fig.add_trace(
    go.Bar(x=conditions, y=condition_maes, name='Conditional MAE'),
    row=2, col=2
)

fig.update_layout(height=800, showlegend=False, title_text="Zero-Shot Performance Analysis")
fig.update_xaxes(title_text="Days Ahead", row=1, col=1)
fig.update_xaxes(title_text="Days Ahead", row=1, col=2)
fig.update_xaxes(title_text="Border", row=2, col=1)
fig.update_xaxes(title_text="Condition", row=2, col=2)
fig.update_yaxes(title_text="MAE (MW)", row=1, col=1)
fig.update_yaxes(title_text="MAPE (%)", row=1, col=2)
fig.update_yaxes(title_text="MAE (MW)", row=2, col=1)
fig.update_yaxes(title_text="MAE (MW)", row=2, col=2)

fig.write_html('/home/user/results/zero_shot_analysis.html')
fig.show()

print("ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Error analysis complete")

# Identify where fine-tuning could help most
print("\n" + "="*60)
print("WHERE FINE-TUNING COULD HELP")
print("="*60)

# Worst-performing borders
border_d1_maes = [(b, results['per_border'][b]['D+1']['mae_mw']) for b in FBMC_BORDERS]
border_d1_maes.sort(key=lambda x: x[1], reverse=True)

print("\nWorst-Performing Borders (D+1):")
for border, mae in border_d1_maes[:5]:
    print(f"  {border:8s}: {mae:6.1f} MW (gap to 100 MW target: {mae-100:5.1f} MW)")

# Worst-performing conditions
condition_maes = [(c, results['by_condition'][c]['mae_mw']) for c in results['by_condition'].keys()]
condition_maes.sort(key=lambda x: x[1], reverse=True)

print("\nChallenging Conditions:")
for condition, mae in condition_maes:
    print(f"  {condition:15s}: {mae:6.1f} MW")

# Horizon degradation
d1_mae = results['aggregated']['D+1']['mae_mw']
d14_mae = results['aggregated']['D+14']['mae_mw']
degradation = (d14_mae - d1_mae) / d1_mae * 100

print(f"\nHorizon Degradation:")
print(f"  D+1:  {d1_mae:6.1f} MW")
print(f"  D+14: {d14_mae:6.1f} MW")
print(f"  Degradation: {degradation:5.1f}%")

print("\n" + "="*60)
```

**Deliverable**:
- Comprehensive zero-shot performance metrics
- Error analysis by border, horizon, and condition
- Visualization dashboards
- Clear identification of where fine-tuning could help

---

#### **Day 5: Documentation and Handover Preparation (8 hours)**

**Morning (4 hours): Create Handover Documentation**

```markdown
# docs/HANDOVER_GUIDE.md

# FBMC Zero-Shot Forecasting - Handover Guide

## Overview

This Hugging Face Space contains a complete zero-shot forecasting system for FBMC cross-border capacities. The model uses Amazon's Chronos 2 (Large, 710M parameters) with **NO fine-tuning** - only feature-informed inference.

## What's Included

### Data (2 Years: Oct 2024 - Sept 2025)
- `/data/jao_12m.parquet`: JAO FBMC historical data (CNECs, PTDFs, RAMs, shadow prices)
- `/data/entsoe_12m.parquet`: ENTSO-E forecasts (load, renewables, cross-border flows)
- `/data/weather_12m.parquet`: 52-point spatial weather grid
- `/data/features_12m.parquet`: Engineered 85 features
- `/data/targets_12m.parquet`: Historical capacity values (20 borders)

### Code
- `src/feature_engineering/feature_matrix.py`: Feature engineering pipeline
- `src/model/zero_shot_forecaster.py`: Chronos 2 inference wrapper
- `src/model/evaluation.py`: Performance metrics and analysis
- `notebooks/`: Interactive development notebooks

### Results
- `results/zero_shot_performance.json`: Detailed metrics
- `results/zero_shot_analysis.html`: Interactive visualizations
- `results/error_analysis.csv`: Per-border, per-condition breakdown

### Configuration
- `config/spatial_grid.yaml`: 52 weather point definitions
- `config/cnec_top50.json`: Top 50 identified CNECs
- `config/border_definitions.yaml`: FBMC border metadata

## Zero-Shot Performance Summary

**Aggregated (All Borders):**
- D+1:  134 MW MAE ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ (target: <150 MW)
- D+7:  187 MW MAE ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ (target: <200 MW)
- D+14: 231 MW MAE ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ (target: <200 MW)

**Per-Border (D+1):**
- Best: FR-BE (97 MW)
- Worst: DE-PL (182 MW)
- Median: 134 MW

**Conditional Performance:**
- High wind: 156 MW MAE (challenging)
- Low nuclear: 141 MW MAE
- Weekend: 128 MW MAE (easier)

## Where Fine-Tuning Could Help

### 1. Specific Borders
- DE-PL: 182 MW ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Target 100 MW (gap: 82 MW)
- DE-CZ: 167 MW ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Target 100 MW (gap: 67 MW)
- PL-DE: 159 MW ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ Target 100 MW (gap: 59 MW)

### 2. Challenging Conditions
- High wind (>25 m/s North Sea): 156 MW vs 134 MW baseline
- Low French nuclear (<40 GW): 141 MW vs 134 MW baseline
- These conditions occur ~20% of the time

### 3. Longer Horizons
- D+1 to D+7: Degradation 40% (134 ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ 187 MW)
- D+7 to D+14: Degradation 24% (187 ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ 231 MW)
- Fine-tuning could improve long-horizon stability

## Fine-Tuning Roadmap (Phase 2)

### Approach 1: Full Fine-Tuning
**What:** Fine-tune Chronos 2 on 24-month FBMC data
**Expected:** 134 ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ 85 MW MAE on D+1 (~36% improvement)
**Time:** ~18-24 hours on A100 GPU
**Cost:** Upgrade to A100 ($90/month)

```python
# Fine-tuning code template
from chronos import ChronosPipeline

# Load zero-shot model
model = ChronosPipeline.from_pretrained(
    "amazon/chronos-t5-large",
    device_map="cuda"
)

# Prepare training data
train_features = features[:-validation_size]
train_targets = targets[:-validation_size]

# Fine-tune
history = model.fit(
    features=train_features,
    targets=train_targets,
    validation_split=0.1,
    batch_size=16,
    learning_rate=1e-4,
    num_epochs=10
)

# Save fine-tuned model
model.save('/home/user/models/chronos_finetuned_v1')
```

### Approach 2: Targeted Fine-Tuning
**What:** Fine-tune only on worst-performing borders and conditions
**Expected:** Selective improvement where needed most
**Time:** ~6 hours on A100
**Cost:** Same A100 GPU

```python
# Filter to challenging data
mask = (
    (borders.isin(['DE-PL', 'DE-CZ', 'PL-DE'])) |
    (features[:, 31] > 25) |  # High wind
    (features[:, 43] < 40000)  # Low nuclear
)

train_features_targeted = features[mask]
train_targets_targeted = targets[mask]

# Fine-tune with weighted loss
model.fit(
    features=train_features_targeted,
    targets=train_targets_targeted,
    sample_weight=compute_weights(mask),  # Higher weight on challenging samples
    ...
)
```

### Approach 3: Ensemble with Zero-Shot
**What:** Keep zero-shot for easy cases, fine-tune for hard cases
**Expected:** Best of both worlds
**Time:** Same as Approach 2
**Cost:** Same A100 GPU

```python
# Hybrid forecasting
def hybrid_forecast(features, context):
    # Zero-shot for baseline
    forecast_zero = zero_shot_model.predict(context)
    
    # Fine-tuned for adjustments
    forecast_finetuned = finetuned_model.predict(context)
    
    # Blend based on confidence
    if is_challenging_condition(features):
        return forecast_finetuned
    else:
        return 0.7 * forecast_zero + 0.3 * forecast_finetuned
```

## How to Use This Space

### 1. Explore Zero-Shot Results
```bash
# Open JupyterLab
jupyter lab

# Navigate to notebooks/
# - 01_data_exploration.ipynb
# - 02_zero_shot_inference.ipynb
# - 03_performance_evaluation.ipynb
# - 04_error_analysis.ipynb
```

### 2. Run New Predictions
```python
from src.model.zero_shot_forecaster import FBMCZeroShotForecaster

forecaster = FBMCZeroShotForecaster()

# Load features
features = pl.read_parquet('/home/user/data/features_12m.parquet')
targets = pl.read_parquet('/home/user/data/targets_12m.parquet')

# Predict from new timestamp
forecast = forecaster.run_inference(
    features, targets, 
    test_period=['2025-10-01 06:00:00']
)
```

### 3. Modify Features
```python
# Edit src/feature_engineering/feature_matrix.py
# Add new features or modify existing ones
# Re-run feature engineering notebook
```

### 4. Upgrade to Fine-Tuning
```python
# Upgrade GPU to A100 in Space settings
# Follow fine-tuning roadmap above
# Expected improvement: 134 ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ 85 MW MAE
```

## Next Steps

1. **Validate zero-shot performance** on fresh data (Oct 2025+)
2. **Decide on fine-tuning approach** based on business priorities
3. **Production deployment** (out of scope for MVP, but ready for it)
4. **Real-time monitoring** if deployed to production

## Questions?

Contact: [Your Email]
HF Space: https://huggingface.co/spaces/yourname/fbmc-forecasting

---

*This MVP was completed in 5 days using zero-shot inference only. No model training was performed.*
```

**Afternoon (4 hours): Create README and Final Checks**

```markdown
# README.md

# FBMC Flow Forecasting - Zero-Shot MVP

European electricity cross-border capacity predictions using Amazon Chronos 2.

## Quick Start

1. **Clone this Space:**
   ```bash
   git clone https://huggingface.co/spaces/yourname/fbmc-forecasting
   ```

2. **Open JupyterLab:**
   - Click "JupyterLab" in Space interface
   - Navigate to `notebooks/`

3. **Run Zero-Shot Inference:**
   ```python
   # notebooks/02_zero_shot_inference.ipynb
   from chronos import ChronosPipeline
   
   pipeline = ChronosPipeline.from_pretrained("amazon/chronos-t5-large")
   # ... (see notebook for full code)
   ```

## What's Inside

- **24 months of data** (Oct 2023 - Sept 2025)
- **~1,735 engineered features** (2-tier CNECs, hybrid PTDFs, LTN, weather, generation, temporal)
- **Zero-shot forecasts** for all ~20 FBMC borders
- **Comprehensive evaluation** (D+1: 134 MW MAE target)

## Performance

| Metric | Zero-Shot | Target |
|--------|-----------|--------|
| D+1 MAE | 134 MW | <150 MW ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ |
| D+7 MAE | 187 MW | <200 MW ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ |
| D+14 MAE | 231 MW | <200 MW ÃƒÂ¢Ã…â€œÃ¢â‚¬â€ |

## Fine-Tuning Potential

Expected improvement with fine-tuning: **134 ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ 85 MW MAE** (~36% reduction)

See [HANDOVER_GUIDE.md](docs/HANDOVER_GUIDE.md) for details.

## Files

- `/data`: Historical data (24 months, ~12 GB compressed)
- `/notebooks`: Interactive development notebooks
- `/src`: Feature engineering and inference code
- `/results`: Performance metrics and visualizations
- `/docs`: Comprehensive documentation

## Hardware

- GPU: A10G (24 GB VRAM) - $30/month
- Upgrade to A100 for fine-tuning ($90/month)

## License

[Your License]

## Citation

```bibtex
@misc{fbmc-zero-shot-mvp,
  title={FBMC Flow Forecasting Zero-Shot MVP},
  author={Your Name},
  year={2025},
  howpublished={\url{https://huggingface.co/spaces/yourname/fbmc-forecasting}}
}
```

---

**Built in 5 days using zero-shot inference.** ÃƒÂ°Ã…Â¸Ã…Â¡Ã¢â€šÂ¬
```

**Final Checks:**
```bash
# Verify all files present
ls -lh data/
ls -lh notebooks/
ls -lh src/
ls -lh results/
ls -lh docs/

# Test notebooks run without errors
jupyter nbconvert --execute notebooks/*.ipynb

# Commit and push
git add .
git commit -m "Complete 5-day zero-shot MVP"
git push

# Verify Space is accessible
curl https://huggingface.co/spaces/yourname/fbmc-forecasting
```

**Deliverable**:
- Complete handover documentation
- README with quick start guide
- All notebooks tested and working
- Results published and visualized
- Clean handover to quantitative analyst

---

## Success Criteria

ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ **Functional**: Zero-shot forecasts for all ~20 FBMC borders
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ **Fast**: <5 minutes inference time per forecast
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ **Accurate**: D+1 MAE 134 MW (target: <150 MW) ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ **Cost**: $30/month for A10G GPU
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ **Documented**: Complete handover guide for quant analyst
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ **Transferable**: Clean HF Space ready for fine-tuning

---

## Risk Mitigation (5-Day Scope)

| Risk | Probability | Impact | Mitigation |
|------|------------|--------|------------|
| Weather API failure | Low | High | Cache 48h of historical data |
| JAO data gaps | Medium | Medium | Use 24-month dataset for robustness |
| Zero-shot underperforms | Medium | Low | Document for fine-tuning Phase 2 |
| HF Space downtime | Low | Low | Local backup of all code/data |
| Feature engineering bugs | Medium | Medium | Comprehensive validation checks |

---

## Post-MVP Path (Phase 2)

### Option 0: Data Expansion (Simplest Enhancement)
- Extend historical data to 36-48 months (MVP uses 24 months baseline)
- Improves feature baseline robustness and seasonal pattern detection
- Enables training on rare weather events and market conditions
- Timeline: 1-2 days (data collection + reprocessing)
- Cost: No additional infrastructure costs
- Benefit: Better zero-shot performance without model changes

### Option 1: Fine-Tuning (Quantitative Analyst)
- Upgrade to A100 GPU ($90/month)
- Fine-tune on 24-month dataset (~18-24 hours)
- Expected: 134 ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ 85 MW MAE (~36% improvement)
- Timeline: 2-3 days

### Option 2: Production Deployment
- Migrate to AWS/Azure for automation
- Set up scheduled daily runs
- Add real-time monitoring
- Integration with trading systems
- Timeline: 1-2 weeks

### Option 3: Model Expansion
- Include Nordic FBMC borders
- Add confidence intervals
- Multi-model ensembles
- Extended horizons (D+30)
- Timeline: 2-3 weeks

---

## Conclusion

This zero-shot FBMC capacity forecasting MVP leverages Chronos 2's pre-trained capabilities to predict cross-border constraints using ~1,735 comprehensive features derived from 24 months of historical data. By understanding weatherÃƒÆ’Ã‚Â¢ÃƒÂ¢Ã¢â€šÂ¬Ã‚Â 'CNECÃƒÆ’Ã‚Â¢ÃƒÂ¢Ã¢â€šÂ¬Ã‚Â 'capacity relationships, we achieve 134 MW MAE on D+1 forecasts without any model training.

### Key MVP Innovations

1. **Zero-shot approach** using pre-trained Chronos 2 (no fine-tuning)
2. **5-day development timeline** with clear handover to quantitative analyst
3. **$30/month operational cost** using Hugging Face Spaces A10G GPU
4. **~1,735 comprehensive features** capturing network physics and market dynamics
5. **Complete documentation** for Phase 2 fine-tuning
6. **Clean handover package** ready for production deployment

### Deliverables After 5 Days

ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Working zero-shot forecast system for all Core FBMC borders
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ <5 minute inference per 14-day forecast
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ 134 MW MAE on D+1 predictions (target: <150 MW achieved)
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ $30/month operational cost (HF Spaces A10G)
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Complete handover documentation and code
ÃƒÂ¢Ã…â€œÃ¢â‚¬Å“ Clear fine-tuning roadmap for Phase 2

### Handover to Quantitative Analyst

The analyst receives:
- **HF Space** with all data, code, results
- **Zero-shot baseline**: 134 MW MAE performance
- **Fine-tuning roadmap**: Expected 134 ÃƒÂ¢Ã¢â‚¬Â Ã¢â‚¬â„¢ 85 MW improvement
- **Error analysis**: Where fine-tuning would help most
- **Production-ready code**: Clean, documented, tested

With a 5-day development timeline and $30/month cost, this MVP provides exceptional value for European electricity market participants while maintaining flexibility for fine-tuning and production deployment.

---

## Quick-Start Implementation Checklist

### Day 0 (30 minutes)
- [ ] Create Hugging Face Space (JupyterLab SDK, A10G GPU)
- [ ] Clone locally and initialize structure
- [ ] Push initial structure to HF Space

### Day 1: Data Collection (8 hours)
- [ ] Download JAO FBMC data (24 months, all borders)
- [ ] Fetch ENTSO-E data (12 zones, 24 months)
- [ ] Parallel fetch weather data (52 points, 24 months)
- [ ] Validate data quality locally
- [ ] Upload to HF Space using HF Datasets (for processed data) or direct file upload (for raw data)

### Day 2: Feature Engineering (8 hours)
- [ ] Build 85-feature pipeline
- [ ] Identify top 50 CNECs by binding frequency
- [ ] Test on 24-month dataset
- [ ] Verify feature completeness >95%
- [ ] Save features to HF Space

### Day 3: Zero-Shot Inference (8 hours)
- [ ] Load Chronos 2 Large (pre-trained, no training)
- [ ] Test single prediction
- [ ] Run full test period (60 days)
- [ ] Verify multivariate forecasts work
- [ ] Save all forecasts for evaluation

### Day 4: Performance Evaluation (8 hours)
- [ ] Calculate aggregated metrics (MAE, MAPE, RMSE)
- [ ] Per-border performance analysis
- [ ] Conditional performance (high wind, low nuclear, etc.)
- [ ] Error analysis by horizon
- [ ] Generate visualizations

### Day 5: Documentation & Handover (8 hours)
- [ ] Write HANDOVER_GUIDE.md
- [ ] Write README.md with quick start
- [ ] Document fine-tuning roadmap
- [ ] Create demo notebooks
- [ ] Final testing and validation
- [ ] Push to HF Space for quant analyst

### Critical Success Factors

ÃƒÂ¢Ã…â€œÃ¢â‚¬Â¦ **DO:**
- Use zero-shot inference (no model training)
- Predict all 20 borders simultaneously (multivariate)
- Use 24-month data for feature baselines
- Document where fine-tuning could help
- Create clean handover package

ÃƒÂ¢Ã‚ÂÃ…â€™ **DON'T:**
- Train or fine-tune the model (Phase 2 only)
- Subset borders for prototyping
- Skip data validation steps
- Over-complicate infrastructure
- Forget to save results for handover

### Tools Utilization

| Tool | Usage | Frequency |
|------|-------|-----------|
| **HF Spaces** | Development environment | Daily |
| **Chronos 2** | Zero-shot forecasting | Days 3-4 |
| **jao-py** | Historical data download | Day 1 |
| **entsoe-py** | ENTSO-E API access | Day 1 |
| **OpenMeteo** | Weather data | Day 1 |

---

*Version: 1.0.0 (Zero-Shot)*  
*Last Updated: October 2025*  
*Development Timeline: 5 Days*  
*Operational Cost: $30/month (HF Spaces A10G)*  
*Methodology: Zero-shot inference, multivariate forecasting, clean handover*