A full sabermetrics portal for Taiwan's professional baseball league (CPBL), built from scratch and end-to-end: ETL, API, dashboard, and visualization.
Why I built this
Taiwan's baseball community deserves the same depth of analysis that MLB fans take for granted. Tools like FanGraphs and Baseball Savant don't cover CPBL, and the league's own data portal stops at traditional counting stats.
I wanted to prove that a single engineer could close that gap by building a production-grade sabermetrics pipeline for CPBL that covers ETL, API, statistical modeling, and interactive visualization.
Results by the numbers
Analysis modules
LOB%: strand rate, the share of baserunners a pitcher leaves on base. A large divergence from the league baseline is a key indicator for separating pitcher luck from skill.
Clutch performance: an RE24-based pressure-index framework measuring how players perform in high-pressure moments.
Count splits and heat maps: a per-count performance breakdown revealing pitcher tendencies and batter chase rates, for studying pitch-calling strategy.
Pitcher fatigue curves: performance grouped into 15-pitch buckets, with changepoint detection that automatically finds the decline point and identifies effective pitch limits.
wRC+: park-factor-adjusted run creation, a single-number measure of hitter value. Built independently from scratch for CPBL.
Park factors: calculated along two dimensions, team-based (home/away) and venue-based, with multi-season smoothing.
BABIP regression analysis: mean-reversion analysis identifying overperformers and underperformers likely to normalize, that is, who is due for a rebound and who is due for a collapse.
First-half/second-half splits: a comparison of the two halves of the season, paired with rolling wOBA to track in-season trends and flag regression candidates.
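Several of the modules above rest on short formulas, so a minimal sketch may help make them concrete. It uses the commonly published definitions of LOB% and BABIP and a simple 15-pitch bucketing rule. Whether the portal computes them exactly this way is an assumption, and `PitcherLine`, its field names, and the sample numbers are hypothetical and not taken from the project.

```python
from dataclasses import dataclass


@dataclass
class PitcherLine:
    """Season totals allowed by one pitcher (hypothetical field names)."""
    hits: int
    walks: int
    hit_by_pitch: int
    runs: int
    home_runs: int
    at_bats: int
    strikeouts: int
    sac_flies: int


def lob_pct(p: PitcherLine) -> float:
    """Strand rate: (H + BB + HBP - R) / (H + BB + HBP - 1.4 * HR)."""
    baserunners = p.hits + p.walks + p.hit_by_pitch
    denominator = baserunners - 1.4 * p.home_runs
    if denominator <= 0:
        raise ValueError("not enough baserunners to compute LOB%")
    return (baserunners - p.runs) / denominator


def babip(p: PitcherLine) -> float:
    """Batting average on balls in play: (H - HR) / (AB - K - HR + SF)."""
    balls_in_play = p.at_bats - p.strikeouts - p.home_runs + p.sac_flies
    if balls_in_play <= 0:
        raise ValueError("no balls in play")
    return (p.hits - p.home_runs) / balls_in_play


def gap_from_baseline(value: float, league_baseline: float) -> float:
    """Signed distance from the league baseline; large gaps flag regression candidates."""
    return value - league_baseline


def pitch_bucket(pitch_number: int, size: int = 15) -> int:
    """0-based bucket for a 1-based pitch count: pitches 1-15 -> 0, 16-30 -> 1."""
    if pitch_number < 1:
        raise ValueError("pitch_number is 1-based")
    return (pitch_number - 1) // size


# Made-up season line, for illustration only.
sample = PitcherLine(hits=150, walks=50, hit_by_pitch=5, runs=70,
                     home_runs=15, at_bats=650, strikeouts=140, sac_flies=5)
print(round(lob_pct(sample), 3))  # 0.734
print(round(babip(sample), 3))    # 0.27
print(pitch_bucket(16))           # 1
```

The league baseline is passed in rather than hard-coded because it has to be computed from CPBL data for the season in question. The changepoint step of the fatigue module is left out here, since the page does not say which detection method it uses.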
Tech stack
What I learned
See it in action
cpblanalysis.mursfoto.com ↗