PyLahman

pylahman is a Python package for accessing the Lahman Baseball Database via pandas.

The data used in this package is provided by SABR and is licensed under CC BY-SA 3.0. The data was last updated based on the source data available from https://sabr.org/lahman-database/ on 2025-07-18.

The surrounding software is licensed under the MIT License.

Installation

The pylahman package can be installed via pip.

pip install pylahman

Documentation and Usage

Each table in the Lahman Baseball Database has a corresponding data loading function in pylahman with the same name. For example, the Pitching table is accessed via the Pitching() function. Descriptions of each data table can be found in the documentation for the corresponding data loading function in the API Reference.

import pylahman

Pitching = pylahman.Pitching()
print(Pitching.head())
    playerID  yearID  stint teamID lgID  W  L   G  GS  CG  ...  IBB  WP  HBP  \
0  aardsda01    2004      1    SFN   NL  1  0  11   0   0  ...    0   0    2   
1  aardsda01    2006      1    CHN   NL  3  0  45   0   0  ...    0   1    1   
2  aardsda01    2007      1    CHA   AL  2  1  25   0   0  ...    3   2    1   
3  aardsda01    2008      1    BOS   AL  4  2  47   0   0  ...    2   3    5   
4  aardsda01    2009      1    SEA   AL  3  6  73   0   0  ...    3   2    0   

   BK  BFP  GF   R  SH  SF  GIDP  
0   0   61   5   8   0   1     1  
1   0  225   9  25   1   3     2  
2   0  151   7  24   2   1     1  
3   0  228   7  32   3   2     4  
4   0  296  53  23   2   1     2  

[5 rows x 30 columns]