actio_python_utils.spark_functions.load_excel_to_dataframe

actio_python_utils.spark_functions.load_excel_to_dataframe(self, xl_fn, header=True, load_config_options=None, **kwargs)[source]

Load and return the specified Excel spreadsheet with PySpark

Parameters:
  • self (SparkSession) – The PySpark session to use

  • xl_fn (str) – The path to the data source to load

  • header (bool, default: True) – Whether the data source has a header or not

  • load_config_options (Optional[Iterable[tuple[str, str]]], default: None) – Any additonal config options to load data

  • **kwargs – Any additional named arguments

Return type:

DataFrame

Returns:

The dataframe requested