dqm package

Subpackages

Submodules

dqm.main module

This script is the entry point for using DQM with command line and docker

dqm.main.load_dataframe(config_dict)[source]

This function loads a pandas dataframe from the config dict passed as input. This config dict comes from a pipeline configuration: An example of such pipeline is present in examples/ folder

Parameters:

config_dict (dict) – Dict containing a metric configuration

dqm.main.load_raw_data(file, separator)[source]

This function load a raw data file content as a pandas dataframe

Parameters:
  • file (str) – Path of the file to load

  • separator (str) – Separator to use when processing csv and txt format file

Returns:

Output dataframe

Return type:

df (pandas.DataFrame)

dqm.main.main()[source]

Main script of DQM component:

Parameters:
  • pipeline_config_path (str) – Path to the pipeline definition you want to apply

  • result_file_path – (str): Path the output YAML file where all computed metrics scores are stored

Module contents