# skchem.data.converters.base
Defines the base converter class.
skchem.data.converters.base.Converter(directory, output_directory, output_filename='default.h5')[source]¶Bases: object
Create a fuel dataset from molecules and targets.
run(ms, y, output_path, splits=None, features=None, pytables_kws={'complib': 'bzip2', 'complevel': 9})[source]¶Args:
source_names¶split_names¶skchem.data.converters.base.Feature(fper, key, axis_names)¶Bases: tuple
axis_names¶Alias for field number 2
fper¶Alias for field number 0
key¶Alias for field number 1
skchem.data.converters.base.Split(mask, name, converter)[source]¶Bases: object
contiguous¶indices¶ref¶skchem.data.converters.base.contiguous_order(to_order, splits)[source]¶Determine a contiguous order from non-overlapping splits, and put data in that order.
| Parameters: |
|
|---|---|
| Returns: | The data in contiguous order. |
| Return type: | iterable<pd.Series, pd.DataFrame, pd.Panel> |
# skchem.data.coverters.example
Formatter for the example dataset.
skchem.data.converters.nmrshiftdb2.NMRShiftDB2Converter(directory, output_directory, output_filename='nmrshiftdb2.h5')[source]¶Bases: skchem.data.converters.base.Converter
combine_duplicates(data)[source]¶Collect duplicate spectra into one dictionary. All shifts are collected into lists.
process_spectra(data)[source]¶Turn the string representations found in sdf file into a dictionary.
## skchem.data.transformers.tox21
Module defining transformation techniques for tox21.
skchem.data.converters.tox21.Tox21Converter(directory, output_directory, output_filename='tox21.h5')[source]¶Bases: skchem.data.converters.base.Converter
Class to build tox21 dataset.
skchem.data.converters.DiversityConverter(directory, output_directory, output_filename='diversity.h5')[source]¶Bases: skchem.data.converters.base.Converter
Example Converter, using the NCI DTP Diversity Set III.
skchem.data.converters.BursiAmesConverter(directory, output_directory, output_filename='bursi_ames.h5')[source]¶skchem.data.converters.MullerAmesConverter(directory, output_directory, output_filename='muller_ames.h5')[source]¶skchem.data.converters.PhysPropConverter(directory, output_directory, output_filename='physprop.h5')[source]¶skchem.data.converters.BradleyOpenMPConverter(directory, output_directory, output_filename='bradley_open_mp.h5')[source]¶skchem.data.converters.NMRShiftDB2Converter(directory, output_directory, output_filename='nmrshiftdb2.h5')[source]¶Bases: skchem.data.converters.base.Converter
combine_duplicates(data)[source]¶Collect duplicate spectra into one dictionary. All shifts are collected into lists.
process_spectra(data)[source]¶Turn the string representations found in sdf file into a dictionary.
skchem.data.converters.Tox21Converter(directory, output_directory, output_filename='tox21.h5')[source]¶Bases: skchem.data.converters.base.Converter
Class to build tox21 dataset.