# skchem.data.converters.base
Defines the base converter class.
skchem.data.converters.base.
Converter
(directory, output_directory, output_filename='default.h5')[source]¶Bases: object
Create a fuel dataset from molecules and targets.
run
(ms, y, output_path, splits=None, features=None, pytables_kws={'complib': 'bzip2', 'complevel': 9})[source]¶Args:
source_names
¶split_names
¶skchem.data.converters.base.
Feature
(fper, key, axis_names)¶Bases: tuple
axis_names
¶Alias for field number 2
fper
¶Alias for field number 0
key
¶Alias for field number 1
skchem.data.converters.base.
Split
(mask, name, converter)[source]¶Bases: object
contiguous
¶indices
¶ref
¶skchem.data.converters.base.
contiguous_order
(to_order, splits)[source]¶Determine a contiguous order from non-overlapping splits, and put data in that order.
Parameters: |
|
---|---|
Returns: | The data in contiguous order. |
Return type: | iterable<pd.Series, pd.DataFrame, pd.Panel> |
# skchem.data.coverters.example
Formatter for the example dataset.
skchem.data.converters.nmrshiftdb2.
NMRShiftDB2Converter
(directory, output_directory, output_filename='nmrshiftdb2.h5')[source]¶Bases: skchem.data.converters.base.Converter
combine_duplicates
(data)[source]¶Collect duplicate spectra into one dictionary. All shifts are collected into lists.
process_spectra
(data)[source]¶Turn the string representations found in sdf file into a dictionary.
## skchem.data.transformers.tox21
Module defining transformation techniques for tox21.
skchem.data.converters.tox21.
Tox21Converter
(directory, output_directory, output_filename='tox21.h5')[source]¶Bases: skchem.data.converters.base.Converter
Class to build tox21 dataset.
skchem.data.converters.
DiversityConverter
(directory, output_directory, output_filename='diversity.h5')[source]¶Bases: skchem.data.converters.base.Converter
Example Converter, using the NCI DTP Diversity Set III.
skchem.data.converters.
BursiAmesConverter
(directory, output_directory, output_filename='bursi_ames.h5')[source]¶skchem.data.converters.
MullerAmesConverter
(directory, output_directory, output_filename='muller_ames.h5')[source]¶skchem.data.converters.
PhysPropConverter
(directory, output_directory, output_filename='physprop.h5')[source]¶skchem.data.converters.
BradleyOpenMPConverter
(directory, output_directory, output_filename='bradley_open_mp.h5')[source]¶skchem.data.converters.
NMRShiftDB2Converter
(directory, output_directory, output_filename='nmrshiftdb2.h5')[source]¶Bases: skchem.data.converters.base.Converter
combine_duplicates
(data)[source]¶Collect duplicate spectra into one dictionary. All shifts are collected into lists.
process_spectra
(data)[source]¶Turn the string representations found in sdf file into a dictionary.
skchem.data.converters.
Tox21Converter
(directory, output_directory, output_filename='tox21.h5')[source]¶Bases: skchem.data.converters.base.Converter
Class to build tox21 dataset.
skchem.data.converters.
ChEMBLConverter
(directory, output_directory, output_filename='chembl.h5')[source]¶Bases: skchem.data.converters.base.Converter
Converter for the ChEMBL dataset.