Skip to main content
Version: 1.5.0

DirectoryDataAsset

Signature

class great_expectations.datasource.fluent.data_asset.path.directory_asset.DirectoryDataAsset(
*,
name: <pydantic.v1.fields.DeferredType object at 0x7f37b9af61b0>,
type: <pydantic.v1.fields.DeferredType object at 0x7f37b9af6270>,
id: <pydantic.v1.fields.DeferredType object at 0x7f37b9af6330> = None,
order_by: <pydantic.v1.fields.DeferredType object at 0x7f37b9af63f0> = None,
batch_metadata: <pydantic.v1.fields.DeferredType object at 0x7f37b9af6540> = None,
batch_definitions: <pydantic.v1.fields.DeferredType object at 0x7f37b9af66f0> = None,
connect_options: <pydantic.v1.fields.DeferredType object at 0x7f37b9af67b0> = None,
data_directory: pathlib.Path,
**extra_data: typing.Any
)

Base class for PathDataAssets which batch by combining the contents of a directory.

Methods

add_batch_definition_daily

Signature

add_batch_definition_daily(
name: str,
column: str
) → BatchDefinition

Add a BatchDefinition, which creates a single Batch for each day in the directory.

Parameters

NameDescription

name

Name of the Batch Definition.

column

Column to partition on.

Returns

TypeDescription

BatchDefinition

A BatchDefinition that is partitioned daily.

add_batch_definition_monthly

Signature

add_batch_definition_monthly(
name: str,
column: str
) → BatchDefinition

Add a BatchDefinition which creates a single batch for each month in the directory.

Parameters

NameDescription

name

Name of the Batch Definition.

column

Column to partition on.

Returns

TypeDescription

BatchDefinition

A BatchDefinition that is partitioned monthly.

add_batch_definition_whole_directory

Signature

add_batch_definition_whole_directory(
name: str
) → BatchDefinition

Add a BatchDefinition which creates a single batch for the entire directory.

add_batch_definition_yearly

Signature

add_batch_definition_yearly(
name: str,
column: str
) → BatchDefinition

Add a BatchDefinition which creates a single batch for each year in the directory.

Parameters

NameDescription

name

Name of the Batch Definition.

column

Column to partition on.

Returns

TypeDescription

BatchDefinition

A BatchDefinition that is partitioned yearly.

delete_batch_definition

Signature

delete_batch_definition(
name: str
)None

Delete a batch definition.

Parameters

NameDescription

name (str)

Name of the BatchDefinition to delete.

get_batch_definition

Signature

get_batch_definition(
name: str
) → great_expectations.core.batch_definition.BatchDefinition

Get a batch definition.

Parameters

NameDescription

name (str)

Name of the BatchDefinition to get.

Raises

TypeDescription

KeyError

If the BatchDefinition does not exist.